The first dataset on shadow generation for the foreground object in real-world scenes.

Last update: Dec 30, 2022

Overview

Object-Shadow-Generation-Dataset-DESOBA

Object Shadow Generation is to deal with the shadow inconsistency between the foreground object and the background in a composite image, that is, generating shadow for the foreground object according to background information, to make the composite image more realistic.

Our dataset DESOBA is a synthesized dataset for Object Shadow Generation. We build our dataset on the basis of Shadow-OBject Association dataset SOBA, which collects real-world images in complex scenes and provides annotated masks for object-shadow pairs. Based on SOBA dataset, we remove all the shadows to construct our DEshadowed Shadow-OBject Association(DESOBA) dataset, which can be used for shadow generation task and other shadow-related tasks as well. We illustrate the process of our DESOBA dataset construction based on SOBA dataset in the figure below.

Illustration of DESOBA dataset construction: The green arrows illustrate the process of acquiring paired data for training and evaluation. Given a ground-truth target image I_g, we manually remove all shadows to produce a deshadowed image I_d. Then, we randomly select a foreground object in I_g, and replace its shadow area with the counterpart in I_d to synthesize a composite image I_c without foreground shadow. I_c and I_g form a pair of input composite image and ground-truth target image. The red arrow illustrates our shadow generation task. Given I_c and its foreground mask M_fo, we aim to generate the target image I_g with foreground shadow.

Our DESOBA dataset contains 840 training images with totally 2,999 object-shadow pairs and 160 test images with totally 624 object-shadow pairs. The DESOBA dataset is provided in Baidu Cloud (access code: sipx), or Google Drive.

Prerequisites

Python
Pytorch
PIL

Getting Started

Installation

Clone this repo:

git clone https://github.com/bcmi/Object-Shadow-Generation-Dataset-DESOBA.git
cd Object-Shadow-Generation-Dataset-DESOBA

Download the DESOBA dataset.
We provide the code of obtaining training/testing tuples, each tuple contains foreground object mask, foreground shadow mask, background object mask, background shadow mask, shadow image, and synthetic composite image without foreground shadow mask. The dataloader is available in /data_processing/data/DesobaSyntheticImageGeneration_dataset.py, which can be used as dataloader in training phase or testing phase.
We also provide the code of visualization of training/testing tuple, run:

python Vis_Desoba_Dataset.py

Vis_Desoba_Dataset.py is available in /data_processing/.

We show some examples of training/testing tuples in below:

from left to right: synthetic composite image without foreground shadow, target image with foreground shadow, foreground object mask, foreground shadow mask, background object mask, and background shadow mask.

Bibtex

If you find this work is useful for your research, please cite our paper using the following BibTeX [arxiv]:

@article{hong2021shadow,
  title={Shadow Generation for Composite Image in Real-world Scenes},
  author={Hong, Yan and Niu, Li and Zhang, Jianfu and Zhang, Liqing},
  journal={arXiv preprint arXiv:2104.10338},
  year={2021}
}

The first dataset on shadow generation for the foreground object in real-world scenes.

Related tags

Overview

Object-Shadow-Generation-Dataset-DESOBA

Prerequisites

Getting Started

Installation

Bibtex

Owner

BCMI

PyTorch code for Composing Partial Differential Equations with Physics-Aware Neural Networks

Implementation for Shape from Polarization for Complex Scenes in the Wild

NExT-QA: Next Phase of Question-Answering to Explaining Temporal Actions (CVPR2021)

Milano is a tool for automating hyper-parameters search for your models on a backend of your choice.

Group project for MFIN7036. Our goal is to predict firm profitability with text-based competition measures.

TensorFlow ROCm port

Shuwa Gesture Toolkit is a framework that detects and classifies arbitrary gestures in short videos

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

An NVDA add-on to split screen reader and audio from other programs to different sound channels

Source code for our EMNLP'21 paper 《Raise a Child in Large Language Model: Towards Effective and Generalizable Fine-tuning》

Tutorial in Python targeted at Epidemiologists. Will discuss the basics of analysis in Python 3

Neural Scene Flow Prior (NeurIPS 2021 spotlight)

https://arxiv.org/abs/2102.11005

Source code for PairNorm (ICLR 2020)

Project page for the paper Semi-Supervised Raw-to-Raw Mapping 2021.

Streamlit Tutorial (ex: stock price dashboard, cartoon-stylegan, vqgan-clip, stylemixing, styleclip, sefa)

Exponential Graph is Provably Efficient for Decentralized Deep Training

PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, EfficientNetV2, NFNet, Vision Transformer, MixNet, MobileNet-V3/V2, RegNet, DPN, CSPNet, and more

The official implementation for "FQ-ViT: Fully Quantized Vision Transformer without Retraining".

ELSED: Enhanced Line SEgment Drawing