[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

Last update: Dec 11, 2022

Related tags

Deep Learning CTSDG

Overview

CTSDG

Paper | Pre-trained Models | BibTex

Image Inpainting via Conditional Texture and Structure Dual Generation

Xiefan Guo, Hongyu Yang, Di Huang
In ICCV'2021

Introduction

Generator. Image inpainting is cast into two subtasks, i.e., structure-constrained texture synthesis (left, blue) and texture-guided structure reconstruction (right, red), and the two parallel-coupled streams borrow encoded deep features from each other. The Bi-GFF module and CFA module are stacked at the end of the generator to further refine the results.

Discriminator. The texture branch estimates the generated texture, while the structure branch guides structure reconstruction.

Prerequisites

Python >= 3.6
PyTorch >= 1.0
NVIDIA GPU + CUDA cuDNN

Getting Started

Installation

Clone this repo:

git clone https://github.com/Xiefan-Guo/CTSDG.git
cd CTSDG

Install PyTorch and dependencies from http://pytorch.org
Install python requirements:

pip install -r requirements.txt

Datasets

Image Dataset. We evaluate the proposed method on the CelebA, Paris StreetView, and Places2 datasets, which are widely adopted in the literature.

Mask Dataset. Irregular masks are obtained from Irregular Masks and classified based on their hole sizes relative to the entire image with an increment of 10%.

Training

Analogous to PConv by Liu et.al, initial training followed by finetuning are performed.

python train.py \
  --image_root [path to image directory] \
  --mask_root [path mask directory]

python train.py \
  --image_root [path to image directory] \
  --mask_root [path to mask directory] \
  --pre_trained [path to checkpoints] \
  --finetune True

Distributed training support. You can train model in distributed settings.

python -m torch.distributed.launch --nproc_per_node=N_GPU train.py

Testing

To test the model, you run the following code.

python test.py \
  --pre_trained [path to checkpoints] \
  --image_root [path to image directory] \
  --mask_root [path to mask directory] \
  --result_root [path to output directory] \
  --number_eval [number of images to test]

Citation

If any part of our paper and repository is helpful to your work, please generously cite with:

@InProceedings{Guo_2021_ICCV,
    author    = {Guo, Xiefan and Yang, Hongyu and Huang, Di},
    title     = {Image Inpainting via Conditional Texture and Structure Dual Generation},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {14134-14143}
}

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

Related tags

Overview

CTSDG

Paper | Pre-trained Models | BibTex

Introduction

Prerequisites

Getting Started

Installation

Datasets

Training

Testing

Citation

Owner

Xiefan Guo

Code for a real-time distributed cooperative slam(RDC-SLAM) system for ROS compatible platforms.

Open source implementation of "A Self-Supervised Descriptor for Image Copy Detection" (SSCD).

Super-BPD: Super Boundary-to-Pixel Direction for Fast Image Segmentation (CVPR 2020)

An Extendible (General) Continual Learning Framework based on Pytorch - official codebase of Dark Experience for General Continual Learning

This is a pytorch implementation for the BST model from Alibaba https://arxiv.org/pdf/1905.06874.pdf

AdaSpeech 2: Adaptive Text to Speech with Untranscribed Data

A Collection of LiDAR-Camera-Calibration Papers, Toolboxes and Notes

⚡️Optimizing einsum functions in NumPy, Tensorflow, Dask, and more with contraction order optimization.

Differentiable Surface Triangulation

The ICS Chat System project for NYU Shanghai Fall 2021

This is a collection of our NAS and Vision Transformer work.

Implementation detail for paper "Multi-level colonoscopy malignant tissue detection with adversarial CAC-UNet"

A hifiasm fork for metagenome assembly using Hifi reads.

PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Bayesian Inference Tools in Python

FACIAL: Synthesizing Dynamic Talking Face With Implicit Attribute Learning. ICCV, 2021.

Corgis are the cutest creatures; have 30K of them!

ImageNet Adversarial Image Evaluation

Improving Calibration for Long-Tailed Recognition (CVPR2021)

Large scale and asynchronous Hyperparameter Optimization at your fingertip.