Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)

Last update: Dec 27, 2022

Overview

GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)

[Project] [Paper] [Demo] [Related Work: A2RL (for Auto Image Cropping)] [Colab]
Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending

Overview

source	destination	mask	composited	blended

The author's implementation of GP-GAN, the high-resolution image blending algorithm described in:
"GP-GAN: Towards Realistic High-Resolution Image Blending"
Huikai Wu, Shuai Zheng, Junge Zhang, Kaiqi Huang

Given a mask, our algorithm can blend the source image and the destination image, generating a high-resolution and realsitic blended image. Our algorithm is based on deep generative models Wasserstein GAN.

Contact: Hui-Kai Wu ([email protected])

Citation

@article{wu2017gp,
  title   = {GP-GAN: Towards Realistic High-Resolution Image Blending},
  author  = {Wu, Huikai and Zheng, Shuai and Zhang, Junge and Huang, Kaiqi},
  journal = {ACMMM},
  year    = {2019}
}

Getting started

The code is tested with python==3.5 and chainer==6.3.0 on Ubuntu 16.04 LTS.

Download the code from GitHub:

git clone https://github.com/wuhuikai/GP-GAN.git
cd GP-GAN

Install the requirements:

pip install -r requirements/test/requirements.txt

Download the pretrained model blending_gan.npz or unsupervised_blending_gan.npz from Google Drive, and then put them in the folder models.

Run the script for blending_gan.npz:

python run_gp_gan.py --src_image images/test_images/src.jpg --dst_image images/test_images/dst.jpg --mask_image images/test_images/mask.png --blended_image images/test_images/result.png

Or run the script for unsupervised_blending_gan.npz:

python run_gp_gan.py --src_image images/test_images/src.jpg --dst_image images/test_images/dst.jpg --mask_image images/test_images/mask.png --blended_image images/test_images/result.png --supervised False

Type python run_gp_gan.py --help for a complete list of the arguments.

Train GP-GAN step by step

Train Blending GAN

Download Transient Attributes Dataset here.

Crop the images in each subfolder:

python crop_aligned_images.py --data_root [Path for imageAlignedLD in Transient Attributes Dataset]

Train Blending GAN:

python train_blending_gan.py --data_root [Path for cropped aligned images of Transient Attributes Dataset]

Training Curve
Visual Result

Training Set Validation Set

Training Unsupervised Blending GAN

Requirements

pip install git+git://github.com/mila-udem/[email protected]

Download the hdf5 dataset of outdoor natural images: ourdoor_64.hdf5 (1.4G), which contains 150K landscape images from MIT Places dataset.

Train unsupervised Blending GAN:

python train_wasserstein_gan.py --data_root [Path for outdoor_64.hdf5]

Training Curve
Samples after training

Visual results

Mask	Copy-and-Paste	Modified-Poisson	Multi-splines	Supervised GP-GAN	Unsupervised GP-GAN

Official Chainer implementation of GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)

Related tags

Overview

GP-GAN: Towards Realistic High-Resolution Image Blending (ACMMM 2019, oral)

Overview

Citation

Getting started

Train GP-GAN step by step

Train Blending GAN

Training Unsupervised Blending GAN

Visual results

Owner

Wu Huikai

SEOVER: Sentence-level Emotion Orientation Vector based Conversation Emotion Recognition Model

Multi Task Vision and Language

UPSNet: A Unified Panoptic Segmentation Network

Objective of the repository is to learn and build machine learning models using Pytorch. 30DaysofML Using Pytorch

Code accompanying paper: Meta-Learning to Improve Pre-Training

Implementation of H-UCRL Algorithm

Gym for multi-agent reinforcement learning

A keras implementation of ENet (abandoned for the foreseeable future)

[NeurIPS 2020] Blind Video Temporal Consistency via Deep Video Prior

[IEEE Transactions on Computational Imaging] Self-Gated Memory Recurrent Network for Efficient Scalable HDR Deghosting

Code to reproduce results from the paper "AmbientGAN: Generative models from lossy measurements"

An Open Source Machine Learning Framework for Everyone

Pytorch code for our paper Beyond ImageNet Attack: Towards Crafting Adversarial Examples for Black-box Domains)

TFOD-MASKRCNN - Tensorflow MaskRCNN With Python

Make a surveillance camera from your raspberry pi!

This is the research repository for Vid2Doppler: Synthesizing Doppler Radar Data from Videos for Training Privacy-Preserving Activity Recognition.

A PyTorch port of the Neural 3D Mesh Renderer

MegEngine implementation of YOLOX

CCNet: Criss-Cross Attention for Semantic Segmentation (TPAMI 2020 & ICCV 2019).

IDA file loader for UF2, created for the DEFCON 29 hardware badge