AttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation

Last update: Dec 27, 2022

Overview

AttentionGAN-v2 for Unpaired Image-to-Image Translation

AttentionGAN-v2 Framework

The proposed generator learns both foreground and background attentions. It uses the foreground attention to select from the generated output for the foreground regions, while uses the background attention to maintain the background information from the input image. Please refer to our papers for more details.

Comparsion with State-of-the-Art Methods

Selfie To Anime Translation

Horse to Zebra Translation

Zebra to Horse Translation

Apple to Orange Translation

Orange to Apple Translation

Map to Aerial Photo Translation

Aerial Photo to Map Translation

Style Transfer

Visualization of Learned Attention Masks

Selfie to Anime Translation

Horse to Zebra Translation

Zebra to Horse Translation

Apple to Orange Translation

Orange to Apple Translation

Map to Aerial Photo Translation

Aerial Photo to Map Translation

Extended Paper | Conference Paper

AttentionGAN: Unpaired Image-to-Image Translation using Attention-Guided Generative Adversarial Networks.
Hao Tang¹, Hong Liu², Dan Xu³, Philip H.S. Torr³ and Nicu Sebe¹.
¹University of Trento, Italy, ²Peking University, China, ³University of Oxford, UK.
In TNNLS 2021 & IJCNN 2019 Oral.
The repository offers the official implementation of our paper in PyTorch.

Are you looking for AttentionGAN-v1 for Unpaired Image-to-Image Translation?

Paper | Code

Are you looking for AttentionGAN-v1 for Multi-Domain Image-to-Image Translation?

Paper | Code

Facial Expression-to-Expression Translation

Order: The Learned Attention Masks, The Learned Content Masks, Final Results

Facial Attribute Transfer

Order: The Learned Attention Masks, The Learned Content Masks, Final Results

Order: The Learned Attention Masks, AttentionGAN, StarGAN

License

The code is released for academic research use only. For commercial use, please contact [email protected].

Installation

Clone this repo.

git clone https://github.com/Ha0Tang/AttentionGAN
cd AttentionGAN/

This code requires PyTorch 0.4.1+ and python 3.6.9+. Please install dependencies by

pip install -r requirements.txt (for pip users)

./scripts/conda_deps.sh (for Conda users)

To reproduce the results reported in the paper, you would need an NVIDIA Tesla V100 with 16G memory.

Dataset Preparation

Download the datasets using the following script. Please cite their paper if you use the data. Try twice if it fails the first time!

sh ./datasets/download_cyclegan_dataset.sh dataset_name

The selfie2anime dataset can be download here.

AttentionGAN Training/Testing

Download a dataset using the previous script (e.g., horse2zebra).
To view training results and loss plots, run python -m visdom.server and click the URL http://localhost:8097.
Train a model:

sh ./scripts/train_attentiongan.sh

To see more intermediate results, check out ./checkpoints/horse2zebra_attentiongan/web/index.html.
How to continue train? Append --continue_train --epoch_count xxx on the command line.
Test the model:

sh ./scripts/test_attentiongan.sh

The test results will be saved to a html file here: ./results/horse2zebra_attentiongan/latest_test/index.html.

Generating Images Using Pretrained Model

You need download a pretrained model (e.g., horse2zebra) with the following script:

sh ./scripts/download_attentiongan_model.sh horse2zebra

The pretrained model is saved at ./checkpoints/{name}_pretrained/latest_net_G.pth.
Then generate the result using

python test.py --dataroot ./datasets/horse2zebra --name horse2zebra_pretrained --model attention_gan --dataset_mode unaligned --norm instance --phase test --no_dropout --load_size 256 --crop_size 256 --batch_size 1 --gpu_ids 0 --num_test 5000 --epoch latest --saveDisk

The results will be saved at ./results/. Use --results_dir {directory_path_to_save_result} to specify the results directory. Note that if you want to save the intermediate results and have enough disk space, remove --saveDisk on the command line.

For your own experiments, you might want to specify --netG, --norm, --no_dropout to match the generator architecture of the trained model.

Image Translation with Geometric Changes Between Source and Target Domains

For instance, if you want to run experiments of Selfie to Anime Translation. Usage: replace attention_gan_model.py and networks with the ones in the AttentionGAN-geo folder.

Test the Pretrained Model

Download data and pretrained model according above instructions.

python test.py --dataroot ./datasets/selfie2anime/ --name selfie2anime_pretrained --model attention_gan --dataset_mode unaligned --norm instance --phase test --no_dropout --load_size 256 --crop_size 256 --batch_size 1 --gpu_ids 0 --num_test 5000 --epoch latest

Train a New Model

python train.py --dataroot ./datasets/selfie2anime/ --name selfie2anime_attentiongan --model attention_gan --dataset_mode unaligned --pool_size 50 --no_dropout --norm instance --lambda_A 10 --lambda_B 10 --lambda_identity 0.5 --load_size 286 --crop_size 256 --batch_size 4 --niter 100 --niter_decay 100 --gpu_ids 0 --display_id 0 --display_freq 100 --print_freq 100

Test the Trained Model

python test.py --dataroot ./datasets/selfie2anime/ --name selfie2anime_attentiongan --model attention_gan --dataset_mode unaligned --norm instance --phase test --no_dropout --load_size 256 --crop_size 256 --batch_size 1 --gpu_ids 0 --num_test 5000 --epoch latest

Evaluation Code

FID: Official Implementation
KID or Here: Suggested by UGATIT. Install Steps: conda create -n python36 pyhton=3.6 anaconda and pip install --ignore-installed --upgrade tensorflow==1.13.1. If you encounter the issue AttributeError: module 'scipy.misc' has no attribute 'imread', please do pip install scipy==1.1.0.

Citation

If you use this code for your research, please cite our papers.

@article{tang2021attentiongan,
  title={AttentionGAN: Unpaired Image-to-Image Translation using Attention-Guided Generative Adversarial Networks},
  author={Tang, Hao and Liu, Hong and Xu, Dan and Torr, Philip HS and Sebe, Nicu},
  journal={IEEE Transactions on Neural Networks and Learning Systems (TNNLS)},
  year={2021} 
}

@inproceedings{tang2019attention,
  title={Attention-Guided Generative Adversarial Networks for Unsupervised Image-to-Image Translation},
  author={Tang, Hao and Xu, Dan and Sebe, Nicu and Yan, Yan},
  booktitle={International Joint Conference on Neural Networks (IJCNN)},
  year={2019}
}

Acknowledgments

This source code is inspired by CycleGAN, GestureGAN, and SelectionGAN.

Contributions

If you have any questions/comments/bug reports, feel free to open a github issue or pull a request or e-mail to the author Hao Tang ([email protected]).

Collaborations

I'm always interested in meeting new people and hearing about potential collaborations. If you'd like to work together or get in contact with me, please email [email protected]. Some of our projects are listed here.

Figure out what you like. Try to become the best in the world of it.

AttentionGAN for Unpaired Image-to-Image Translation & Multi-Domain Image-to-Image Translation

Related tags

Overview

AttentionGAN-v2 for Unpaired Image-to-Image Translation

AttentionGAN-v2 Framework

Comparsion with State-of-the-Art Methods

Selfie To Anime Translation

Horse to Zebra Translation

Zebra to Horse Translation

Apple to Orange Translation

Orange to Apple Translation

Map to Aerial Photo Translation

Aerial Photo to Map Translation

Style Transfer

Visualization of Learned Attention Masks

Selfie to Anime Translation

Horse to Zebra Translation

Zebra to Horse Translation

Apple to Orange Translation

Orange to Apple Translation

Map to Aerial Photo Translation

Aerial Photo to Map Translation

Extended Paper | Conference Paper

Are you looking for AttentionGAN-v1 for Unpaired Image-to-Image Translation?

Are you looking for AttentionGAN-v1 for Multi-Domain Image-to-Image Translation?

Facial Expression-to-Expression Translation

Facial Attribute Transfer

License

Installation

Dataset Preparation

AttentionGAN Training/Testing

Generating Images Using Pretrained Model

Image Translation with Geometric Changes Between Source and Target Domains

Test the Pretrained Model

Train a New Model

Test the Trained Model

Evaluation Code

Citation

Acknowledgments

Contributions

Collaborations

Owner

Hao Tang

Code to accompany the paper "Finding Bipartite Components in Hypergraphs", which is published in NeurIPS'21.

The Environment I built to study Reinforcement Learning + Pokemon Showdown

DvD-TD3: Diversity via Determinants for TD3 version

Pythonic particle-based (super-droplet) warm-rain/aqueous-chemistry cloud microphysics package with box, parcel & 1D/2D prescribed-flow examples in Python, Julia and Matlab

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

ChebLieNet, a spectral graph neural network turned equivariant by Riemannian geometry on Lie groups.

Learning What and Where to Draw

competitions-v2

COLMAP - Structure-from-Motion and Multi-View Stereo

Code for unmixing audio signals in four different stems "drums, bass, vocals, others". The code is adapted from "Jukebox: A Generative Model for Music"

Save-restricted-v-3 - Save restricted content Bot For telegram

Accelerated Multi-Modal MR Imaging with Transformers

Scripts and a shader to get you started on setting up an exported Koikatsu character in Blender.

Tutorial in Python targeted at Epidemiologists. Will discuss the basics of analysis in Python 3

Ladder Variational Autoencoders (LVAE) in PyTorch

LinkNet - This repository contains our Torch7 implementation of the network developed by us at e-Lab.

PyTorch implementation of "VRT: A Video Restoration Transformer"

Measure WWjj polarization fraction

The software associated with a paper accepted at EMNLP 2021 titled "Open Knowledge Graphs Canonicalization using Variational Autoencoders".

Github for the conference paper GLOD-Gaussian Likelihood OOD detector