The Official TensorFlow Implementation for SPatchGAN (ICCV2021)

Last update: Dec 30, 2022

Related tags

Deep Learning SPatchGAN

Overview

SPatchGAN: Official TensorFlow Implementation

Paper

"SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation" (ICCV 2021)
- arxiv

Environment

CUDA 10.0
Python 3.6
pip install -r requirements.txt

Dataset

Dataset structure (dataset_struct='plain')

- dataset
    - <dataset_name>
        - trainA
            - 1.jpg
            - 2.jpg
            - ...
        - trainB
            - 3.jpg
            - 4.jpg
            - ...
        - testA
            - 5.jpg
            - 6.jpg
            - ...
        - testB
            - 7.jpg
            - 8.jpg
            - ...

Supported extensions: jpg, jpeg, png
An additional level of subdirectories is also supported by setting dataset_struct to 'tree', e.g.,

- trainA
    - subdir1
        - 1.jpg
        - 2.jpg
        - ...
    - subdir2
        - ...

Selfie-to-anime:
- The dataset can be downloaded from U-GAT-IT.
Male-to-female and glasses removal:
- The datasets can be downloaded from Council-GAN.
- The images must be center cropped from 218x178 to 178x178 before training or testing.
- For glasses removal, only the male images are used in the experiments in our paper. Note that the dataset from Council-GAN has already been split into two subdirectories, "1" for male and "2" for female.

Training

Set the suffix to anything descriptive, e.g., the date.
Selfie-to-Anime

python main.py --dataset selfie2anime --augment_type resize_crop --n_scales_dis 3 --suffix scale3_cyc20_20210831 --phase train

Male-to-Female

python main.py --dataset male2female --cyc_weight 10 --suffix cyc10_20210831 --phase train

Glasses Removal

python main.py --dataset glasses-male --cyc_weight 30 --suffix cyc30_20210831 --phase train

Find the output in ./output/SPatchGAN_<dataset_name>_<suffix>
The same command can be used to continue training based on the latest checkpoint.
For a new task, we recommend to use the default setting as the starting point, and adjust the hyperparameters according to the tips.
Check configs.py for all the hyperparameters.

Testing with the latest checkpoint

Replace --phase train with --phase test

Save a frozen model (.pb)

Replace --phase train with --phase freeze_graph
Find the saved frozen model in ./output/SPatchGAN_<dataset_name>_<suffix>/checkpoint/pb

Testing with the frozon model

cd frozen_model
python test_frozen_model.py --image <input_image_or_dir> --output_dir <output_dir> --model <frozen_model_path>

Pretrained Models

Download the pretrained models from google drive, and put them in the output directory.
You can test the checkpoints (in ./checkpoint) or the frozen models (in ./checkpoint/pb). Either way produces the same results.
The results generated by the pretrained models are slightly different from those in the paper, since we have rerun the training after code refactoring.
We set n_scales_dis to 3 for the pretrained selfie2anime model to further improve the performance. It was 4 in the paper. See more details in the tips.
We also provide the generated results of the last 100 test images (in ./gen, sorted by name, no cherry-picking) for the calibration purpose.

Other Implementations

We provide a PyTorch implementation of the SPatchGAN discriminator in spatchgan_discriminator_pytorch.py.

Citation

@inproceedings{SPatchGAN2021,
  title={SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation},
  author={Xuning Shao and Weidong Zhang},
  booktitle={IEEE International Conference on Computer Vision (ICCV)},
  year={2021}
}

Acknowledgement

Our code is partially based on U-GAT-IT.

The Official TensorFlow Implementation for SPatchGAN (ICCV2021)

Related tags

Overview

SPatchGAN: Official TensorFlow Implementation

Paper

Environment

Dataset

Training

Testing with the latest checkpoint

Save a frozen model (.pb)

Testing with the frozon model

Pretrained Models

Other Implementations

Citation

Acknowledgement

Owner

Using machine learning to predict undergrad college admissions.

Pytorch implementation for "Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets" (ECCV 2020 Spotlight)

Code for "Steerable Pyramid Transform Enables Robust Left Ventricle Quantification"

Dynamic Bottleneck for Robust Self-Supervised Exploration

BraTs-VNet - BraTS(Brain Tumour Segmentation) using V-Net

PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Combine Tacotron2 and Hifi GAN to generate speech from text

Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context Code in both PyTorch and TensorFlow

Official PyTorch implementation of MX-Font (Multiple Heads are Better than One: Few-shot Font Generation with Multiple Localized Experts)

A new video text spotting framework with Transformer

A large-scale video dataset for the training and evaluation of 3D human pose estimation models

This is the official implementation of the paper "Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation".

TensorFlow implementation of original paper : https://github.com/hszhao/PSPNet

[Nature Machine Intelligence' 21] "Advancing COVID-19 Diagnosis with Privacy-Preserving Collaboration in Artificial Intelligence"

Vowpal Wabbit is a machine learning system which pushes the frontier of machine learning with techniques such as online, hashing, allreduce, reductions, learning2search, active, and interactive learning.

Fit Fast, Explain Fast

The official implementation of ELSA: Enhanced Local Self-Attention for Vision Transformer

EMNLP 2021 paper The Devil is in the Detail: Simple Tricks Improve Systematic Generalization of Transformers.

Voice of Pajlada with model and weights.

The NEOSSat is a dual-mission microsatellite designed to detect potentially hazardous Earth-orbit-crossing asteroids and track objects that reside in deep space