Official PyTorch implementation of GDWCT (CVPR 2019, oral)

Last update: Dec 02, 2022

Overview

This repository provides the official code of GDWCT, and it is written in PyTorch.

Paper

Image-to-Image Translation via Group-wise Deep Whitening-and-Coloring Transformation (link)
Wonwoong Cho¹⁾, Sungha Choi^1,2), David Keetae Park¹⁾, Inkyu Shin³⁾, Jaegul Choo¹⁾
¹⁾Korea University, ²⁾LG Electronics, ³⁾Hanyang University
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019 (Oral)

Additional resources for comprehending the paper

Presentation material
Poster (this will be uploaded soon)

Comparison with baselines on CelebA dataset

Comparison with baselines on Artworks dataset

Prerequisites

Python 3.6
PyTorch 0.4.0+
Linux and NVIDIA GPU + CUDA CuDNN

Instructions

Installation

git clone https://github.com/WonwoongCho/GDWCT.git
cd GDWCT

Dataset

Artworks dataset Please go to the github repository of CycleGAN (link) and download monet2photo, cezanne2photo, ukiyoe2photo, and vangogh2photo.
CelebA dataset Our data loader necessitates data whose subdirectories are composed of 'trainA', 'trainB', 'testA', and 'testB'. Hence, after downloading CelebA dataset, you need to preprocess CelebA data by separating the data according to a target attribute of a translation. i.e., A: Male, B: Female.
CelebA dataset can be easily downloaded with the following script.

bash download.sh celeba

BAM dataset Similar to CelebA, you need to preprocess the data after downloading. Downloading the data is possible if you fulfill a given task (segmentation labeling). Please go to the link in order to download it.

We wish to directly provide the data we used in the paper, however it cannot be allowed because the data is preprocessed. We apologize for this.

Train and Test

Settings and hyperparameters are set in the config.yaml file. Please refer to specific descriptions provided in the file as comments. After setting, GDWCT can be trained or tested by the following script (NOTE: the values of 'MODE', 'LOAD_MODEL', and 'START' should be changed if a user want to test the model.):

python run.py

Pretrained models

Run the script if you need to download pretrained models (Smile <=> Non-Smile), (Bangs <=> Non-Bangs). The pretrained models will be downloaded and unzipped into ./pretrained_models/ directory.

bash download.sh pretrained

In order to test the pretrained models, please change several options in the config file, as described in the script below.
If the name of a pretrained model is G_A_CelebA_Bangs_G4_320000.pth,

N_GROUP: 4
SAVE_NAME: CelebA_Bangs_G4
MODEL_SAVE_PATH: pretrained_models/
START: 320000
LOAD_MODEL: True
MODE: test

Results

Citation

Please cite our paper if our work including this code is helpful for your research.

@InProceedings{GDWCT2019,
author = {Wonwoong Cho, Sungha Choi, David Keetae Park, Inkyu Shin, Jaegul Choo},
title = {Image-to-Image Translation via Group-wise Deep Whitening-and-Coloring Transformation},
booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition (CVPR)},
year = {2019}
}

Official PyTorch implementation of GDWCT (CVPR 2019, oral)

Related tags

Overview

Paper

Additional resources for comprehending the paper

Comparison with baselines on CelebA dataset

Comparison with baselines on Artworks dataset

Prerequisites

Instructions

Installation

Dataset

Train and Test

Pretrained models

Results

Citation

Owner

WonwoongCho

PyTorch implementation of MICCAI 2018 paper "Liver Lesion Detection from Weakly-labeled Multi-phase CT Volumes with a Grouped Single Shot MultiBox Detector"

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

A library for augmentation of a YOLO-formated dataset

Deep Halftoning with Reversible Binary Pattern

Model Zoo for MindSpore

Tensorboard for pytorch (and chainer, mxnet, numpy, ...)

Code for ICDM2020 full paper: "Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning"

A crossplatform menu bar application using mpv as DLNA Media Renderer.

Towards Rolling Shutter Correction and Deblurring in Dynamic Scenes (CVPR2021)

Collection of machine learning related notebooks to share.

SpinalNet: Deep Neural Network with Gradual Input

Materials for my scikit-learn tutorial

Tutorial page of the Climate Hack, the greatest hackathon ever

Related resources for our EMNLP 2021 paper

A python software that can help blind people find things like laptops, phones, etc the same way a guide dog guides a blind person in finding his way.

Repository For Programmers Seeking a platform to show their skills

Implementation for Learning to Track with Object Permanence

Pose Detection and Machine Learning for real-time body posture analysis during exercise to provide audiovisual feedback on improvement of form.

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose (CVPR 2021)

FMA: A Dataset For Music Analysis