Pytorch implementation of MixNMatch

Last update: Dec 30, 2022

Overview

MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation
[Paper]

Yuheng Li, Krishna Kumar Singh, Utkarsh Ojha, Yong Jae Lee
UC Davis
In CVPR, 2020

1/31/2020 update: Code and models released.

Demo Video

This is our CVPR2020 presentation video link

Web Demo

For interactive web demo click here. This web demo is created by Yang Xue.

Requirements

Linux
Python 3.7
Pytorch 1.3.1
NVIDIA GPU + CUDA CuDNN

Getting started

Clone the repository

git clone https://github.com/Yuheng-Li/MixNMatch.git
cd MixNMatch

Setting up the data

Download the formatted CUB data from this link and extract it inside the data directory

Downloading pretrained models

Pretrained models for CUB, Dogs and Cars are available at this link. Download and extract them in the models directory.

Evaluating the model

In code

Run python eval.py --z path_to_pose_source_images --b path_to_bg_source_images --p path_to_shape_source_images --c path_to_color_source_images --out path_to_ourput --mode code_or_feature --models path_to_pretrained_models
For example python eval.py --z pose/pose-1.png --b background/background-1.png --p shape/shape-1.png --c color/color.png --mode code --models ../models --out ./code-1.png
- NOTE:(1) in feature mode pose source images will be ignored; (2) Generator, Encoder and Feature_extractor in models folder should be named as G.pth, E.pth and EX.pth

Training your own model

In code/config.py:

Specify the dataset location in DATA_DIR.
- NOTE: If you wish to train this on your own (different) dataset, please make sure it is formatted in a way similar to the CUB dataset that we've provided.
Specify the number of super and fine-grained categories that you wish for FineGAN to discover, in SUPER_CATEGORIES and FINE_GRAINED_CATEGORIES.
For the first stage training run python train_first_stage.py output_name
For the second stage training run python train_second_stage.py output_name path_to_pretrained_G path_to_pretrained_E
- NOTE: output will be in output/output_name
- NOTE: path_to_pretrained_G will be output/output_name/Model/G_0.pth
- NOTE: path_to_pretrained_E will be output/output_name/Model/E_0.pth
For example python train_second_stage.py Second_stage ../output/output_name/Model/G_0.pth ../output/output_name/Model/E_0.pth

Results

1. Extracting all factors from differnet real images to synthesize a new image

2. Comparison between the feature and code mode

3. Manipulating real images by varying a single factor

4. Inferring style from unseen data

Cartoon -> image	Sketch -> image

5. Converting a reference image according to a reference video

Citation

If you find this useful in your research, consider citing our work:

@inproceedings{li-cvpr2020,
  title = {MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation},
  author = {Yuheng Li and Krishna Kumar Singh and Utkarsh Ojha and Yong Jae Lee},
  booktitle = {CVPR},
  year = {2020}
}

Pytorch implementation of MixNMatch

Related tags

Overview

MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation [Paper]

Demo Video

Web Demo

Requirements

Getting started

Clone the repository

Setting up the data

Downloading pretrained models

Evaluating the model

Training your own model

Results

1. Extracting all factors from differnet real images to synthesize a new image

2. Comparison between the feature and code mode

3. Manipulating real images by varying a single factor

4. Inferring style from unseen data

5. Converting a reference image according to a reference video

Citation

Owner

[NeurIPS 2021] Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data

BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment

MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.

An implementation of the "Attention is all you need" paper without extra bells and whistles, or difficult syntax

Exploring Classification Equilibrium in Long-Tailed Object Detection, ICCV2021

An original implementation of "MetaICL Learning to Learn In Context" by Sewon Min, Mike Lewis, Luke Zettlemoyer and Hannaneh Hajishirzi

Duke Machine Learning Winter School: Computer Vision 2022

PyTorch implementation of paper “Unbiased Scene Graph Generation from Biased Training”

OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion.

This repository contains the code and models necessary to replicate the results of paper: How to Robustify Black-Box ML Models? A Zeroth-Order Optimization Perspective

Catalyst.Detection

CLIP + VQGAN / PixelDraw

Neural Scene Graphs for Dynamic Scene (CVPR 2021)

Convert Pytorch model to onnx or tflite, and the converted model can be visualized by Netron

NEG loss implemented in pytorch

Learn the Deep Learning for Computer Vision in three steps: theory from base to SotA, code in PyTorch, and space-repetition with Anki

Generic template to bootstrap your PyTorch project with PyTorch Lightning, Hydra, W&B, and DVC.

A Unified Generative Framework for Various NER Subtasks.

VIMPAC: Video Pre-Training via Masked Token Prediction and Contrastive Learning

Pre-trained model, code, and materials from the paper "Impact of Adversarial Examples on Deep Learning Models for Biomedical Image Segmentation" (MICCAI 2019).

MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation
[Paper]