[NeurIPS 2021] Low-Rank Subspaces in GANs

Last update: Dec 28, 2022

Related tags

Overview

Low-Rank Subspaces in GANs

Figure: Image editing results using LowRankGAN on StyleGAN2 (first three columns) and BigGAN (last column).

Low-Rank Subspaces in GANs
Jiapeng Zhu, Ruili Feng, Yujun Shen, Deli Zhao, Zhengjun Zha, Jingren Zhou, Qifeng Chen
Conference on Neural Information Processing Systems (NeurIPS)

In the repository, we propose LowRankGAN to locally control the image synthesis from GANs with the novel low-rank subspaces. Concretely, we first relate the image regions with the latent space with the help of Jacobian. We then perform low-rank factorization on the Jacobian to get the principal and null spaces. We finally project the principal space w.r.t. the region of interest onto the null space w.r.t. the rest region. In this way, by altering the latent codes along the directions within the projected space, which we call low-rank subspaces, we manage to precisely control the region of interest yet barely affect the rest region.

[Paper] [Project Page] [Demo]

[Stay Tuned] We are preparing the PyTorch code!

Examples of Local Editing
Eyes	Mouth	Nose	Hair

Manipulate with Provided Directions

We have already provided some directions under the directory directions/. Users can easily use these directions for image local editing.

MODEL_PATH='stylegan2-ffhq-config-f-1024x1024.pkl'
DIRECTION='directions/ffhq1024/eyes_size.npy'
python manipulate.py $MODEL_PATH $DIRECTION

Find More Directions

We also provide the code for users to find customized directions. Please follow the steps below.

Step-0: Prepare the pre-trained generator

Here, we use the FFHQ model officially released in StyleGAN2 as an example. Please download it first.

Step-1: Compute Jacobian with random syntheses

MODEL_PATH='stylegan2-ffhq-config-f-1024x1024.pkl'
python compute_jacobian.py $MODEL_PATH

Step-2: Compute the directions from the Jacobian

JACOBIAN_PATH='outputs/jacobian_seed_4/w_dataset_ffhq.npy'
python compute_directions.py $JACOBIAN_PATH

Step-3: Verify the directions through image manipulation

MODEL_PATH='stylegan2-ffhq-config-f-1024x1024.pkl'
DIRECTION_PATH='outputs/directions/${DIRECTION_NAME}'
python manipulate.py $MODEL_PATH $DIRECTION

BibTeX

@inproceedings{zhu2021lowrankgan,
  title     = {Low-Rank Subspaces in {GAN}s},
  author    = {Zhu, Jiapeng and Feng, Ruili and Shen, Yujun and Zhao, Deli and Zha, Zhengjun and Zhou, Jingren and Chen, Qifeng},
  booktitle = {Advances in Neural Information Processing Systems (NeurIPS)},
  year      = {2021}
}

[NeurIPS 2021] Low-Rank Subspaces in GANs

Related tags

Overview

Low-Rank Subspaces in GANs

Manipulate with Provided Directions

Find More Directions

Step-0: Prepare the pre-trained generator

Step-1: Compute Jacobian with random syntheses

Step-2: Compute the directions from the Jacobian

Step-3: Verify the directions through image manipulation

BibTeX

Owner

Official Repository of NeurIPS2021 paper: PTR

Trustworthy AI related projects

Official Repository for our ECCV2020 paper: Imbalanced Continual Learning with Partitioning Reservoir Sampling

Rethinking the U-Net architecture for multimodal biomedical image segmentation

CPF: Learning a Contact Potential Field to Model the Hand-object Interaction

State-of-the-art language models can match human performance on many tasks

Episodic-memory - Ego4D Episodic Memory Benchmark

Fast and Context-Aware Framework for Space-Time Video Super-Resolution (VCIP 2021)

Instance-Dependent Partial Label Learning

[SIGGRAPH 2021 Asia] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning

Code for reproducing experiments in "Improved Training of Wasserstein GANs"

Code for "ShineOn: Illuminating Design Choices for Practical Video-based Virtual Clothing Try-on", accepted at WACV 2021 Generation of Human Behavior Workshop.

Get started learning C# with C# notebooks powered by .NET Interactive and VS Code.

TAug :: Time Series Data Augmentation using Deep Generative Models

A curated list of resources for Image and Video Deblurring

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Turning pixels into virtual points for multimodal 3D object detection.

pytorch implementation of dftd2 & dftd3

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

[SIGGRAPH Asia 2021] Pose with Style: Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN

[NeurIPS 2021] Low-Rank Subspaces in GANs

Related tags

Overview

Low-Rank Subspaces in GANs

Manipulate with Provided Directions

Find More Directions

Step-0: Prepare the pre-trained generator

Step-1: Compute Jacobian with random syntheses

Step-2: Compute the directions from the Jacobian

Step-3: Verify the directions through image manipulation

BibTeX

Owner

Official Repository of NeurIPS2021 paper: PTR

Trustworthy AI related projects

Official Repository for our ECCV2020 paper: Imbalanced Continual Learning with Partitioning Reservoir Sampling

Rethinking the U-Net architecture for multimodal biomedical image segmentation

CPF: Learning a Contact Potential Field to Model the Hand-object Interaction

State-of-the-art language models can match human performance on many tasks

Episodic-memory - Ego4D Episodic Memory Benchmark

Fast and Context-Aware Framework for Space-Time Video Super-Resolution (VCIP 2021)

Instance-Dependent Partial Label Learning

[SIGGRAPH 2021 Asia] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning

Code for reproducing experiments in "Improved Training of Wasserstein GANs"

Code for "ShineOn: Illuminating Design Choices for Practical Video-based Virtual Clothing Try-on", accepted at WACV 2021 Generation of Human Behavior Workshop.

Get started learning C# with C# notebooks powered by .NET Interactive and VS Code.

TAug :: Time Series Data Augmentation using Deep Generative Models

A curated list of resources for Image and Video Deblurring

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long*, Evan Shelhamer*, and Trevor Darrell. CVPR 2015 and PAMI 2016.

Turning pixels into virtual points for multimodal 3D object detection.

pytorch implementation of dftd2 & dftd3

[ICCV'2021] Image Inpainting via Conditional Texture and Structure Dual Generation

[SIGGRAPH Asia 2021] Pose with Style: Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN

Fully Convolutional Networks for Semantic Segmentation by Jonathan Long, Evan Shelhamer, and Trevor Darrell. CVPR 2015 and PAMI 2016.