Code to reproduce the results in the paper "Tensor Component Analysis for Interpreting the Latent Space of GANs".

Last update: Jun 17, 2022

Related tags

Deep Learning TCA-latent-space

Overview

Tensor Component Analysis for Interpreting the Latent Space of GANs

[ paper | project page ]

Code to reproduce the results in the paper "Tensor Component Analysis for Interpreting the Latent Space of GANs".

dependencies

Firstly, to install the required packages, please run:

$ pip install -r requirements.txt

Pretrained weights

To replicate the results in the paper, you'll need to first download the pre-trained weights. To do so, simply run this from the command line:

./download_weights.sh

Quantitative results

building the prediction matrices

To reproduce Fig. 5, one can then run the ./quant.ipynb notebook using the pre-computed classification scores (please see this notebook for more details).

manually computing predictions

To call the Microsoft Azure Face API to generate the predictions again from scratch, one can run the shell script in ./quant/classify.sh. Firstly however, you need to generate our synthetic images to classify, which we detail below.

Qualitative results

generating the images

Reproducing the qualitative results (i.e. in Fig. 6) involves generating synthetic faces and 3 edited versions with the 3 attributes of interest (hair colour, yaw, and pitch). To generate these images (which are also used for the quantitative results), simply run:

$ ./generate_quant_edits.sh

mode-wise edits

Manual edits along individual modes of the tensor are made by calling main.py with the --mode edit_modewise flag. For example, one can reproduce the images from Fig. 3 with:

$ python main.py --cp_rank 0 --tucker_ranks "4,4,4,512" --model_name pggan_celebahq1024 --penalty_lam 0.001 --resume_iters 1000
  --n_to_edit 10 \
  --mode edit_modewise \
  --attribute_to_edit male

multilinear edits

Edits achieved with the 'multilinear mixing' are achieved instead by loading the relevant weights and supplying the --mode edit_multilinear flag. For example, the images in Fig. 4 are generated with:

$ python main.py --cp_rank 0 --tucker_ranks "256,4,4,512" --model_name pggan_celebahq1024 --penalty_lam 0.001 --resume_iters 200000
  --n_to_edit 10 \
  --mode edit_multilinear \
  --attribute_to_edit thick

Please feel free to get in touch at: [email protected], where x=oldfield

credits

All the code in ./architectures/ and utils.py is directly imported from https://github.com/genforce/genforce, only lightly modified to support performing the forward pass through the models partially, and returning the intermediate tensors.

The structure of the codebase follows https://github.com/yunjey/stargan, and hence we use their code as a template to build off. For this reason, you will find small helper functions (e.g. the first few lines of main.py) are borrowed from the StarGAN codebase.

Code to reproduce the results in the paper "Tensor Component Analysis for Interpreting the Latent Space of GANs".

Related tags

Overview

Tensor Component Analysis for Interpreting the Latent Space of GANs

[ paper | project page ]

dependencies

Pretrained weights

Quantitative results

building the prediction matrices

manually computing predictions

Qualitative results

generating the images

mode-wise edits

multilinear edits

credits

Owner

James Oldfield

Semi-supervised Domain Adaptation via Minimax Entropy

Official Pytorch implementation for 2021 ICCV paper "Learning Motion Priors for 4D Human Body Capture in 3D Scenes" and trained models / data

PyTorch code for our ECCV 2020 paper "Single Image Super-Resolution via a Holistic Attention Network"

Official PyTorch implementation of SyntaSpeech (IJCAI 2022)

Cards Against Humanity AI

Labels4Free: Unsupervised Segmentation using StyleGAN

The aim of this project is to build an AI bot that can play the Wordle game, or more generally Squabble

Pose estimation with MoveNet Lightning

Streamlit app demonstrating an image browser for the Udacity self-driving-car dataset with realtime object detection using YOLO.

Open source simulator for autonomous vehicles built on Unreal Engine / Unity, from Microsoft AI & Research

Picasso: A CUDA-based Library for Deep Learning over 3D Meshes

Neural Surface Maps

Implementation of paper "DeepTag: A General Framework for Fiducial Marker Design and Detection"

EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos.

Differentiable simulation for system identification and visuomotor control

CTC segmentation python package

Deploy pytorch classification model using Flask and Streamlit

Generate vibrant and detailed images using only text.

ECCV18 Workshops - Enhanced SRGAN. Champion PIRM Challenge on Perceptual Super-Resolution. The training codes are in BasicSR.

Implement of homography net by pytorch