Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Last update: Dec 01, 2022

Overview

Interpretable Control Exploration and Counterfactual Explanation (ICE) on StyleGAN

Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Bo Li, Qiulin Wang, Jiquan Pei, Yu Yang, Xiangyang Ji

Abstract: The semantically disentangled latent subspace in GAN provides rich interpretable controls in image generation. This paper includes two contributions on semantic latent subspace analysis in the scenario of face generation using StyleGAN2. First, we propose a novel approach to disentangle latent subspace semantics by exploiting existing face analysis models, e.g., face parsers and face landmark detectors. These models provide the flexibility to construct various criterions with very concrete and interpretable semantic meanings (e.g., change face shape or change skin color) to restrict latent subspace disentanglement. Rich latent space controls unknown previously can be discovered using the constructed criterions. Second, we propose a new perspective to explain the behavior of a CNN classifier by generating counterfactuals in the interpretable latent subspaces we discovered. This explanation helps reveal whether the classifier learns semantics as intended. Experiments on various disentanglement criterions demonstrate the effectiveness of our approach. We believe this approach contributes to both areas of image manipulation and counterfactual explainability of CNNs.

The code is developed on NVlabs/stylegan2-ada-pytorch and put in the ice folder. Please play with the two ipython notebooks.

ice/discover_subspaces

Solve subspaces by using face analysis models as criterions. Currently we only include several representative subspaces. The notebook requires to download some pre-trained models. You might have to spend some efforts to put everything at the right place. See the notebook comments for details. This notebook shows the code sketch to generate Figure 3 (as below) in the paper, i.e., the latent subspace for interpretable face manipulation.

ice/explain_counterfactually

Use the interpretable subspaces discovered by the above notebook to explain the classifier of attractiveness. This notebook shows the code sketch to generate Figure 4 (as below) in the paper, i.e., the interpretable counterfactuals to increase attractiveness score of a given classifier. Since we did not find good public pre-trained model. The attractiveness classifier is trained by ourselves using d-li14/face-attribute-prediction.

Which Style Makes Me Attractive? Interpretable Control Discovery and Counterfactual Explanation on StyleGAN

Related tags

Overview

Interpretable Control Exploration and Counterfactual Explanation (ICE) on StyleGAN

Owner

Bo Li

Data and Code for paper Outlining and Filling: Hierarchical Query Graph Generation for Answering Complex Questions over Knowledge Graph is available for research purposes.

Diagnostic tests for linguistic capacities in language models

Checkout some cool self-projects you can try your hands on to curb your boredom this December!

Implementation for Curriculum DeepSDF

A deep neural networks for images using CNN algorithm.

(CVPR 2022) Energy-based Latent Aligner for Incremental Learning

Implementation of ICCV21 paper: PnP-DETR: Towards Efficient Visual Analysis with Transformers

Using CNN to mimic the driver based on training data from Torcs

The Python3 import playground

CAPRI: Context-Aware Interpretable Point-of-Interest Recommendation Framework

Data & Code for ACCENTOR Adding Chit-Chat to Enhance Task-Oriented Dialogues

Implementation of the master's thesis "Temporal copying and local hallucination for video inpainting".

[ICCV 2021] Encoder-decoder with Multi-level Attention for 3D Human Shape and Pose Estimation

Using this codebase as a tool for my own research. Making some modifications to the original repo for my own purposes.

Human Action Controller - A human action controller running on different platforms.

Tensors and neural networks in Haskell

Making self-supervised learning work on molecules by using their 3D geometry to pre-train GNNs. Implemented in DGL and Pytorch Geometric.

This repository is a series of notebooks that show solutions for the projects at Dataquest.io.

Convert Python 3 code to CUDA code.

Train DeepLab for Semantic Image Segmentation