Toward a Visual Concept Vocabulary for GAN Latent Space, ICCV 2021

Last update: Dec 23, 2022

Related tags

Overview

Toward a Visual Concept Vocabulary for GAN Latent Space
_{Code and data from the ICCV 2021 paper}

Sarah Schwettmann, Evan Hernandez, David Bau, Samuel Klein, Jacob Andreas, Antonio Torralba
Paper | Website | arxiv

This repository contains code for finding layer-selective directions, distilling them, and loading the vocabulary of visual concepts in BigGAN used in the original paper.

Notice: This repository is under active development! Expect instability until at least October 25th, 2021.

Installation

The provided code has been tested for Python 3.8 on MacOS and Ubuntu 20.04. It may still work in other environments, but we make no guarantees.

To run the code yourself, start by cloning the repository:

git clone https://github.com/schwettmann/visual-vocab
cd visual-vocab

(Optional) You will probably want to create a conda environment or virtual environment instead of installing the dependencies globally. E.g., to create a new virtual environment you can run:

python3 -m venv env
source env/bin/activate

Finally, install the Python dependencies using pip:

pip3 install -r requirements.txt

Usage

Notice: This section is under construction and will be updated as functionality gets added.

To download any of the various annotated directions from the paper, use datasets.load submodule. It downloads and parses the annoated directions. Example usage:

from visualvocab import datasets

# Download layer-selective directions and annotations used for distilling single-word directions:
dataset = datasets.load('lsd_all')

# Download distilled directions for all BigGAN-Places365 categories:
dataset = datasets.load('distilled_all')

# Download distilled directions for a specific BigGAN-Places365 category:
dataset = datasets.load('distilled_cottage')

See the module for a full list of available annotated directions.

Citation

Sarah Schwettmann, Evan Hernandez, David Bau, Samuel Klein, Jacob Andreas, Antonio Torralba. Toward a Visual Concept Vocabulary for GAN Latent Space, Proceedings of the International Conference on Computer Vision (ICCV), 2021.

Bibtex

@InProceedings{Schwettmann_2021_ICCV,
    author    = {Schwettmann, Sarah and Hernandez, Evan and Bau, David and Klein, Samuel and Andreas, Jacob and Torralba, Antonio},
    title     = {Toward a Visual Concept Vocabulary for GAN Latent Space},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {6804-6812}
}

Toward a Visual Concept Vocabulary for GAN Latent Space, ICCV 2021

Related tags

Overview

Toward a Visual Concept Vocabulary for GAN Latent Space
_{Code and data from the ICCV 2021 paper}

Installation

Usage

Citation

Bibtex

Owner

Sarah Schwettmann

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

Knowledge Management for Humans using Machine Learning & Tags

Espial is an engine for automated organization and discovery of personal knowledge

HiFi DeepVariant + WhatsHap workflowHiFi DeepVariant + WhatsHap workflow

TLA - Twitter Linguistic Analysis

Text to speech converter with GUI made in Python.

Grover is a model for Neural Fake News -- both generation and detectio

Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).

Idea is to build a model which will take keywords as inputs and generate sentences as outputs.

Refactored version of FastSpeech2

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis

Natural Language Processing for Adverse Drug Reaction (ADR) Detection

American Sign Language (ASL) to Text Converter

What are the best Systems? New Perspectives on NLP Benchmarking

An implementation of the Pay Attention when Required transformer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Crie tokens de autenticação íntegros e seguros com UToken.

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

KR-FinBert And KR-FinBert-SC

Toward a Visual Concept Vocabulary for GAN Latent Space, ICCV 2021

Related tags

Overview

Toward a Visual Concept Vocabulary for GAN Latent Space Code and data from the ICCV 2021 paper

Installation

Usage

Citation

Bibtex

Owner

Sarah Schwettmann

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

Knowledge Management for Humans using Machine Learning & Tags

Espial is an engine for automated organization and discovery of personal knowledge

HiFi DeepVariant + WhatsHap workflowHiFi DeepVariant + WhatsHap workflow

TLA - Twitter Linguistic Analysis

Text to speech converter with GUI made in Python.

Grover is a model for Neural Fake News -- both generation and detectio

Unifying Cross-Lingual Semantic Role Labeling with Heterogeneous Linguistic Resources (NAACL-2021).

Idea is to build a model which will take keywords as inputs and generate sentences as outputs.

Refactored version of FastSpeech2

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

Official implementation of MLP Singer: Towards Rapid Parallel Korean Singing Voice Synthesis

Natural Language Processing for Adverse Drug Reaction (ADR) Detection

American Sign Language (ASL) to Text Converter

What are the best Systems? New Perspectives on NLP Benchmarking

An implementation of the Pay Attention when Required transformer

Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code

Crie tokens de autenticação íntegros e seguros com UToken.

NL-Augmenter 🦎 → 🐍 A Collaborative Repository of Natural Language Transformations

KR-FinBert And KR-FinBert-SC

Toward a Visual Concept Vocabulary for GAN Latent Space
_{Code and data from the ICCV 2021 paper}