3D Generative Adversarial Network

Last update: Dec 20, 2022

Related tags

Overview

Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling

This repository contains pre-trained models and sampling code for the 3D Generative Adversarial Network (3D-GAN) presented at NIPS 2016.

http://3dgan.csail.mit.edu

Prerequisites

Torch

We use Torch 7 (http://torch.ch) for our implementation with these additional packages:

matio or fb.mattorch: we use .mat file for saving voxelized shapes.

Visualization

Basic visualization: MATLAB (tested on R2016b)
Advanced visualization: Python 2.7 with package numpy, matplotlib, scipy and vtk (version 5.10.1)

Note: for advanced visualization, the version of vtk has to be 5.10.1, not above. It is available in the package list of common Python distributions like Anaconda

Installation

Our current release has been tested on Ubuntu 14.04.

Cloning the repository

git clone [email protected]:zck119/3dgan-release.git
cd 3dgan-release

Downloading pretrained models

For CPU (947 MB):

./download_models_cpu.sh

For GPU (618 MB):

./download_models_gpu.sh

Downloading latent vector inputs for demo

./download_demo_inputs.sh

Guide

Synthesizing shapes (`main.lua`)

We show how to synthesize shapes with our pre-trained models. The file (main.lua) has the following options.

-gpu ID: GPU ID (starting from 1). Set to 0 to use CPU only.
-class CLASSNAME: synthesize shapes for the class CLASSNAME. We currently support five classes (car, chair, desk, gun, and sofa). Use all to generate shapes for each class.
-sample: whether to sample input latent vectors from an i.i.d. uniform distribution, or to generate shapes with demo vectors loaded from ./demo_inputs/CLASSNAME.mat
-bs BATCH_SIZE: use batch size of BATCH_SIZE during network forwarding
-ss SAMPLE_SIZE: set the number of generated shapes to SAMPLE_SIZE. This option is only available in -sample mode.

Usages include

Synthesize chairs with pre-sampled demo inputs and a CPU

th main.lua -gpu 0 -class chair

Randomly sample 150 desks with GPU 1 and a batch size of 50

th main.lua -gpu 1 -class desk -bs 50 -sample -ss 150

Randomly sample 150 shapes of each category with GPU 1 and a batch size of 50

th main.lua -gpu 1 -class all -bs 50 -sample -ss 150

The output is saved under folder ./output, with class_name_demo.mat for shapes generated by predetermined demo inputs (z in our paper), and class_name_sample.mat for randomly sampled 3D shapes. The variable inputs in the .mat file correponds to the input latent representation, and the variable voxels corresponds to the generated 3D shapes by our network.

Visualization

We offer two ways of visualizing results, one in MATLAB and the other in Python. We used the Python visualization in our paper. The MATLAB visualization is easier to install and run, but its output has a lower quality compared with the Python one.

MATLAB: Please use the function visualization/matlab/visualize.m for visualization. The MATLAB code allows users to either display rendered objects or save them as images. The script also supports downsampling and thresholding for faster rendering. The color of voxels represents the confidence value.

Options include

inputfile: the .mat file that saves the voxel matrices
indices: the indices of objects in the inputfile that should be rendered. The default value is 0, which stands for rendering all objects.
step (s): downsample objects via a max pooling of step s for efficiency. The default value is 4 (64 x 64 x 64 -> 16 x 16 x 16).
threshold: voxels with confidence lower than the threshold are not displayed
outputprefix:
- when not specified, Matlab shows figures directly.
- when specified, Matlab stores rendered images of objects at outputprefix_%i.bmp, where %i is the index of objects

Usage (after running th main.lua -gpu 0 -class chair, in MATLAB, in folder visualization/matlab):

visualize('../../output/chair_demo.mat', 0, 2, 0.1, 'chair')

The visualization might take a while. The obtained rendering (chair_1/3/4/5.bmp) should look as follows.

Python: Options for the Python visualization include

-t THRESHOLD: voxels with confidence lower than the threshold are not displayed. The default value is 0.1.
-i ID: the index of objects in the inputfile that should be rendered (one based). The default value is 1.
-df STEPSIZE: downsample objects via a max pooling of step STEPSIZE for efficiency. Currently supporting STEPSIZE 1, 2, and 4. The default value is 1 (i.e. no downsampling).
-dm METHOD: downsample method, where mean stands for average pooling and max for max pooling. The default is max pooling.
-u BLOCK_SIZE: set the size of the voxels to BLOCK_SIZE. The default value is 0.9.
-cm: whether to use a colormap to represent voxel occupancy, or to use a uniform color
-mc DISTANCE: whether to keep only the maximal connected component, where voxels of distance no larger than DISTANCE are considered connected. Set to 0 to disable this function. The default value is 3.

Usage:

python visualize.py chair_demo.mat -u 0.9 -t 0.1 -i 1 -mc 2

Reference

@inproceedings{3dgan,
  title={{Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling}},
  author={Wu, Jiajun and Zhang, Chengkai and Xue, Tianfan and Freeman, William T and Tenenbaum, Joshua B},
  booktitle={Advances In Neural Information Processing Systems},
  pages={82--90},
  year={2016}
}

For any questions, please contact Jiajun Wu ([email protected]) and Chengkai Zhang ([email protected]).

3D Generative Adversarial Network

Related tags

Overview

Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling

Prerequisites

Torch

Visualization

Installation

Cloning the repository

Downloading pretrained models

Downloading latent vector inputs for demo

Guide

Synthesizing shapes (`main.lua`)

Visualization

Reference

Owner

Chengkai Zhang

GazeScroller - Using Facial Movements to perform Hands-free Gesture on the system

Python Multi-Agent Reinforcement Learning framework

The official code of "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation

Attack classification models with transferability, black-box attack; unrestricted adversarial attacks on imagenet

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

The Few-Shot Bot: Prompt-Based Learning for Dialogue Systems

Efficient Lottery Ticket Finding: Less Data is More

Fight Recognition from Still Images in the Wild @ WACVW2022, Real-world Surveillance Workshop

Code for Emergent Translation in Multi-Agent Communication

Code for STFT Transformer used in BirdCLEF 2021 competition.

PyToch implementation of A Novel Self-supervised Learning Task Designed for Anomaly Segmentation

A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.

[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing

EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos.

The official PyTorch implementation of the paper: Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." .

Books, Presentations, Workshops, Notebook Labs, and Model Zoo for Software Engineers and Data Scientists wanting to learn the TF.Keras Machine Learning framework

codes for Image Inpainting with External-internal Learning and Monochromic Bottleneck

LyaNet: A Lyapunov Framework for Training Neural ODEs

Some experiments with tennis player aging curves using Hilbert space GPs in PyMC. Only experimental for now.

3D Generative Adversarial Network

Related tags

Overview

Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling

Prerequisites

Torch

Visualization

Installation

Cloning the repository

Downloading pretrained models

Downloading latent vector inputs for demo

Guide

Synthesizing shapes (main.lua)

Visualization

Reference

Owner

Chengkai Zhang

GazeScroller - Using Facial Movements to perform Hands-free Gesture on the system

Python Multi-Agent Reinforcement Learning framework

The official code of "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Symmetry and Uncertainty-Aware Object SLAM for 6DoF Object Pose Estimation

Attack classification models with transferability, black-box attack; unrestricted adversarial attacks on imagenet

This project is a loose implementation of paper "Algorithmic Financial Trading with Deep Convolutional Neural Networks: Time Series to Image Conversion Approach"

The Few-Shot Bot: Prompt-Based Learning for Dialogue Systems

Efficient Lottery Ticket Finding: Less Data is More

Fight Recognition from Still Images in the Wild @ WACVW2022, Real-world Surveillance Workshop

Code for Emergent Translation in Multi-Agent Communication

Code for STFT Transformer used in BirdCLEF 2021 competition.

PyToch implementation of A Novel Self-supervised Learning Task Designed for Anomaly Segmentation

A (PyTorch) imbalanced dataset sampler for oversampling low frequent classes and undersampling high frequent ones.

[CVPR 2021] Anycost GANs for Interactive Image Synthesis and Editing

EasyMocap is an open-source toolbox for markerless human motion capture from RGB videos.

The official PyTorch implementation of the paper: *Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." *.

Books, Presentations, Workshops, Notebook Labs, and Model Zoo for Software Engineers and Data Scientists wanting to learn the TF.Keras Machine Learning framework

codes for Image Inpainting with External-internal Learning and Monochromic Bottleneck

LyaNet: A Lyapunov Framework for Training Neural ODEs

Some experiments with tennis player aging curves using Hilbert space GPs in PyMC. Only experimental for now.

Synthesizing shapes (`main.lua`)

The official PyTorch implementation of the paper: Xili Dai, Xiaojun Yuan, Haigang Gong, Yi Ma. "Fully Convolutional Line Parsing." .