Learning Energy-Based Models by Diffusion Recovery Likelihood

Last update: Nov 22, 2022


Ruiqi Gao, Yang Song, Ben Poole, Ying Nian Wu, Diederik P. Kingma

Paper: https://arxiv.org/pdf/2012.08125

Requirements

Experiments can be run on a single GPU or a Google Cloud TPU v3-8. Python >= 3.5 is required. To install the dependencies:

pip install -r requirements.txt

To compute FID/Inception scores, download the pre-computed dataset statistics from https://drive.google.com/file/d/1QOLyYHESflcdZu8CsBLZohZzC95HyukK/view?usp=sharing, unzip the file, and put the extracted folder in this repo.
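For context, FID is commonly defined as the Fréchet distance between two Gaussians fitted to Inception features, which is why per-dataset statistics are worth computing in advance. The formula below assumes the downloaded statistics contain a feature mean \mu_r and covariance \Sigma_r for the real data (an assumption; the file contents are not described here), with \mu_g and \Sigma_g the same quantities for generated samples:

```latex
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2
  + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2 \left( \Sigma_r \Sigma_g \right)^{1/2} \right)
```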

Train with 1 GPU

CIFAR10

python main.py --num_res_blocks=8 --n_batch_train=256

CelebA

python main.py --problem=celeba --num_res_blocks=6 --beta_1=0.5 --batch_size=128

LSUN church_outdoor 64x64 / LSUN bedroom 64x64

python main.py --problem=[lsun_church64/lsun_bedroom64] --batch_size=128
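The bracketed value in the command above appears to be shorthand for choosing one of the two dataset names rather than literal syntax. A minimal sketch that spells out both variants, using a hypothetical helper `expand_problem_choices` (not part of this repo) that only builds the command strings and does not run them:

```python
import re


def expand_problem_choices(template):
    """Expand a '[a/b]' placeholder into one command string per option."""
    match = re.search(r"\[([^\]]+)\]", template)
    if match is None:
        return [template]  # nothing to expand
    options = match.group(1).split("/")
    # Substitute each option in place of the bracketed placeholder.
    return [template[:match.start()] + opt + template[match.end():]
            for opt in options]


template = "python main.py --problem=[lsun_church64/lsun_bedroom64] --batch_size=128"
commands = expand_problem_choices(template)
for cmd in commands:
    print(cmd)
```

Each printed line is a complete command for one of the two 64x64 LSUN datasets.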

LSUN church_outdoor 128x128

python main.py --problem=lsun_church128 --beta_1=0.5

LSUN bedroom 128x128

python main.py --problem=lsun_bedroom128 --beta_1=0.5 --num_res_blocks=5

Compute full FID / IS scores after training on CIFAR10

python main.py --eval --num_res_blocks=8 --noise_scale=0.99 --fid_n_batch=2000

For faster training, reduce the value of num_res_blocks.

Train with Google Cloud TPU

Add --tpu=True to any of the single-GPU commands above. You also need to set --tpu_name and --tpu_zone to the values shown in Google Cloud.
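As a sketch of how these flags combine, the snippet below appends the TPU options to the CIFAR10 command. The helper `with_tpu_flags` and the values `my-tpu` and `us-central1-a` are hypothetical placeholders, and the `--flag=value` form for the name and zone is assumed from the style of the other flags. Substitute the name and zone shown for your TPU in Google Cloud.

```python
import shlex


def with_tpu_flags(command, tpu_name, tpu_zone):
    """Return the command with --tpu, --tpu_name and --tpu_zone appended."""
    args = shlex.split(command)
    args += ["--tpu=True", "--tpu_name=" + tpu_name, "--tpu_zone=" + tpu_zone]
    # Quote each argument so the result is safe to paste into a shell.
    return " ".join(shlex.quote(a) for a in args)


cifar10_cmd = "python main.py --num_res_blocks=8 --n_batch_train=256"
tpu_cmd = with_tpu_flags(cifar10_cmd, "my-tpu", "us-central1-a")  # placeholder values
print(tpu_cmd)
```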

Pretrained models

https://drive.google.com/file/d/1eneA6T5jQIyVFLFSOrSfJvDeUJJMh9xk/view?usp=sharing

This code is for the T6 setting. The T1k setting will be uploaded soon!

Citation

If you find our work helpful for your research, please cite:

@article{gao2020learning,
  title={Learning Energy-Based Models by Diffusion Recovery Likelihood},
  author={Gao, Ruiqi and Song, Yang and Poole, Ben and Wu, Ying Nian and Kingma, Diederik P},
  journal={arXiv preprint arXiv:2012.08125},
  year={2020}
}
