guided-diffusion

This is the codebase for Diffusion Models Beat GANS on Image Synthesis.

This repository is based on openai/improved-diffusion, with modifications for classifier conditioning and architecture improvements.

Usage

Training diffusion models is described in the parent repository. Training a classifier is similar. We assume you have put training hyperparameters into a TRAIN_FLAGS variable, and classifier hyperparameters into a CLASSIFIER_FLAGS variable. Then you can run:

mpiexec -n N python scripts/classifier_train.py --data_dir path/to/imagenet $TRAIN_FLAGS $CLASSIFIER_FLAGS

Make sure to divide the batch size in TRAIN_FLAGS by the number of MPI processes you are using.

Here are flags for training the 128x128 classifier. You can modify these for training classifiers at other resolutions:

TRAIN_FLAGS="--iterations 300000 --anneal_lr True --batch_size 256 --lr 3e-4 --save_interval 10000 --weight_decay 0.05"
CLASSIFIER_FLAGS="--image_size 128 --classifier_attention_resolutions 32,16,8 --classifier_depth 2 --classifier_width 128 --classifier_pool attention --classifier_resblock_updown True --classifier_use_scale_shift_norm True"

For sampling from a 128x128 classifier-guided model, 25 step DDIM:

MODEL_FLAGS="--attention_resolutions 32,16,8 --class_cond True --image_size 128 --learn_sigma True --num_channels 256 --num_heads 4 --num_res_blocks 2 --resblock_updown True --use_fp16 True --use_scale_shift_norm True"
CLASSIFIER_FLAGS="--image_size 128 --classifier_attention_resolutions 32,16,8 --classifier_depth 2 --classifier_width 128 --classifier_pool attention --classifier_resblock_updown True --classifier_use_scale_shift_norm True --classifier_scale 1.0 --classifier_use_fp16 True"
SAMPLE_FLAGS="--batch_size 4 --num_samples 50000 --timestep_respacing ddim25 --use_ddim True"
mpiexec -n N python scripts/classifier_sample.py \
    --model_path /path/to/model.pt \
    --classifier_path path/to/classifier.pt \
    $MODEL_FLAGS $CLASSIFIER_FLAGS $SAMPLE_FLAGS

To sample for 250 timesteps without DDIM, replace --timestep_respacing ddim25 to --timestep_respacing 250, and replace --use_ddim True with --use_ddim False.

This is the codebase for Diffusion Models Beat GANS on Image Synthesis.

Related tags

Overview

guided-diffusion

Usage

Owner

OpenAI

a general-purpose Transformer based vision backbone

Python project to take sound as input and output as RGB + Brightness values suitable for DMX

AI Virtual Calculator: This is a simple virtual calculator based on Artificial intelligence.

Sentiment analysis translations of the Bhagavad Gita

Semantic Segmentation for Aerial Imagery using Convolutional Neural Network

ICLR 2021, Fair Mixup: Fairness via Interpolation

Pathdreamer: A World Model for Indoor Navigation

FwordCTF 2021 Infrastructure and Source code of Web/Bash challenges

This repo includes the supplementary of our paper "CEMENT: Incomplete Multi-View Weak-Label Learning with Long-Tailed Labels"

[ICML 2020] Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

The official re-implementation of the Neurips 2021 paper, "Targeted Neural Dynamical Modeling".

GDSC-ML Team Interview Task

The implementation of PEMP in paper "Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes"

Decentralized Reinforcment Learning: Global Decision-Making via Local Economic Transactions (ICML 2020)

Awesome Graph Classification - A collection of important graph embedding, classification and representation learning papers with implementations.

[ICCV 2021] A Simple Baseline for Semi-supervised Semantic Segmentation with Strong Data Augmentation

A task Provided by A respective Artenal Ai and Ml based Company to complete it

GUI for a Vocal Remover that uses Deep Neural Networks.

End-To-End Memory Network using Tensorflow

AAAI 2022: Stationary diffusion state neural estimation