Official implementation of VQ-Diffusion

Last update: Jan 03, 2023

Related tags

Overview

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Overview

This is the official repo for the paper: [Vector Quantized Diffusion Model for Text-to-Image Synthesis].

VQ-Diffusion is based on a VQ-VAE whose latent space is modeled by a conditional variant of the recently developed Denoising Diffusion Probabilistic Model (DDPM). It produces significantly better text-to-image generation results when compared with Autoregressive models with similar numbers of parameters. Compared with previous GAN-based methods, VQ-Diffusion can handle more complex scenes and improve the synthesized image quality by a large margin.

Our code and model is ready, however, they are still under the review of the company. We promise to release them in December.

Framework

Samples

More Samples

Owner

Microsoft

Open source projects and samples from Microsoft

GitHub Repository

Random-Afg - Afghanistan Random Old Idz Cloner Tools

AFGHANISTAN RANDOM OLD IDZ CLONER TOOLS Install $ apt update $ apt upgrade $ apt

5 Jan 26, 2022

A Novel Plug-in Module for Fine-grained Visual Classification

Pytorch implementation for A Novel Plug-in Module for Fine-Grained Visual Classification. fine-grained visual classification task.

109 Dec 20, 2022

BasicRL: easy and fundamental codes for deep reinforcement learning。It is an improvement on rainbow-is-all-you-need and OpenAI Spinning Up.

BasicRL: easy and fundamental codes for deep reinforcement learning BasicRL is an improvement on rainbow-is-all-you-need and OpenAI Spinning Up. It is

12 Apr 28, 2022

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks.

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks. Generally, we intergrete different kind of functional

28 Jan 08, 2023

DeepLearning Anomalies Detection with Bluetooth Sensor Data

Final Year Project. Constructing models to create offline anomalies detection using Travel Time Data collected from Bluetooth sensors along the route.

1 Jan 10, 2022

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation YouTube | BiliBili 16X interpolation results from two input images: Introd

28 Dec 09, 2022

The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.

DanceNet3D The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer. Dataset and Results Pleas

36 Dec 21, 2022

Implementation of paper "DCS-Net: Deep Complex Subtractive Neural Network for Monaural Speech Enhancement"

DCS-Net This is the implementation of "DCS-Net: Deep Complex Subtractive Neural Network for Monaural Speech Enhancement" Steps to run the model Edit V

10 Apr 04, 2022

Learning with Subset Stacking

Learning with Subset Stacking (LESS) LESS is a new supervised learning algorithm that is based on training many local estimators on subsets of a given

19 Oct 04, 2022

Official PyTorch Implementation for InfoSwap: Information Bottleneck Disentanglement for Identity Swapping

InfoSwap: Information Bottleneck Disentanglement for Identity Swapping Code usage Please check out the user manual page. Paper Gege Gao, Huaibo Huang,

56 Dec 20, 2022

Reproduces the results of the paper "Finite Basis Physics-Informed Neural Networks (FBPINNs): a scalable domain decomposition approach for solving differential equations".

Finite basis physics-informed neural networks (FBPINNs) This repository reproduces the results of the paper Finite Basis Physics-Informed Neural Netwo

65 Dec 28, 2022

[CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang

BNN - BN = ? Training Binary Neural Networks without Batch Normalization Codes for this paper BNN - BN = ? Training Binary Neural Networks without Bat

40 Dec 30, 2022

Official implementation of VQ-Diffusion

Related tags

Overview

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Overview

Framework

Samples

More Samples

Owner

Microsoft

Random-Afg - Afghanistan Random Old Idz Cloner Tools

A Novel Plug-in Module for Fine-grained Visual Classification

BasicRL: easy and fundamental codes for deep reinforcement learning。It is an improvement on rainbow-is-all-you-need and OpenAI Spinning Up.

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks.

DeepLearning Anomalies Detection with Bluetooth Sensor Data

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.

Implementation of paper "DCS-Net: Deep Complex Subtractive Neural Network for Monaural Speech Enhancement"

Learning with Subset Stacking

Official PyTorch Implementation for InfoSwap: Information Bottleneck Disentanglement for Identity Swapping

Reproduces the results of the paper "Finite Basis Physics-Informed Neural Networks (FBPINNs): a scalable domain decomposition approach for solving differential equations".

Official pytorch implementation of Rainbow Memory (CVPR 2021)

TipToiDog - Tip Toi Dog With Python

Leveraging OpenAI's Codex to solve cornerstone problems in Music

Weakly Supervised 3D Object Detection from Point Cloud with Only Image Level Annotation

Fast and Context-Aware Framework for Space-Time Video Super-Resolution (VCIP 2021)

Boosting Adversarial Attacks with Enhanced Momentum (BMVC 2021)

Extracts data from the database for a graph-node and stores it in parquet files

Reference implementation for Structured Prediction with Deep Value Networks

[CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang

Official implementation of VQ-Diffusion

Related tags

Overview

Vector Quantized Diffusion Model for Text-to-Image Synthesis

Overview

Framework

Samples

More Samples

Owner

Microsoft

Random-Afg - Afghanistan Random Old Idz Cloner Tools

A Novel Plug-in Module for Fine-grained Visual Classification

BasicRL: easy and fundamental codes for deep reinforcement learning。It is an improvement on rainbow-is-all-you-need and OpenAI Spinning Up.

Luminous is a framework for testing the performance of Embodied AI (EAI) models in indoor tasks.

DeepLearning Anomalies Detection with Bluetooth Sensor Data

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

The personal repository of the work: *DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer*.

Implementation of paper "DCS-Net: Deep Complex Subtractive Neural Network for Monaural Speech Enhancement"

Learning with Subset Stacking

Official PyTorch Implementation for InfoSwap: Information Bottleneck Disentanglement for Identity Swapping

Reproduces the results of the paper "Finite Basis Physics-Informed Neural Networks (FBPINNs): a scalable domain decomposition approach for solving differential equations".

Official pytorch implementation of Rainbow Memory (CVPR 2021)

TipToiDog - Tip Toi Dog With Python

Leveraging OpenAI's Codex to solve cornerstone problems in Music

Weakly Supervised 3D Object Detection from Point Cloud with Only Image Level Annotation

Fast and Context-Aware Framework for Space-Time Video Super-Resolution (VCIP 2021)

Boosting Adversarial Attacks with Enhanced Momentum (BMVC 2021)

Extracts data from the database for a graph-node and stores it in parquet files

Reference implementation for Structured Prediction with Deep Value Networks

[CVPRW 21] "BNN - BN = ? Training Binary Neural Networks without Batch Normalization", Tianlong Chen, Zhenyu Zhang, Xu Ouyang, Zechun Liu, Zhiqiang Shen, Zhangyang Wang

The personal repository of the work: DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer.