Pytorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

Last update: Dec 31, 2022

Related tags

Deep Learning T2I_CL

Overview

T2I_CL

This is the official Pytorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

Requirements

Linux
Python ≥ 3.6
PyTorch ≥ 1.4.0

Prepare Data

Download the preprocessed datasets from AttnGAN

Alternatively, another site is from DM-GAN

Training

Pretrain DAMSM+CL:
- For bird dataset: python pretrain_DAMSM.py --cfg cfg/DAMSM/bird.yml --gpu 0
- For coco dataset: python pretrain_DAMSM.py --cfg cfg/DAMSM/coco.yml --gpu 0
Train AttnGAN+CL:
- For bird dataset: python main.py --cfg cfg/bird_attn2.yml --gpu 0
- For coco dataset: python main.py --cfg cfg/coco_attn2.yml --gpu 0
Train DM-GAN+CL:
- For bird dataset: python main.py --cfg cfg/bird_DMGAN.yml --gpu 0
- For coco dataset: python main.py --cfg cfg/coco_DMGAN.yml --gpu 0

Pretrained Models

DAMSM+CL for bird. Download and save it to DAMSMencoders/
DAMSM+CL for coco. Download and save it to DAMSMencoders/
AttnGAN+CL for bird. Download and save it to models/
AttnGAN+CL for coco. Download and save it to models/
DM-GAN+CL for bird. Download and save it to models/
DM-GAN+CL for coco. Download and save it to models/

Evaluation

Sampling and get the R-precision:
- python main.py --cfg cfg/eval_bird.yml --gpu 0
- python main.py --cfg cfg/eval_coco.yml --gpu 0
Inception score:
- python inception_score_bird.py --image_folder fake_images_bird
- python inception_score_coco.py fake_images_coco
FID:
- python fid_score.py --gpu 0 --batch-size 50 --path1 real_images_bird --path2 fake_images_bird
- python fid_score.py --gpu 0 --batch-size 50 --path1 real_images_coco --path2 fake_images_coco

Citation

If you find this work useful in your research, please consider citing:

@article{ye2021improving,
  title={Improving Text-to-Image Synthesis Using Contrastive Learning},
  author={Ye, Hui and Yang, Xiulong and Takac, Martin and Sunderraman, Rajshekhar and Ji, Shihao},
  journal={arXiv preprint arXiv:2107.02423},
  year={2021}
}

Acknowledge

Our work is based on the following works:

Pytorch implementation of the paper Improving Text-to-Image Synthesis Using Contrastive Learning

Related tags

Overview

T2I_CL

Requirements

Prepare Data

Training

Pretrained Models

Evaluation

Citation

Acknowledge

Owner

Residual Dense Net De-Interlace Filter (RDNDIF)

Nvdiffrast - Modular Primitives for High-Performance Differentiable Rendering

Fast convergence of detr with spatially modulated co-attention

This is the repository for the NeurIPS-21 paper [Contrastive Graph Poisson Networks: Semi-Supervised Learning with Extremely Limited Labels].

Applying curriculum to meta-learning for few shot classification

Github project for Attention-guided Temporal Coherent Video Object Matting.

[ICME 2021 Oral] CORE-Text: Improving Scene Text Detection with Contrastive Relational Reasoning

OpenDILab Multi-Agent Environment

Unofficial PyTorch reimplementation of the paper Swin Transformer V2: Scaling Up Capacity and Resolution

PyTorch implementation of MulMON

Transfer SemanticKITTI labeles into other dataset/sensor formats.

Neural machine translation between the writings of Shakespeare and modern English using TensorFlow

SeisComP/SeisBench interface to enable deep-learning (re)picking in SeisComP

SuperSDR: multiplatform KiwiSDR + CAT transceiver integrator

MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.

Differential fuzzing for the masses!

Quick program made to generate alpha and delta tables for Hidden Markov Models

Simple (but Strong) Baselines for POMDPs

PyTorch implementation of our method for adversarial attacks and defenses in hyperspectral image classification.

Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.