Official implementation of the paper Image Generators with Conditionally-Independent Pixel Synthesis https://arxiv.org/abs/2011.13775

Last update: Dec 21, 2022

Overview

CIPS -- Official PyTorch Implementation of the paper Image Generators with Conditionally-Independent Pixel Synthesis

Requirements

pip install -r requirements.txt

Usage

First, create the LMDB datasets:

python prepare_data.py images --out LMDB_PATH --n_worker N_WORKER --size SIZE1,SIZE2,SIZE3,... DATASET_PATH

This converts the images to JPEG and pre-resizes them to each of the sizes given in --size.
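As a rough illustration of the multi-size layout, the sketch below parses a comma-separated --size value and builds one record key per image and resolution. The helper names parse_sizes and record_key and the "SIZE-INDEX" key format are assumptions made for this example; check prepare_data.py for the layout the script actually uses.

```python
from typing import List


def parse_sizes(size_arg: str) -> List[int]:
    """Parse a comma-separated --size value such as "128,256" into integers."""
    sizes = [int(part.strip()) for part in size_arg.split(",") if part.strip()]
    if not sizes:
        raise ValueError("at least one size is required")
    return sizes


def record_key(size: int, index: int) -> bytes:
    """Build a hypothetical database key for image `index` stored at `size`.

    The "SIZE-INDEX" format (index zero-padded to 5 digits) is an assumption
    made for this sketch, not a documented guarantee of prepare_data.py.
    """
    return f"{size}-{index:05d}".encode("utf-8")


# One key per requested resolution for the first image in the dataset.
sizes = parse_sizes("128, 256,512")
keys = [record_key(size, 0) for size in sizes]
```

Storing every resolution up front means training can read an image at the needed size without resizing it on the fly.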

To train on FFHQ-256 or churches, run:

python3 -m torch.distributed.launch --nproc_per_node=8 --master_port=1234 train.py --n_sample=8 --batch=4 --fid_batch=8 --Generator=CIPSskip --output_dir=skip-[ffhq/churches] --img2dis --num_workers=16 DATASET_PATH

To train on patches add --crop=PATCH_SIZE. PATCH_SIZE has to be a power of 2.
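A quick way to check a candidate PATCH_SIZE before launching a run is the usual bit trick for powers of two. This is a standalone sketch: the function names are hypothetical, and it is not taken from train.py.

```python
def is_power_of_two(n: int) -> bool:
    """Return True if n is a positive power of two (1, 2, 4, 8, ...)."""
    # A power of two has exactly one bit set, so n & (n - 1) clears it to zero.
    return n > 0 and (n & (n - 1)) == 0


def validate_patch_size(patch_size: int) -> int:
    """Return patch_size unchanged, or raise if it is not a power of two."""
    if not is_power_of_two(patch_size):
        raise ValueError(f"--crop must be a power of 2, got {patch_size}")
    return patch_size


valid = [size for size in (32, 64, 96, 128, 200) if is_power_of_two(size)]
```

Here `valid` keeps 32, 64 and 128 and drops 96 and 200.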

Pretrained Checkpoints

Generate samples

To try the models, download the checkpoints and open notebook.ipynb.

Progressive training

We also trained progressively on FFHQ, starting from a 256×256 initialization, and obtained an FID of 10.07. We will update the paper with the training details soon. The checkpoint is named ffhq1024.pt. Samples are below.
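For reference, FID (Fréchet Inception Distance) is commonly defined as below, where lower is better. The symbols are the standard ones and are not taken from this repository: \mu_r, \Sigma_r are the mean and covariance of Inception features of real images, and \mu_g, \Sigma_g are those of generated images. How the reported value was computed (for example, the number of samples) is not stated here.

```latex
\mathrm{FID} = \lVert \mu_r - \mu_g \rVert_2^2
  + \operatorname{Tr}\!\left( \Sigma_r + \Sigma_g - 2\,(\Sigma_r \Sigma_g)^{1/2} \right)
```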

Citation

If you find our work useful, please cite:

@article{anokhin2020image,
  title={Image Generators with Conditionally-Independent Pixel Synthesis},
  author={Anokhin, Ivan and Demochkin, Kirill and Khakhulin, Taras and Sterkin, Gleb and Lempitsky, Victor and Korzhenkov, Denis},
  journal={arXiv preprint arXiv:2011.13775},
  year={2020}
}

The code is heavily based on the StyleGAN2 PyTorch implementation.

The Nvidia-licensed CUDA kernels (fused_bias_act_kernel.cu, upfirdn2d_kernel.cu) are for non-commercial use only.


Owner

Multimodal Lab @ Samsung AI Center Moscow
