Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

Last update: Oct 11, 2022

Related tags

Overview

SinIR (Official Implementation)

Requirements

To install requirements:

pip install -r requirements.txt

We used Python 3.7.4 and f-strings which are introduced in python 3.6+

Training

To train a model, write a proper yaml config file in 'config_train' folder (sample yaml files provided in the config_train folder), and run this command:

python train.py <gpu_num> -y <yaml_file_in_'config_train'_folder>

For example, if you want to train a model with config_train/photo.yaml on gpu 0, run:

python train.py 0 -y photo

This will output a trained model, training logs, training output images and so on, to a subdirectory of 'outs' folder with proper naming and numbering which are used for inference.

Note that even though we provide one yaml file for each task, they can be used interchangeably, except few tasks.

You can copy and modify them depending on your purpose. Detailed explanation about configuration is written in the sample yaml files. Please read through it carefully if you need.

Inference

To carry out inference (i.e., image manipulation), you can specify inference yaml files in training yaml files. Please see provided sample training yaml files.

Or alternatively you can run this command:

python infer.py <output_dirnum> <gpu_num> -y <yaml_file_in_config_folder>

For example, if you want to carry out inference with a trained model numbered 002, with config_infer/photo_infer.yaml on gpu 0, run:

python infer.py 2 0 -y photo_infer

Then it will automatically find an output folder numbered 002 and conduct image manipulation, saving related results in the subdirectory.

Note that duplicated numbering (which can be avoided with a normal usage) will incur error. In this case, please keep only one output folder.

We also provide sample yaml files for inference which are paired with yaml files for training. Feel free to copy and modify depending on your purpose.

Acknowledgement

This repository includes images from:

https://www2.eecs.berkeley.edu/Research/Projects/CS/vision/bsds/ (BSD dataset)
https://github.com/luanfujun/deep-painterly-harmonization/ (https://arxiv.org/abs/1804.03189)
https://github.com/luanfujun/deep-photo-styletransfer (https://arxiv.org/abs/1703.07511)
The Web (free images)

This repository includes codes snippets from:

SSIM: https://github.com/VainF/pytorch-msssim
Anti-aliasing + Bicubic resampling: https://github.com/thstkdgus35/bicubic_pytorch
dilated mask: https://github.com/tamarott/SinGAN

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

Related tags

Overview

SinIR (Official Implementation)

Requirements

Training

Inference

Acknowledgement

Owner

A foreign language learning aid using a neural network to predict probability of translating foreign words

Links to works on deep learning algorithms for physics problems, TUM-I15 and beyond

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)

Toolkit for collecting and applying prompts

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes (AAAI2022)

This is an official implementation of the paper "Distance-aware Quantization", accepted to ICCV2021.

Applying curriculum to meta-learning for few shot classification

Self-Supervised Contrastive Learning of Music Spectrograms

A library for implementing Decentralized Graph Neural Network algorithms.

A Small and Easy approach to the BraTS2020 dataset (2D Segmentation)

CLIP (Contrastive Language–Image Pre-training) for Italian

This code uses generative adversarial networks to generate diverse task allocation plans for Multi-agent teams.

“Data Augmentation for Cross-Domain Named Entity Recognition” (EMNLP 2021)

Code release for our paper, "SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo"

Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"

A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations.

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Position detection system of mobile robot in the warehouse enviroment

Official Implementation and Dataset of "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency", CVPR 2021

Backend code to use MCPI's python API to make infinite worlds with custom generation

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

Related tags

Overview

SinIR (Official Implementation)

Requirements

Training

Inference

Acknowledgement

Owner

A foreign language learning aid using a neural network to predict probability of translating foreign words

Links to works on deep learning algorithms for physics problems, TUM-I15 and beyond

Weakly Supervised Learning of Instance Segmentation with Inter-pixel Relations, CVPR 2019 (Oral)

Toolkit for collecting and applying prompts

CMUA-Watermark: A Cross-Model Universal Adversarial Watermark for Combating Deepfakes (AAAI2022)

This is an official implementation of the paper "Distance-aware Quantization", accepted to ICCV2021.

Applying curriculum to meta-learning for few shot classification

Self-Supervised Contrastive Learning of Music Spectrograms

A library for implementing Decentralized Graph Neural Network algorithms.

A Small and Easy approach to the BraTS2020 dataset (2D Segmentation)

CLIP (Contrastive Language–Image Pre-training) for Italian

This code uses generative adversarial networks to generate diverse task allocation plans for Multi-agent teams.

“Data Augmentation for Cross-Domain Named Entity Recognition” (EMNLP 2021)

Code release for our paper, "SimNet: Enabling Robust Unknown Object Manipulation from Pure Synthetic Data via Stereo"

Official pytorch implementation of "DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion"

A U-Net combined with a variational auto-encoder that is able to learn conditional distributions over semantic segmentations.

​TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.

Position detection system of mobile robot in the warehouse enviroment

Official Implementation and Dataset of "PPR10K: A Large-Scale Portrait Photo Retouching Dataset with Human-Region Mask and Group-Level Consistency", CVPR 2021

Backend code to use MCPI's python API to make infinite worlds with custom generation

TextWorld is a sandbox learning environment for the training and evaluation of reinforcement learning (RL) agents on text-based games.