This project helps to colorize grayscale images using multiple exemplars.

Last update: Aug 05, 2022

Overview

Multiple Exemplar-based Deep Colorization (Pytorch Implementation)

Pretrained Model

[Jitendra Chautharia](IIT Jodhpur)^1,3,

Prerequisites

Python 3.6+
Nvidia GPU + CUDA, CuDNN

Installation

First use the following commands to prepare the environment:

conda create -n ColorVid python=3.6
source activate ColorVid
pip install -r requirements.txt

Then, download the pretrained models from this link, unzip the file and place the files into the corresponding folders:

video_moredata_l1 under the checkpoints folder
vgg19_conv.pth and vgg19_gray.pth under the data folder

Data Preparation

In order to colorize your own video, it requires to extract the video frames, and provide a reference image as an example.

Place your Target grayscale image into one folder, e.g., ./exp_sample/target
Place your reference images into another folder, e.g., ./exp_sample/references

If you want to automatically retrieve color images, you can try the retrieval algorithm from this link which will retrieve similar images from the ImageNet dataset. Or you can try this link on your own image database.

Test

python test.py --image-size [image-size] \
               --clip_path [path-to-target-grayscale-image] \
               --ref_path [path-to-reference] \
               --output_path [path-to-output]

We provide several sample video clips with corresponding references. For example, one can colorize one sample legacy video using:

python test.py --clip_path ./exp_sample/target \
               --ref_path ./exp_sample/references \
               --output_path ./exp_sample/output

Note that we use 216*384 images for training, which has aspect ratio of 1:2. During inference, we scale the input to this size and then rescale the output back to the original size.

Train

We also provide training code for reference. The training can be started by running:

python --data_root [root of video samples] \
       --data_root_imagenet [root of image samples] \
       --gpu_ids [gpu ids] \

We do not provide the full video dataset due to the copyright issue. For image samples, we retrieve semantically similar images from ImageNet using this repository. Still, one can refer to our code to understand the detailed procedure of augmenting the image dataset to mimic the video frames.

This project helps to colorize grayscale images using multiple exemplars.

Related tags

Overview

Multiple Exemplar-based Deep Colorization (Pytorch Implementation)

Prerequisites

Installation

Data Preparation

Test

Train

Comparison with State-of-the-Arts

Owner

jitendra chautharia

Unsupervised Representation Learning by Invariance Propagation

A library for low-memory inferencing in PyTorch.

Code for CMaskTrack R-CNN (proposed in Occluded Video Instance Segmentation)

Paper: De-rendering Stylized Texts

FaceVerse: a Fine-grained and Detail-controllable 3D Face Morphable Model from a Hybrid Dataset (CVPR2022)

Framework for abstracting Amiga debuggers and access to AmigaOS libraries and devices.

Python scripts to detect faces in Python with the BlazeFace Tensorflow Lite models

Manifold-Mixup implementation for fastai V2

Implementation of Bottleneck Transformer in Pytorch

Code for paper "Extract, Denoise and Enforce: Evaluating and Improving Concept Preservation for Text-to-Text Generation" EMNLP 2021

Surrogate- and Invariance-Boosted Contrastive Learning (SIB-CL)

Code of the paper "Deep Human Dynamics Prior" in ACM MM 2021.

Aerial Imagery dataset for fire detection: classification and segmentation (Unmanned Aerial Vehicle (UAV))

Cross-Task Consistency Learning Framework for Multi-Task Learning

Tensorflow 2 implementation of the paper: Learning and Evaluating Representations for Deep One-class Classification published at ICLR 2021

The ICS Chat System project for NYU Shanghai Fall 2021

PyTorch Implement of Context Encoders: Feature Learning by Inpainting

GMFlow: Learning Optical Flow via Global Matching

Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

Extracts essential Mediapipe face landmarks and arranges them in a sequenced order.