Official repository of "BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment"

Last update: Jan 01, 2023

Overview

BasicVSR_PlusPlus (CVPR 2022)

This is the official repository for BasicVSR++. Please feel free to raise issues related to BasicVSR++! If you are also interested in RealBasicVSR, which was also accepted to CVPR 2022, please don't hesitate to star it!

Authors: Kelvin C.K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Nanyang Technological University

Acknowledgement: Our work is built upon MMEditing. Please follow and star this repository and MMEditing!

News

2 Dec 2021: Colab demo released
18 Apr 2022: Code released. Also merged into MMEditing

TODO

Add data processing scripts
~~Add checkpoints for deblur and denoise~~
~~Add configs for deblur and denoise~~
~~Add Colab demo~~

Pre-trained Weights

You can find the pre-trained weights for deblurring and denoising at this link. For super-resolution and compressed video enhancement, please refer to MMEditing.

Installation

Install PyTorch
pip install openmim
mim install mmcv-full
git clone https://github.com/ckkelvinchan/BasicVSR_PlusPlus.git
cd BasicVSR_PlusPlus
pip install -v -e .
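
After the steps above, a quick sanity check can confirm that the main dependencies are importable. The following is a minimal sketch and not part of the repository: the helper name `missing_packages` is hypothetical, and it assumes the import names `torch` (PyTorch) and `mmcv` (the module installed by `mmcv-full`).

```python
import importlib.util


def missing_packages(names):
    """Return the top-level module names that cannot be located."""
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumed import names: `torch` for PyTorch, `mmcv` for mmcv-full.
    missing = missing_packages(["torch", "mmcv"])
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages were found.")
```

This only checks that the modules can be located; it does not verify that the installed versions are compatible with each other.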

Running Inference on a Video

Download pre-trained weights
python demo/restoration_video_demo.py ${CONFIG} ${CHKPT} ${IN_PATH} ${OUT_PATH}

For example, you can download the VSR checkpoint here to chkpts/basicvsr_plusplus_reds4.pth, then run

python demo/restoration_video_demo.py configs/basicvsr_plusplus_reds4.py chkpts/basicvsr_plusplus_reds4.pth data/demo_000 results/demo_000

You can also replace ${IN_PATH} and ${OUT_PATH} with video file paths (e.g., xxx/yyy.mp4) to read the input from, or write the output to, a video file.
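
To process several clips with the same model, the demo command can be assembled programmatically. This is a minimal sketch under stated assumptions: the helper names `build_demo_command` and `build_batch` are hypothetical, `data/demo_001` is an illustrative second clip that may not exist in your checkout, and the script is assumed to take exactly the four positional arguments shown above.

```python
import shlex
import sys


def build_demo_command(config, checkpoint, in_path, out_path):
    """Assemble the demo invocation with its four positional arguments."""
    return [
        sys.executable,
        "demo/restoration_video_demo.py",
        config,
        checkpoint,
        in_path,
        out_path,
    ]


def build_batch(config, checkpoint, clips, out_root="results"):
    """Build one command per input clip, writing to <out_root>/<clip name>."""
    commands = []
    for clip in clips:
        name = clip.rstrip("/").split("/")[-1]
        out_path = f"{out_root}/{name}"
        commands.append(build_demo_command(config, checkpoint, clip, out_path))
    return commands


if __name__ == "__main__":
    # data/demo_001 is a hypothetical second clip used only for illustration.
    batch = build_batch(
        "configs/basicvsr_plusplus_reds4.py",
        "chkpts/basicvsr_plusplus_reds4.pth",
        ["data/demo_000", "data/demo_001"],
    )
    for cmd in batch:
        # Print the command; to execute it from the repository root,
        # pass `cmd` to subprocess.run(cmd, check=True) instead.
        print(shlex.join(cmd))
```

The sketch only prints the commands, so it can be used to review the paths before anything is run.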

Training Models

Put the dataset in the designated locations specified in the configuration file.
sh tools/dist_train.sh ${CONFIG} ${NGPUS}
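
Before launching a long training run, it can help to confirm that the dataset folders referenced by the configuration file actually exist. The following is a rough sketch, not part of the repository: the helper names and the two dataset paths are hypothetical placeholders, and the real locations must be read from the configuration file you use.

```python
import os


def missing_dirs(paths):
    """Return the paths that are not existing directories."""
    return [path for path in paths if not os.path.isdir(path)]


def build_train_command(config, num_gpus):
    """Assemble the training launch command shown above."""
    return ["sh", "tools/dist_train.sh", config, str(num_gpus)]


if __name__ == "__main__":
    # Hypothetical dataset locations; replace them with the paths
    # specified in your configuration file.
    dataset_dirs = ["data/my_dataset/lq", "data/my_dataset/gt"]
    absent = missing_dirs(dataset_dirs)
    if absent:
        print("Missing dataset folders:", ", ".join(absent))
    else:
        cmd = build_train_command("configs/basicvsr_plusplus_reds4.py", 8)
        print("Ready to run:", " ".join(cmd))
```

The GPU count of 8 is only an example value for ${NGPUS}.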

Data Preprocessing

To be added...

Related Work

Our BasicVSR series:

More about deformable alignment:

Understanding Deformable Alignment in Video Super-Resolution, AAAI 2021

Citations

@inproceedings{chan2022basicvsrpp,
  author = {Chan, Kelvin C.K. and Zhou, Shangchen and Xu, Xiangyu and Loy, Chen Change},
  title = {{BasicVSR++}: Improving video super-resolution with enhanced propagation and alignment},
  booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
  year = {2022}
}

@article{chan2022generalization,
  title={On the Generalization of {BasicVSR++} to Video Deblurring and Denoising},
  author={Chan, Kelvin CK and Zhou, Shangchen and Xu, Xiangyu and Loy, Chen Change},
  journal={arXiv preprint arXiv:2204.05308},
  year={2022}
}
