Official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution"

Last update: Dec 28, 2022

RealBasicVSR

This is the official repository of "Investigating Tradeoffs in Real-World Video Super-Resolution" (arXiv). It contains the code, a Colab notebook, and video demos of our work.

Authors: Kelvin C.K. Chan, Shangchen Zhou, Xiangyu Xu, Chen Change Loy, Nanyang Technological University

Acknowledgement: Our work is built upon MMEditing. The code will also appear in MMEditing soon. Please follow and star this repository and MMEditing!

News

29 Nov 2021: Test code released
25 Nov 2021: Repository initialized with video demos

Video Demos

The videos have been compressed, so the results shown here are inferior to the actual outputs.

output.mp4

Code

Installation

Install PyTorch and torchvision following the official instructions, e.g.,

conda install pytorch==1.7.1 torchvision==0.8.2 torchaudio==0.7.2 cudatoolkit=10.1 -c pytorch

Install mim and mmcv-full

pip install openmim
mim install mmcv-full

Install mmedit

pip install mmedit
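
Before running inference, it can help to confirm that the packages installed above are visible to the Python interpreter. The following is a minimal sketch, not part of this repository: the helper name `missing_modules` is hypothetical, and the import names (in particular `mmcv` for the `mmcv-full` package) are assumptions about how the installed packages are imported.

```python
from importlib.util import find_spec

# Import names assumed for the packages installed above
# (the mmcv-full distribution is assumed to be imported as "mmcv").
REQUIRED_MODULES = ("torch", "torchvision", "mmcv", "mmedit")


def missing_modules(names=REQUIRED_MODULES):
    """Return the top-level module names that cannot be found in this environment."""
    return [name for name in names if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are importable.")
```

This only checks that the modules can be located; it does not verify version compatibility between PyTorch, mmcv-full, and mmedit.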

Inference

Download the pre-trained weights to checkpoints/. (Dropbox / Google Drive)
Run the following command:

python inference_realbasicvsr.py ${CONFIG_FILE} ${CHECKPOINT_FILE} ${INPUT_DIR} ${OUTPUT_DIR} --max-seq-len=${MAX_SEQ_LEN} --is_save_as_png=${IS_SAVE_AS_PNG} --fps=${FPS}

This script supports both images and videos as inputs and outputs. To use videos, set ${INPUT_DIR} and ${OUTPUT_DIR} to the paths of the video files. Note that saving to video may introduce additional compression, which reduces output quality.

For example:

Images as inputs and outputs

python inference_realbasicvsr.py configs/realbasicvsr_x4.py checkpoints/RealBasicVSR_x4.pth data/demo_000 results/demo_000

Video as input and output

python inference_realbasicvsr.py configs/realbasicvsr_x4.py checkpoints/RealBasicVSR_x4.pth data/demo_001.mp4 results/demo_001.mp4 --fps=12.5
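
When processing several clips, the invocation pattern above can be assembled from Python instead of typed by hand. The sketch below is an illustration under stated assumptions rather than a utility shipped with this repository: `build_inference_command` and `run_inference` are hypothetical helper names, and only the positional arguments and the three flags shown in the command template are used.

```python
import subprocess
import sys


def build_inference_command(config, checkpoint, input_path, output_path,
                            max_seq_len=None, is_save_as_png=None, fps=None):
    """Build the argument list for inference_realbasicvsr.py (hypothetical helper).

    Optional flags are appended only when a value is given, so the script's
    own defaults apply otherwise.
    """
    cmd = [sys.executable, "inference_realbasicvsr.py",
           config, checkpoint, input_path, output_path]
    if max_seq_len is not None:
        cmd.append(f"--max-seq-len={max_seq_len}")
    if is_save_as_png is not None:
        cmd.append(f"--is_save_as_png={is_save_as_png}")
    if fps is not None:
        cmd.append(f"--fps={fps}")
    return cmd


def run_inference(*args, **kwargs):
    """Run the inference script and raise if it exits with a non-zero status."""
    subprocess.run(build_inference_command(*args, **kwargs), check=True)


if __name__ == "__main__":
    # Mirrors the video example above; prints the command without executing it.
    cmd = build_inference_command(
        "configs/realbasicvsr_x4.py",
        "checkpoints/RealBasicVSR_x4.pth",
        "data/demo_001.mp4",
        "results/demo_001.mp4",
        fps=12.5,
    )
    print(" ".join(cmd))
```

Passing the arguments as a list avoids shell quoting problems with paths that contain spaces. Call `run_inference` with the same arguments to actually launch the script from the repository root.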

Training

Coming soon.

VideoLQ Dataset

You can download the dataset using Dropbox or Google Drive.

Citations

@article{chan2021investigating,
  author = {Chan, Kelvin C.K. and Zhou, Shangchen and Xu, Xiangyu and Loy, Chen Change},
  title = {Investigating Tradeoffs in Real-World Video Super-Resolution},
  journal = {arXiv preprint arXiv:2111.12704},
  year = {2021}
}
