RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Last update: Dec 09, 2022

YouTube | BiliBili

16X interpolation results from two input images:

Introduction

This project is an official MegEngine implementation of RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation. For the PyTorch implementation, please refer to this repo. Currently, our model runs at 30+ FPS for 2X 720p interpolation on a 2080Ti GPU. It supports arbitrary-timestep interpolation between a pair of images.
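As a rough illustration of what "arbitrary-timestep" means, the sketch below lists the timesteps in (0, 1) at which intermediate frames would be requested for a given interpolation factor, including factors that are not powers of two. The helper name `uniform_timesteps` is hypothetical and is not part of this repository; it only computes the timesteps and does not call the model.

```python
from fractions import Fraction
from typing import List


def uniform_timesteps(factor: int) -> List[Fraction]:
    """Evenly spaced timesteps strictly between 0 and 1 for `factor`X interpolation.

    Hypothetical helper for illustration only; t=0 and t=1 are the two input images.
    """
    if factor < 2:
        raise ValueError("factor must be at least 2")
    return [Fraction(i, factor) for i in range(1, factor)]


# 3X interpolation needs two intermediate frames, at t = 1/3 and t = 2/3.
for t in uniform_timesteps(3):
    print(t)
```

Exact fractions are used instead of floats so that the spacing stays exact for any factor.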

CLI Usage

Installation

git clone git@github.com:MegEngine/arXiv2020-RIFE
cd arXiv2020-RIFE
pip3 install -r requirements.txt

Download the pretrained HD models from here.
Unzip and move the pretrained parameters to train_log/*.
This model is not the one reported in our paper; for the paper model, please refer to the Evaluation section.

Run

Image Interpolation

python3 inference_img.py --img img0.png img1.png --exp=4

With --exp=4, this produces 2^4 = 16X interpolation results. After that, you can use the PNG files to generate an mp4:

ffmpeg -r 10 -f image2 -i output/img%d.png -s 448x256 -c:v libx264 -pix_fmt yuv420p output/slomo.mp4 -q:v 0 -q:a 0

You can also use the PNG files to generate a GIF:

ffmpeg -r 10 -f image2 -i output/img%d.png -s 448x256 -vf "split[s0][s1];[s0]palettegen=stats_mode=single[p];[s1][p]paletteuse=new=1" output/slomo.gif
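The relation between --exp and the interpolation factor (2^4 = 16X) can be sketched as repeated midpoint insertion. This is an assumption about how the factor arises, not a copy of inference_img.py: each round inserts one new frame between every pair of neighbouring frames, so the number of intervals doubles. The function name `bisect_timesteps` is hypothetical, and the sketch also assumes that both input images are kept in the output sequence.

```python
from fractions import Fraction


def bisect_timesteps(exp):
    """Timesteps of all frames after `exp` rounds of midpoint insertion.

    Hypothetical helper: starts from the two input images at t=0 and t=1.
    """
    times = [Fraction(0), Fraction(1)]
    for _ in range(exp):
        refined = []
        for left, right in zip(times, times[1:]):
            # Keep the left frame and add the midpoint frame after it.
            refined += [left, (left + right) / 2]
        refined.append(times[-1])  # keep the last input image
        times = refined
    return times


frames = bisect_timesteps(4)
print(len(frames) - 1)  # intervals between frames: 16, i.e. 16X
print(len(frames))      # total frames including both inputs: 17
```

Under these assumptions, the 17 frames played back at the 10 frames per second set by -r 10 in the ffmpeg commands above would give a clip of about 1.7 seconds.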

Evaluation

Download the RIFE model or the RIFE_m model reported in our paper.

MiddleBury: Download the MiddleBury OTHER dataset to ./other-data and ./other-gt-interp.

HD: Download the HD dataset to ./HD_dataset. We also provide a Google Drive download link.

We provide code for evaluating on the datasets above; run the following commands:

python3 benchmark/HD_multi_4X.py
python3 benchmark/HD.py
python3 benchmark/MiddleBury_Other.py
python3 benchmark/yuv_frame_io.py
python3 testtime.py

Training and Reproduction

Download the Vimeo90K dataset.

We use 16 CPUs, 4 GPUs and 20 GB of memory for training:

python3 train.py --arbitrary=False

Citation

@article{huang2020rife,
  title={RIFE: Real-Time Intermediate Flow Estimation for Video Frame Interpolation},
  author={Huang, Zhewei and Zhang, Tianyuan and Heng, Wen and Shi, Boxin and Zhou, Shuchang},
  journal={arXiv preprint arXiv:2011.06294},
  year={2020}
}

Reference

Optical Flow: ARFlow pytorch-liteflownet RAFT pytorch-PWCNet

Video Interpolation: DVF TOflow SepConv DAIN CAIN MEMC-Net SoftSplat BMBC EDSC
