an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch

Last update: Dec 22, 2022

Overview

revisiting-sepconv

This is a reference implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation [1] using PyTorch. Given two frames, it will make use of adaptive convolution [2] in a separable manner [3] to interpolate the intermediate frame. Should you be making use of our work, please cite our paper [1].

For the original SepConv, see: https://github.com/sniklaus/sepconv-slomo
For softmax splatting, please see: https://github.com/sniklaus/softmax-splatting

setup

The separable convolution layer is implemented in CUDA using CuPy, which is why CuPy is a required dependency. It can be installed using pip install cupy or alternatively using one of the provided binary packages as outlined in the CuPy repository.

If you plan to process videos, then please also make sure to have pip install moviepy installed.

usage

To run it on your own pair of frames, use the following command.

python run.py --model paper --one ./images/one.png --two ./images/two.png --out ./out.png

To run in on a video, use the following command.

python run.py --model paper --video ./videos/car-turn.mp4 --out ./out.mp4

For a quick benchmark using examples from the Middlebury benchmark for optical flow, run python benchmark.py. You can use it to easily verify that the provided implementation runs as expected.

video

license

Please refer to the appropriate file within this repository.

references

[1]  @inproceedings{Niklaus_WACV_2021,
         author = {Simon Niklaus and Long Mai and Oliver Wang},
         title = {Revisiting Adaptive Convolutions for Video Frame Interpolation},
         booktitle = {IEEE Winter Conference on Applications of Computer Vision},
         year = {2021}
     }

[2]  @inproceedings{Niklaus_ICCV_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Separable Convolution},
         booktitle = {IEEE International Conference on Computer Vision},
         year = {2017}
     }

[3]  @inproceedings{Niklaus_CVPR_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Convolution},
         booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
         year = {2017}
     }

an implementation of Revisiting Adaptive Convolutions for Video Frame Interpolation using PyTorch

Related tags

Overview

revisiting-sepconv

setup

usage

video

license

references

Owner

Simon Niklaus

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

OMLT: Optimization and Machine Learning Toolkit

[ECCV 2020] Gradient-Induced Co-Saliency Detection

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

This repo implements several applications of the proposed generalized Bures-Wasserstein (GBW) geometry on symmetric positive definite matrices.

MetaTTE: a Meta-Learning Based Travel Time Estimation Model for Multi-city Scenarios

Model Zoo of BDD100K Dataset

A library built upon PyTorch for building embeddings on discrete event sequences using self-supervision

Codebase for "ProtoAttend: Attention-Based Prototypical Learning."

[CVPR 2022] Official code for the paper: "A Stitch in Time Saves Nine: A Train-Time Regularizing Loss for Improved Neural Network Calibration"

Tackling Obstacle Tower Challenge using PPO & A2C combined with ICM.

[CVPR 2020] Interpreting the Latent Space of GANs for Semantic Face Editing

[CVPR 2021] VirTex: Learning Visual Representations from Textual Annotations

Explainability for Vision Transformers (in PyTorch)

🚩🚩🚩

Code of the paper "Shaping Visual Representations with Attributes for Few-Shot Learning (ASL)".

Code for KDD'20 "Generative Pre-Training of Graph Neural Networks"

Housing Price Prediction

code for CVPR paper Zero-shot Instance Segmentation

Codes for our paper The Stem Cell Hypothesis: Dilemma behind Multi-Task Learning with Transformer Encoders published to EMNLP 2021.