Official PyTorch implementation of the ICCV 2021 paper FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Last update: Dec 27, 2022


Overview

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

By Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li.

This repo is the official PyTorch implementation of FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Introduction

Usage

Prerequisites

Python >= 3.6
PyTorch >= 1.0 and the corresponding torchvision (https://pytorch.org/)
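
The version requirements above can be checked before installing anything else. The following is a minimal sketch, not part of the repository: the helper names parse_version and check_prerequisites are hypothetical, and the check only compares version numbers, it does not verify that torchvision matches the installed PyTorch build.

```python
import importlib.util
import re
import sys


def parse_version(text):
    """Turn a version string such as '1.10.0+cu113' into a tuple like (1, 10, 0)."""
    # Drop any local build suffix after '+', then keep the first three numbers.
    numbers = re.findall(r"\d+", text.split("+")[0])
    return tuple(int(n) for n in numbers[:3])


def check_prerequisites(min_python=(3, 6), min_torch=(1, 0)):
    """Return a list of problems; an empty list means every check passed."""
    problems = []
    if sys.version_info[:2] < min_python:
        problems.append("Python %d.%d or newer is required" % min_python)
    if importlib.util.find_spec("torch") is None:
        problems.append("PyTorch is not installed")
    else:
        import torch
        if parse_version(torch.__version__) < min_torch:
            problems.append("PyTorch %d.%d or newer is required" % min_torch)
    if importlib.util.find_spec("torchvision") is None:
        problems.append("torchvision is not installed")
    return problems


if __name__ == "__main__":
    issues = check_prerequisites()
    print("OK" if not issues else "\n".join(issues))
```

Running the file directly prints OK when nothing is missing, or one line per problem otherwise.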

Install

Clone this repo:

git clone https://github.com/ruiliu-ai/FuseFormer.git

Install other packages:

cd FuseFormer
pip install -r requirements.txt

Training

Dataset preparation

Create the data folder and download the datasets (YouTube-VOS and DAVIS) into it.

mkdir data

Training script

python train.py -c configs/youtube-vos.json
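
Before starting a long training run, it can help to look at what the config file passed with -c actually contains. The sketch below is an illustration only: summarize_config is a hypothetical helper, and it assumes the config is a JSON object at the top level. It makes no assumption about which keys the file defines.

```python
import json


def summarize_config(path):
    """Map each top-level key of a JSON config file to the type name of its value."""
    with open(path, "r") as handle:
        config = json.load(handle)
    # Assumption: the top level of the file is a JSON object (a dict in Python).
    return {key: type(value).__name__ for key, value in config.items()}
```

For example, calling summarize_config("configs/youtube-vos.json") from the repository root returns a dictionary that lists every top-level setting alongside its type.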

Test

Create the checkpoints folder and download the pre-trained model into it.

mkdir checkpoints

Test script

python test.py -c checkpoints/fuseformer.pth -v data/DAVIS/JPEGImages/blackswan -m data/DAVIS/Annotations/blackswan
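
The test command takes a folder of video frames (-v) and a folder of masks (-m). A quick sanity check on the two folders can catch a wrong path before the model is loaded. This is a minimal sketch under stated assumptions: the helper names list_images and check_inputs are hypothetical, and it assumes one mask image per frame, paired by sorted file name, with .jpg, .jpeg, or .png extensions. The repository's test.py may handle inputs differently.

```python
import os

# Assumption: frames and masks are stored as ordinary image files.
IMAGE_EXTENSIONS = (".jpg", ".jpeg", ".png")


def list_images(folder):
    """Return the sorted image file names found directly inside folder."""
    return sorted(
        name for name in os.listdir(folder)
        if name.lower().endswith(IMAGE_EXTENSIONS)
    )


def check_inputs(frame_dir, mask_dir):
    """Return the number of frames, or raise ValueError if the folders do not pair up."""
    frames = list_images(frame_dir)
    masks = list_images(mask_dir)
    if not frames:
        raise ValueError("no frames found in %s" % frame_dir)
    # Assumption: exactly one mask per frame.
    if len(frames) != len(masks):
        raise ValueError("%d frames but %d masks" % (len(frames), len(masks)))
    return len(frames)
```

With the paths from the command above, check_inputs("data/DAVIS/JPEGImages/blackswan", "data/DAVIS/Annotations/blackswan") returns the frame count when the two folders line up.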

Citing FuseFormer

If you find FuseFormer useful in your research, please consider citing:

@InProceedings{Liu_2021_FuseFormer,
  title={FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting},
  author={Liu, Rui and Deng, Hanming and Huang, Yangyi and Shi, Xiaoyu and Lu, Lewei and Sun, Wenxiu and Wang, Xiaogang and Dai, Jifeng and Li, Hongsheng},
  booktitle = {International Conference on Computer Vision (ICCV)},
  year={2021}
}

Acknowledgement

This code borrows heavily from the video inpainting framework Spatial-Temporal Transformer Network (STTN).

