Real-Time High-Resolution Background Matting

Last update: Jan 03, 2023

Overview

Official repository for the paper Real-Time High-Resolution Background Matting. Our model requires capturing an additional background image and produces state-of-the-art matting results at 4K 30fps and HD 60fps on an NVIDIA RTX 2080 Ti GPU.

Disclaimer: The video conversion script in this repo is not meant to be real-time. Our research's main contributions are the neural architecture for high-resolution refinement and the new matting datasets. The inference_speed_test.py script allows you to measure the tensor throughput of our model, which should reach real-time rates. The inference_video.py script allows you to test your video on our model, but the video encoding and decoding is done without hardware acceleration or parallelization. For production use, you are expected to do additional engineering for hardware encoding/decoding and for loading frames to the GPU in parallel. For more architecture details, please refer to our paper.
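To make the difference between tensor throughput and end-to-end video speed concrete, here is a minimal timing sketch. It is not the repository's inference_speed_test.py; the names measure_fps, frame_budget_ms, step and sync are hypothetical and chosen only for illustration. It assumes the model's forward pass is wrapped in a zero-argument callable.

```python
import time


def measure_fps(step, warmup=10, iters=100, sync=None):
    """Return the average number of `step()` calls completed per second.

    `step` is any zero-argument callable (assumed here to wrap one forward
    pass on tensors that are already loaded). `sync` is an optional callable
    that blocks until queued work has finished; with PyTorch on CUDA,
    torch.cuda.synchronize is the usual choice, because GPU kernels are
    launched asynchronously.
    """
    for _ in range(warmup):          # warm-up runs are excluded from timing
        step()
    if sync is not None:
        sync()
    start = time.perf_counter()
    for _ in range(iters):
        step()
    if sync is not None:
        sync()                       # wait for the last call before stopping the clock
    elapsed = time.perf_counter() - start
    return iters / elapsed


def frame_budget_ms(target_fps):
    """Time available per frame, in milliseconds, at a target frame rate."""
    return 1000.0 / target_fps


if __name__ == "__main__":
    # Stand-in workload; replace with a call into the model.
    fps = measure_fps(lambda: sum(range(10_000)))
    print(f"measured: {fps:.1f} calls/s")
    print(f"budget at 30 fps: {frame_budget_ms(30):.1f} ms per frame")
    print(f"budget at 60 fps: {frame_budget_ms(60):.1f} ms per frame")
```

A measurement like this covers only the model's computation. Video decoding, encoding and host-to-GPU transfers are outside the timed loop, which is why a video script can run slower than the measured throughput suggests.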

New Paper is Out!

Check out Robust Video Matting! Our new method does not require a pre-captured background and runs inference even faster!

Updates

[Jun 21 2021] Paper received CVPR 2021 Best Student Paper Honorable Mention.
[Apr 21 2021] VideoMatte240K dataset is now published.
[Mar 06 2021] Training script is published.
[Feb 28 2021] Paper is accepted to CVPR 2021.
[Jan 09 2021] PhotoMatte85 dataset is now published.
[Dec 21 2020] We updated our project to MIT License, which permits commercial use.

Download

Model / Weights

Download model / weights

Video / Image Examples

HD videos (by Sengupta et al.) (Our model is more robust on HD footage)
4K videos and images

Datasets

Download datasets

Demo

Scripts

We provide several scripts in this repo for you to experiment with our model. More detailed instructions are included in the files.

inference_images.py: Perform matting on a directory of images.
inference_video.py: Perform matting on a video.
inference_webcam.py: An interactive matting demo using your webcam.
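As a rough illustration of what matting output is used for, the sketch below applies the standard alpha compositing equation, composite = alpha * foreground + (1 - alpha) * background, to place a matted subject on a new background. It assumes an alpha matte and a foreground image with values in [0, 1], stored as plain nested lists. The function names are hypothetical and are not part of the repository's scripts.

```python
def composite_pixel(alpha, fgr, bgr):
    """Blend one RGB pixel: alpha * foreground + (1 - alpha) * background."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return tuple(alpha * f + (1.0 - alpha) * b for f, b in zip(fgr, bgr))


def composite_image(pha, fgr, bgr):
    """Composite whole images given as nested lists.

    pha is H x W (alpha values); fgr and bgr are H x W x 3 (RGB values).
    """
    return [
        [composite_pixel(a, f, b) for a, f, b in zip(pha_row, fgr_row, bgr_row)]
        for pha_row, fgr_row, bgr_row in zip(pha, fgr, bgr)
    ]


if __name__ == "__main__":
    red, green = (1.0, 0.0, 0.0), (0.0, 1.0, 0.0)
    pha = [[1.0, 0.5, 0.0]]          # opaque, half-transparent, fully transparent
    fgr = [[red, red, red]]
    bgr = [[green, green, green]]
    print(composite_image(pha, fgr, bgr))
```

In practice the same arithmetic would run on GPU tensors rather than Python lists; the list version is only meant to make the per-pixel blend explicit.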

Notebooks

Additionally, you can try our notebooks in Google Colab for performing matting on images and videos.

Virtual Camera

We provide a demo application that pipes webcam video through our model and outputs to a virtual camera. The script works only on Linux and can be used in Zoom meetings. For more information, check out:

Webcam plugin

Usage / Documentation

You can run our model using PyTorch, TorchScript, TensorFlow, and ONNX. For details on using our model, please check out the Usage / Documentation page.

Training

Configure data_path.py to point to your dataset. The original paper uses train_base.py to train only the base model until convergence, then uses train_refine.py to train the entire network end-to-end. More details are given in the paper.
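Before starting a long training run, it can help to confirm that every configured dataset directory exists. The sketch below is one possible check. The dictionary layout, the dataset name my_dataset, the directory paths and the helper missing_dirs are all assumptions made for illustration; consult the repository's own path configuration file for the real structure.

```python
import os

# Hypothetical layout: dataset name -> split -> {"fgr": dir, "pha": dir}.
# The names and paths below are placeholders, not the repository's values.
DATA_PATH = {
    "my_dataset": {
        "train": {"fgr": "data/train/fgr", "pha": "data/train/pha"},
        "valid": {"fgr": "data/valid/fgr", "pha": "data/valid/pha"},
    },
}


def missing_dirs(config):
    """Return (dataset, split, kind, path) for every directory that does not exist."""
    missing = []
    for name, splits in config.items():
        for split, parts in splits.items():
            for kind, path in parts.items():
                if not os.path.isdir(path):
                    missing.append((name, split, kind, path))
    return missing


if __name__ == "__main__":
    for name, split, kind, path in missing_dirs(DATA_PATH):
        print(f"missing: {name}/{split}/{kind} -> {path}")
```

Failing early on a missing directory is cheaper than discovering it after the base-model stage has already started.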

Project members

Shanchuan Lin*, University of Washington
Andrey Ryabtsev*, University of Washington
Soumyadip Sengupta, University of Washington
Brian Curless, University of Washington
Steve Seitz, University of Washington
Ira Kemelmacher-Shlizerman, University of Washington

* Equal contribution.

License

This work is licensed under the MIT License. If you use our work in your project, we would love you to include an acknowledgement and fill out our survey.

Community Projects

Projects developed by third-party developers.

After Effects Plug-In

Owner

Peter Lin
