Learning to Segment Instances in Videos with Spatial Propagation Network

Last update: Sep 28, 2022

Overview

This paper is available at the 2017 DAVIS Challenge website.

Check our results in this video.

Contact: Jingchun Cheng (chengjingchun at gmail dot com)

Cite the Paper

If you find that our method is useful in your research, please cite:

@article{DAVIS2017-6th,
  author = {J. Cheng and S. Liu and Y.-H. Tsai and W.-C. Hung and S. Gupta and J. Gu and J. Kautz and S. Wang and M.-H. Yang}, 
  title = {Learning to Segment Instances in Videos with Spatial Propagation Network}, 
  journal = {The 2017 DAVIS Challenge on Video Object Segmentation - CVPR Workshops}, 
  year = {2017}
}

About the Code

The code released here covers two main parts of the paper: foreground segmentation and instance recognition.
It contains the parent network for foreground segmentation and the training code for the instance recognition networks.
The matlab_code folder contains a simple version of our CRAF step for segmentation refinement.

Requirements

Install Caffe and pycaffe, following the instructions at http://caffe.berkeleyvision.org/.
Download the DAVIS 2017 dataset and put it in the data folder.
Download the pre-trained foreground/background model here and put it in the pretrained folder.
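
Before training or testing, it can help to confirm that the downloads ended up where the commands in this README expect them. The sketch below is not part of the released code. It assumes the layout implied by the example commands (a data folder holding the DAVIS 2017 dataset and a pretrained folder holding PN_ResNetF.caffemodel), and the function name check_setup is hypothetical.

```python
import os


def check_setup(repo_root="."):
    """Return a list of problems found; an empty list means the layout looks fine.

    Assumed layout (inferred from the example commands, not verified against the code):
      <repo_root>/data/                             DAVIS 2017 dataset
      <repo_root>/pretrained/PN_ResNetF.caffemodel  pre-trained foreground/background model
    """
    problems = []

    # The dataset folder must exist and contain something.
    data_dir = os.path.join(repo_root, "data")
    if not os.path.isdir(data_dir):
        problems.append("missing folder: " + data_dir)
    elif not os.listdir(data_dir):
        problems.append("folder is empty: " + data_dir)

    # The model file name is taken from the training and testing examples.
    model = os.path.join(repo_root, "pretrained", "PN_ResNetF.caffemodel")
    if not os.path.isfile(model):
        problems.append("missing model file: " + model)

    return problems


if __name__ == "__main__":
    for line in check_setup() or ["setup looks complete"]:
        print(line)
```

Run it from the repository root. It only checks that the paths exist, not that the dataset or model contents are valid.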

Training

Train the per-object recognition model.
cd training
python solve.py PATH_OF_MODEL PATH_OF_SOLVER
For example, on the 'choreography' video for the 1st object, run:
python solve.py ../pretrained/PN_ResNetF.caffemodel ../ResNetF/testnet_per_obj/choreography/solver_1.prototxt
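
Since one model is trained per object, a video with several objects needs several calls to solve.py. The following is a minimal sketch of a wrapper that builds those calls. It assumes the solver files follow the pattern seen in the 'choreography' example (solver_<id>.prototxt inside a folder named after the video), which is a generalization from that single example. The helper names training_command and train_video are hypothetical, and the number of objects has to be supplied by the user.

```python
import subprocess


def training_command(video, object_id,
                     model="../pretrained/PN_ResNetF.caffemodel",
                     solver_root="../ResNetF/testnet_per_obj"):
    """Build the solve.py argument list for one object of one video.

    The solver path pattern <solver_root>/<video>/solver_<object_id>.prototxt
    is an assumption generalized from the single 'choreography' example.
    """
    solver = "{}/{}/solver_{}.prototxt".format(solver_root, video, object_id)
    return ["python", "solve.py", model, solver]


def train_video(video, num_objects, dry_run=True):
    """Train objects 1..num_objects in sequence; meant to be run from the training folder."""
    commands = [training_command(video, i) for i in range(1, num_objects + 1)]
    for cmd in commands:
        if dry_run:
            print(" ".join(cmd))        # only show what would be executed
        else:
            subprocess.check_call(cmd)  # raises CalledProcessError if training fails
    return commands
```

Calling train_video('choreography', 2) prints the two commands without running them. Pass dry_run=False to actually launch training.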

Testing

Test the general foreground/background model.
python infer_test_fgbg.py PATH_OF_MODEL PATH_OF_RESULT VIDEO_NAME
For example, on the 'lions' video, run:
python infer_test_fgbg.py pretrained/PN_ResNetF.caffemodel results/fgbg lions
Test the object instance model.
python infer_test_perobj.py MODEL_ITERATION VIDEO_NAME OBJECT_ID
For example, on the 'lions' video for the 2nd object, run:
python infer_test_perobj.py 3000 lions 2
Run example_CRAF.m in the matlab_code folder for a demo on CRAF segmentation refinement.

Download Our Segmentation Results on 2017 DAVIS Challenge

General foreground/background segmentation here
Instance-level object segmentation without refinement here
Final instance-level object segmentation with refinement here

Note

The model and code are available for non-commercial research purposes only.

09/2017: code and model released
03/2018: pre-trained model updated

Owner

Jingchun Cheng
