Implementation of SwiftNet: Real-time Video Object Segmentation

Last update: Dec 14, 2022

SwiftNet

The official PyTorch implementation of SwiftNet: Real-time Video Object Segmentation, which was accepted at CVPR 2021.

Requirements

Python >= 3.6
PyTorch 1.5
NumPy
Pillow
opencv-python
scipy
tqdm

Training

The SwiftNet training pipeline is similar to that of STM, which can be found in our reproduced STM training code.

Inference

Usage

python eval.py -g 0 -y 17 -s val -D 'path to davis'

Performance

Performance on the DAVIS-17 val set.

backbone	J&F	J	F	FPS	weights
resnet-18	77.6	75.5	79.7	65	`link`
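
For readers unfamiliar with the columns: on the DAVIS benchmark, J is the region similarity (intersection over union of the predicted and ground-truth masks), F is the boundary accuracy, and J&F is the mean of the two. The sketch below is an illustration only, not the evaluation code used by this repository. The helper names `jaccard` and `j_and_f` are hypothetical, and the treatment of two empty masks is an assumed convention.

```python
import numpy as np


def jaccard(pred, gt):
    """Region similarity J: intersection over union of two binary masks."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0
    return float(np.logical_and(pred, gt).sum() / union)


def j_and_f(j_mean, f_mean):
    """J&F is the arithmetic mean of the J and F scores."""
    return (j_mean + f_mean) / 2.0


# Toy masks: 2 overlapping pixels, 4 pixels in the union.
pred = np.array([[1, 1, 0], [0, 1, 0]])
gt = np.array([[1, 0, 0], [0, 1, 1]])
print(jaccard(pred, gt))  # 0.5

# Consistency check against the resnet-18 row of the table.
print(round(j_and_f(75.5, 79.7), 1))  # 77.6
```

The last line confirms that the reported J&F of 77.6 is the average of the J (75.5) and F (79.7) columns.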

Note: FPS is measured on a single P100 GPU and excludes the time spent on image loading and evaluation.
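
A minimal sketch of how such a measurement can be set up, timing only the per-frame processing step on frames that are already in memory. This is not the authors' benchmarking script. `measure_fps` and `step_fn` are hypothetical names, and the toy workload stands in for a real segmentation model.

```python
import time


def measure_fps(step_fn, frames, warmup=2):
    """Return frames per second for step_fn over preloaded frames.

    Loading happens before this function is called, so it is not timed.
    """
    # Warm-up calls so one-off setup costs do not distort the timing.
    for frame in frames[:warmup]:
        step_fn(frame)

    start = time.perf_counter()
    for frame in frames:
        step_fn(frame)
    # For a GPU model, synchronize the device here (for example with
    # torch.cuda.synchronize()) before reading the clock.
    elapsed = time.perf_counter() - start
    return len(frames) / elapsed


# Stand-in data and workload (assumptions for illustration only).
frames = [[float(i)] * 1000 for i in range(50)]
fps = measure_fps(lambda frame: sum(frame), frames)
print(f"{fps:.1f} frames per second")
```

Keeping the frames preloaded and leaving metric computation outside the timed loop mirrors the exclusions stated in the note above.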

Acknowledgement

This repository is partially based on the official STM repository.

Citation

If you find this repository helpful and want to cite SwiftNet in your own projects, please use the following citation info.

@inproceedings{wang2021swiftnet,
  title={SwiftNet: Real-time Video Object Segmentation},
  author={Wang, Haochen and Jiang, Xiaolong and Ren, Haibing and Hu, Yao and Bai, Song},
  booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition},
  pages={1296--1305},
  year={2021}
}

Owner

Haochen Wang
