A PyTorch implementation of the CVPR 2021 paper "VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild"

Last update: Nov 29, 2022

VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild

Preparation

Download VSPW dataset

The VSPW dataset, with extracted frames and masks, is available here. The VSPW_480P version can now be downloaded directly.

Dependencies

Python 3.7
PyTorch 1.3.1
NumPy

Download the ImageNet-pretrained models at this link. Put the downloaded archive in the repository root folder and decompress it there.

Train and Test

Resize the frames and masks of the VSPW dataset to 480p. This step should not be needed if you downloaded VSPW_480P directly.

python change2_480p.py
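The resizing step can be illustrated with a small sketch. This is not the code in change2_480p.py. It assumes that "480p" means scaling the shorter image side to 480 pixels while keeping the aspect ratio, and the function names target_size_480p and resize_mask_nearest are hypothetical. Masks store class ids, so the sketch uses nearest-neighbour sampling to avoid blending two labels into a value that is not a valid class.

```python
from typing import List, Tuple


def target_size_480p(width: int, height: int, short_side: int = 480) -> Tuple[int, int]:
    """Return (new_width, new_height) with the shorter side scaled to `short_side`.

    Assumption: "480p" keeps the aspect ratio and fixes the shorter side.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")
    scale = short_side / min(width, height)
    return max(1, round(width * scale)), max(1, round(height * scale))


def resize_mask_nearest(mask: List[List[int]], new_w: int, new_h: int) -> List[List[int]]:
    """Nearest-neighbour resize of a 2-D grid of class ids.

    Every output pixel copies one input pixel, so no new label values appear.
    """
    old_h, old_w = len(mask), len(mask[0])
    return [
        [mask[y * old_h // new_h][x * old_w // new_w] for x in range(new_w)]
        for y in range(new_h)
    ]


if __name__ == "__main__":
    print(target_size_480p(1920, 1080))
    print(resize_mask_nearest([[1, 2], [3, 4]], 4, 4))
```

RGB frames can be resized with a smoother filter such as bilinear interpolation, but the same filter applied to masks would corrupt the labels along object boundaries.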

Edit the .sh files in scripts/ and set $DATAROOT to the path of your VSPW_480p directory.
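If there are several scripts to edit, the change can be automated. The sketch below assumes each script defines the variable on its own line in the form DATAROOT=... (optionally prefixed with export); the real scripts may be laid out differently, so check one by hand first. The helper name set_dataroot is hypothetical.

```python
import re

# Matches a whole line of the form `DATAROOT=...` or `export DATAROOT=...`.
# Assumption: the scripts assign the variable this way; adjust if they do not.
_DATAROOT_LINE = re.compile(r"^([ \t]*(?:export[ \t]+)?DATAROOT=).*$", flags=re.MULTILINE)


def set_dataroot(script_text: str, new_root: str) -> str:
    """Return `script_text` with every DATAROOT assignment pointing at `new_root`."""
    # A function replacement avoids backslash escapes in `new_root` being interpreted.
    return _DATAROOT_LINE.sub(lambda m: m.group(1) + '"' + new_root + '"', script_text)


if __name__ == "__main__":
    example = "DATAROOT=/old/path\necho $DATAROOT\n"
    print(set_dataroot(example, "/data/VSPW_480p"))
```

To apply it, read each .sh file in scripts/, pass its text through set_dataroot, and write the result back. Uses of $DATAROOT elsewhere in a script are left untouched because only assignment lines match.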

Image-based methods

PSPNet

sh scripts/run_psp.sh

OCRNet

sh scripts/run_ocr.sh

Video-based methods

TCB-PSP

sh run_temporal_psp.sh

TCB-OCR

sh run_temporal_ocr.sh

Evaluation on TC and VC

Set the data root and the prediction root in TC_cal.py and VC_perclip.py, then run both scripts to compute temporal consistency (TC) and video consistency (VC).

python TC_cal.py

python VC_perclip.py
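To give an intuition for the VC metric, here is a minimal sketch of one reading of it. It is not the code in VC_perclip.py. It assumes that VC over windows of n consecutive frames is the share of pixels whose ground-truth label stays constant across the window and whose predicted label also stays constant, averaged over all windows in a clip. It does not check that the prediction matches the ground truth. Frames are represented as flat lists of class ids, and the function name video_consistency is hypothetical. TC additionally needs optical flow (the repository uses RAFT) and is not sketched here.

```python
from typing import Sequence


def video_consistency(gt: Sequence[Sequence[int]],
                      pred: Sequence[Sequence[int]],
                      n: int) -> float:
    """Average window-level consistency for one clip (an assumed reading of VC_n).

    gt, pred: one flat sequence of per-pixel class ids per frame.
    n: number of consecutive frames in each window.
    """
    if len(gt) != len(pred):
        raise ValueError("gt and pred must contain the same number of frames")
    if n < 2 or n > len(gt):
        raise ValueError("n must be between 2 and the clip length")

    scores = []
    for start in range(len(gt) - n + 1):
        gt_win = gt[start:start + n]
        pred_win = pred[start:start + n]
        stable_gt = 0    # pixels whose ground-truth label is constant in the window
        stable_both = 0  # of those, pixels whose predicted label is constant too
        for p in range(len(gt_win[0])):
            if all(frame[p] == gt_win[0][p] for frame in gt_win):
                stable_gt += 1
                if all(frame[p] == pred_win[0][p] for frame in pred_win):
                    stable_both += 1
        if stable_gt:  # skip windows with no stable ground-truth pixels
            scores.append(stable_both / stable_gt)

    if not scores:
        raise ValueError("no window contains a pixel with a constant ground-truth label")
    return sum(scores) / len(scores)


if __name__ == "__main__":
    gt = [[0, 0, 1, 1], [0, 0, 1, 2], [0, 0, 1, 2]]
    pred = [[0, 1, 1, 1], [0, 0, 1, 1], [0, 0, 1, 1]]
    print(video_consistency(gt, pred, n=2))
```

A prediction that flickers between classes on pixels where the ground truth is stable lowers the score, which is the kind of temporal instability that per-frame mIoU does not capture.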

This implementation builds on this code and on RAFT.

Citation

@inproceedings{miao2021vspw,
  title={VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild},
  author={Miao, Jiaxu and Wei, Yunchao and Wu, Yu and Liang, Chen and Li, Guangrui and Yang, Yi},
  booktitle={Proceedings of the {IEEE} Conference on Computer Vision and Pattern Recognition},
  year={2021}
}
