Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".

Last update: Oct 24, 2022

Overview

Video Class Agnostic Segmentation

[Method Paper] [Benchmark Paper] [Project] [Demo]

Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation Benchmark in Autonomous Driving" in Workshop on Autonomous Driving, CVPR 2021.

Installation

This repo is tested under Python 3.6, PyTorch 1.4

Download Required Packages

pip install -r requirements.txt
pip install "git+https://github.com/cocodataset/panopticapi.git"

Setup mmdet

python setup.py develop

Motion Segmentation Track

Dataset Preparation

Follow Dataset Preparation Instructions.

Inference

Download Trained Weights on Ego Flow Suppressed, trained on Cityscapes and KITTI-MOTS
Modify Configs according to dataset path + Image/Annotation/Flow prefix

configs/data/kittimots_motion_supp.py
configs/data/cscapesvps_motion_supp.py

Evaluate CAQ,

python tools/test_eval_caq.py CONFIG_FILE WEIGHTS_FILE

CONFIG_FILE: configs/infer_kittimots.py or configs/infer_cscapesvps.py

Qualitative Results

python tools/test_vis.py CONFIG_FILE WEIGHTS_FILE --vis_unknown --save_dir OUTS_DIR

Evaluate Image Panoptic Quality, Note: evaluated on 1024x2048 Images

python tools/test_eval_ipq.py configs/infer_cscapesvps_pq.py WEIGHTS_FILE --out PKL_FILE

Training

Coming Soon ...

Open-set Segmentation Track

Coming soon ...

Acknowledgements

Dataset and Repository relied on these sources:

Voigtlaender, Paul, et al. "Mots: Multi-object tracking and segmentation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2019.
Kim, Dahun, et al. "Video panoptic segmentation." Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2020.
Wang, Xinlong, et al. "Solo: Segmenting objects by locations." European Conference on Computer Vision. Springer, Cham, 2020.
This Repository built upon SOLO Code

Citation

@article{siam2021video,
      title={Video Class Agnostic Segmentation Benchmark for Autonomous Driving}, 
      author={Mennatullah Siam and Alex Kendall and Martin Jagersand},
      year={2021},
      eprint={2103.11015},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

Contact

If you have any questions regarding the dataset or repository, please contact [email protected].

Official Datasets and Implementation from our Paper "Video Class Agnostic Segmentation in Autonomous Driving".

Related tags

Overview

Video Class Agnostic Segmentation

Installation

Motion Segmentation Track

Dataset Preparation

Inference

Training

Open-set Segmentation Track

Acknowledgements

Citation

Contact

Owner

Mennatullah Siam

Code for the paper "VisualBERT: A Simple and Performant Baseline for Vision and Language"

RefineNet: Multi-Path Refinement Networks for High-Resolution Semantic Segmentation

A modular, research-friendly framework for high-performance and inference of sequence models at many scales

Advancing mathematics by guiding human intuition with AI

Prompts - Read a textfile of prompts and import into anki via ankiconnect

Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2020)`

Reverse engineering recurrent neural networks with Jacobian switching linear dynamical systems

The Habitat-Matterport 3D Research Dataset - the largest-ever dataset of 3D indoor spaces.

A demonstration of using a live Tensorflow session to create an interactive face-GAN explorer.

arxiv-sanity, but very lite, simply providing the core value proposition of the ability to tag arxiv papers of interest and have the program recommend similar papers.

Finite difference solution of 2D Poisson equation. Can handle Dirichlet, Neumann and mixed boundary conditions.

Official Repsoitory for "Mish: A Self Regularized Non-Monotonic Neural Activation Function" [BMVC 2020]

Memory efficient transducer loss computation

Training RNNs as Fast as CNNs

LIVECell - A large-scale dataset for label-free live cell segmentation

Code for pre-training CharacterBERT models (as well as BERT models).

EDPN: Enhanced Deep Pyramid Network for Blurry Image Restoration

EASY - Ensemble Augmented-Shot Y-shaped Learning: State-Of-The-Art Few-Shot Classification with Simple Ingredients.

A quick recipe to learn all about Transformers

Semantic Image Synthesis with SPADE