This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Last update: Dec 01, 2022

Related tags

Deep Learning contrastive_association

Overview

Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans.

The approach builds on top of an arbitrary single-scan Panoptic Segmentation network and extends it to the temporal domain by associating instances across time using our Contrastive Aggregation network that leverages the point-wise features from the panoptic network.

Requirements

Install this package: go to the root directory of this repo and run:

pip3 install -U -e .

Install packages in requirements.txt.
Install MinkowskiEngine.
Install spconv version 1.2.1.

Data preparation

Download the SemanticKITTI dataset inside the directory data/kitti/. The directory structure should look like this:

./
└── data/
    └── kitti
        └── sequences
            ├── 00/           
            │   ├── velodyne/	
            |   |	├── 000000.bin
            |   |	├── 000001.bin
            |   |	└── ...
            │   └── labels/ 
            |       ├── 000000.label
            |       ├── 000001.label
            |       └── ...
            ├── 08/ # for validation
            ├── 11/ # 11-21 for testing
            └── 21/
                └── ...

Pretrained models

Pretrained Panoptic Segmentation model.
Pretrained Contrastive Aggregation model.

Reproducing the results

Run the evaluation script, which will compute the metrics for the validation set:

python evaluate_4dpanoptic.py --ckpt_ps path/to/panoptic_weights --ckpt_ag path/to/aggregation_weights

Training

Create instances dataset

Since we use a frozen Panoptic Segmentation Network, to avoid running the forward pass during training, we save the instance predictions and the point features in advance running:

python save_panoptic_features.py --ckpt path/to/panoptic_weights

This will create a directory in cont_assoc/data/instance_features with the same structure as Kitti but containing, for each sequence of the train set, npy files containing the instance points, labels and features for each scan.

Save validation predictions

To get the 4D Panoptic Segmentation performance for the validation step during training, we save the full predictions for the validation set (sequence 08) running:

python save_panoptic_features.py --ckpt path/to/panoptic_weights --save_val_pred

This will create a directory in cont_assoc/data/validation_predictions with npy files for each scan of the validation sequence containing the semantic and instance predictions for each point.

Train Contrastive Aggregation Network

Once the instance dataset and the validation predictions are generated, we're ready to train the Contrastive Aggregation Network running:

python train_aggregation.py

All the configurations are in the config/contrastive_instances.yaml file.

Citation

If you use this repo, please cite as :

@article{marcuzzi2022ral,
  author = {Rodrigo Marcuzzi and Lucas Nunes and Louis Wiesmann and Ignacio Vizzo and Jens Behley and Cyrill Stachniss},
  title = {{Contrastive Instance Association for 4D Panoptic Segmentation \\ using Sequences of 3D LiDAR Scans}},
  journal = {IEEE Robotics and Automation Letters (RA-L)},
  year = 2022,
  volume={7},
  number={2},
  pages={1550-1557},
}

Acknowledgments

The Panoptic Segmentation Network used in this repo is DS-Net.

The loss function it's a modified version of SupContrast.

License

This project is free software made available under the MIT License. For details see the LICENSE file.

This repository contains the implementation of the paper Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Related tags

Overview

Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans

Requirements

Data preparation

Pretrained models

Reproducing the results

Training

Create instances dataset

Save validation predictions

Train Contrastive Aggregation Network

Citation

Acknowledgments

License

Owner

Photogrammetry & Robotics Bonn

PyTorch implementation of: Michieli U. and Zanuttigh P., "Continual Semantic Segmentation via Repulsion-Attraction of Sparse and Disentangled Latent Representations", CVPR 2021.

PyTorch Implement of Context Encoders: Feature Learning by Inpainting

This repo contains the code required to train the multivariate time-series Transformer.

Memory Defense: More Robust Classificationvia a Memory-Masking Autoencoder

Code for the paper titled "Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages"

Pytorch implementation of NeurIPS 2021 paper: Geometry Processing with Neural Fields.

This is the repo for our work "Towards Persona-Based Empathetic Conversational Models" (EMNLP 2020)

Revisiting Self-Training for Few-Shot Learning of Language Model.

Spherical Confidence Learning for Face Recognition, accepted to CVPR2021.

Code for CPM-2 Pre-Train

PyTorch implementation of DeepDream algorithm

Reinforcement Learning for finance

Code release for the paper “Worldsheet Wrapping the World in a 3D Sheet for View Synthesis from a Single Image”, ICCV 2021.

Real-Time SLAM for Monocular, Stereo and RGB-D Cameras, with Loop Detection and Relocalization Capabilities

A pytorch &keras implementation and demo of Fastformer.

A set of simple scripts to process the Imagenet-1K dataset as TFRecords and make index files for NVIDIA DALI.

Code release for DS-NeRF (Depth-supervised Neural Radiance Fields)

Gif-caption - A straightforward GIF Captioner written in Python

A python-image-classification web application project, written in Python and served through the Flask Microframework

Official Pytorch implementation of MixMo framework