Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

Last update: Jan 04, 2023

Overview

SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

Abstract

In this paper, we introduce SalsaNext for the uncertainty-aware semantic segmentation of a full 3D LiDAR point cloud in real-time. SalsaNext is the next version of SalsaNet which has an encoder-decoder architecture where the encoder unit has a set of ResNet blocks and the decoder part combines upsampled features from the residual blocks. In contrast to SalsaNet, we introduce a new context module, replace the ResNet encoder blocks with a new residual dilated convolution stack with gradually increasing receptive fields and add the pixel-shuffle layer in the decoder. Additionally, we switch from stride convolution to average pooling and also apply central dropout treatment. To directly optimize the Jaccard index, we further combine the weighted cross-entropy loss with Lovasz-Softmax loss . We finally inject a Bayesian treatment to compute the epistemic and aleatoric uncertainties for each point in the cloud. We provide a thorough quantitative evaluation on the Semantic-KITTI dataset, which demonstrates that the proposed SalsaNext outperforms other state-of-the-art semantic segmentation.

Examples

Video

Semantic Kitti Segmentation Scores

The up-to-date scores can be found in the Semantic-Kitti page.

How to use the code

First create the anaconda env with: conda env create -f salsanext_cuda10.yml --name salsanext then activate the environment with conda activate salsanext.

To train/eval you can use the following scripts:

Training script (you might need to chmod +x the file)
- We have the following options:
  - -d [String] : Path to the dataset
  - -a [String]: Path to the Architecture configuration file
  - -l [String]: Path to the main log folder
  - -n [String]: additional name for the experiment
  - -c [String]: GPUs to use (default no gpu)
  - -u [String]: If you want to train an Uncertainty version of SalsaNext (default false) [Experimental: tests done so with uncertainty far used pretrained SalsaNext with Deep Uncertainty Estimation]
- For example if you have the dataset at /dataset the architecture config file in /salsanext.yml and you want to save your logs to /logs to train "salsanext" with 2 GPUs with id 3 and 4:
  - ./train.sh -d /dataset -a /salsanext.yml -m salsanext -l /logs -c 3,4

Eval script (you might need to chmod +x the file)
- We have the following options:
  - -d [String]: Path to the dataset
  - -p [String]: Path to save label predictions
  - -m [String]: Path to the location of saved model
  - -s [String]: Eval on Validation or Train (standard eval on both separately)
  - -u [String]: If you want to infer using an Uncertainty model (default false)
  - -c [Int]: Number of MC sampling to do (default 30)
- If you want to infer&evaluate a model that you saved to /salsanext/logs/[the desired run] and you want to infer$eval only the validation and save the label prediction to /pred:
  - ./eval.sh -d /dataset -p /pred -m /salsanext/logs/[the desired run] -s validation -n salsanext

Pretrained Model

SalsaNext

Disclamer

We based our code on RangeNet++, please go show some support!

Citation

@misc{cortinhal2020salsanext,
    title={SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving},
    author={Tiago Cortinhal and George Tzelepis and Eren Erdal Aksoy},
    year={2020},
    eprint={2003.03653},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

Related tags

Overview

SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

Abstract

Examples

Video

Semantic Kitti Segmentation Scores

How to use the code

Pretrained Model

Disclamer

Citation

Owner

Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"

Sharpened cosine similarity torch - A Sharpened Cosine Similarity layer for PyTorch

TransVTSpotter: End-to-end Video Text Spotter with Transformer

Rest API Written In Python To Classify NSFW Images.

Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)

Official Python implementation of the 'Sparse deconvolution'-v0.3.0

BMN: Boundary-Matching Network

A library for performing coverage guided fuzzing of neural networks

Object DGCNN and DETR3D, Our implementations are built on top of MMdetection3D.

Pansharpening by convolutional neural networks in the full resolution framework

[ACM MM 2021] Diverse Image Inpainting with Bidirectional and Autoregressive Transformers

Extending JAX with custom C++ and CUDA code

STBP is a way to train SNN with datasets by Backward propagation.

CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队

The official PyTorch implementation for the paper "sMGC: A Complex-Valued Graph Convolutional Network via Magnetic Laplacian for Directed Graphs".

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

PyTorch implementation of UNet++ (Nested U-Net).

Create time-series datacubes for supervised machine learning with ICEYE SAR images.

Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

Related tags

Overview

SalsaNext: Fast, Uncertainty-aware Semantic Segmentation of LiDAR Point Clouds for Autonomous Driving

Abstract

Examples

Video

Semantic Kitti Segmentation Scores

How to use the code

Pretrained Model

Disclamer

Citation

Owner

Data and Code for ACL 2021 Paper "Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning"

Sharpened cosine similarity torch - A Sharpened Cosine Similarity layer for PyTorch

TransVTSpotter: End-to-end Video Text Spotter with Transformer

Rest API Written In Python To Classify NSFW Images.

Bootstrapped Unsupervised Sentence Representation Learning (ACL 2021)

Official Python implementation of the 'Sparse deconvolution'-v0.3.0

BMN: Boundary-Matching Network

A library for performing coverage guided fuzzing of neural networks

Object DGCNN and DETR3D, Our implementations are built on top of MMdetection3D.

Pansharpening by convolutional neural networks in the full resolution framework

[ACM MM 2021] Diverse Image Inpainting with Bidirectional and Autoregressive Transformers

Extending JAX with custom C++ and CUDA code

STBP is a way to train SNN with datasets by Backward propagation.

CM-NAS: Cross-Modality Neural Architecture Search for Visible-Infrared Person Re-Identification (ICCV2021)

2021搜狐校园文本匹配算法大赛 分比我们低的都是帅哥队

The official PyTorch implementation for the paper "sMGC: A Complex-Valued Graph Convolutional Network via Magnetic Laplacian for Directed Graphs".

Train an RL agent to execute natural language instructions in a 3D Environment (PyTorch)

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On, CVPR 2021

PyTorch implementation of UNet++ (Nested U-Net).

Create time-series datacubes for supervised machine learning with ICEYE SAR images.

2021搜狐校园文本匹配算法大赛分比我们低的都是帅哥队