Det3D

A general 3D Object Detection codebase in PyTorch.

1. Introduction

Det3D is the first 3D object detection toolbox that provides out-of-the-box implementations of many 3D object detection algorithms, such as PointPillars, SECOND, and PIXOR, as well as state-of-the-art methods on major benchmarks such as KITTI (ViP) and nuScenes (CBGS). Key features of Det3D include:

Multi-dataset support: KITTI, nuScenes, Lyft
Point-based and Voxel-based model zoo
State-of-the-art performance
DDP & SyncBN
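
As a rough illustration of what "DDP & SyncBN" refers to in PyTorch terms, the sketch below converts BatchNorm layers to `SyncBatchNorm` and wraps a model in `DistributedDataParallel`. The helper names `convert_to_sync_bn` and `wrap_ddp` and the toy backbone are assumptions made for this example; they are not taken from the Det3D code, which may organize this differently.

```python
import torch.nn as nn
from torch.nn.parallel import DistributedDataParallel


def convert_to_sync_bn(model: nn.Module) -> nn.Module:
    """Replace every BatchNorm layer with SyncBatchNorm (hypothetical helper)."""
    return nn.SyncBatchNorm.convert_sync_batchnorm(model)


def wrap_ddp(model: nn.Module, local_rank: int) -> nn.Module:
    """Wrap the model in DDP (hypothetical helper).

    Assumes torch.distributed.init_process_group has already been called
    and that one process drives one GPU.
    """
    model = model.cuda(local_rank)
    return DistributedDataParallel(
        model, device_ids=[local_rank], output_device=local_rank
    )


# Toy stand-in for a detection backbone; not a Det3D module.
toy_backbone = nn.Sequential(
    nn.Conv2d(4, 8, kernel_size=3, padding=1),
    nn.BatchNorm2d(8),
    nn.ReLU(),
)
toy_backbone = convert_to_sync_bn(toy_backbone)
print(type(toy_backbone[1]).__name__)  # SyncBatchNorm
```

The conversion step runs without a process group, whereas `wrap_ddp` only makes sense inside a distributed launch with one process per GPU.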

2. Installation

Please refer to INSTALLATION.md.

3. Quick Start

Please refer to GETTING_STARTED.md.

4. Model Zoo

4.1 nuScenes

Model	mAP	mATE	mASE	mAOE	mAVE	mAAE	NDS	ckpt
CBGS	49.9	0.335	0.256	0.323	0.251	0.197	61.3	link
PointPillars	41.8	0.363	0.264	0.377	0.288	0.198	56.0	link

The original model and prediction files are available in the CBGS README.
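
The NDS column in the table above can be reproduced from the other columns using the nuScenes detection score definition, NDS = (5 * mAP + sum over the five true-positive errors of (1 - min(1, error))) / 10. The function name below is chosen for this example only; the numbers are copied from the table, with mAP expressed as a fraction.

```python
def nuscenes_detection_score(m_ap, tp_errors):
    """Compute NDS from mAP (in [0, 1]) and the five mean TP errors."""
    tp_scores = [1.0 - min(1.0, err) for err in tp_errors]
    return (5.0 * m_ap + sum(tp_scores)) / 10.0


# Error order: mATE, mASE, mAOE, mAVE, mAAE.
cbgs = nuscenes_detection_score(0.499, [0.335, 0.256, 0.323, 0.251, 0.197])
pointpillars = nuscenes_detection_score(0.418, [0.363, 0.264, 0.377, 0.288, 0.198])

print(f"CBGS NDS: {100 * cbgs:.1f}")                  # CBGS NDS: 61.3
print(f"PointPillars NDS: {100 * pointpillars:.1f}")  # PointPillars NDS: 56.0
```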

4.2 KITTI

SECOND on KITTI (val) Dataset

car  AP @0.70, 0.70,  0.70:
bbox AP:90.54, 89.35, 88.43
bev  AP:89.89, 87.75, 86.81
3d   AP:87.96, 78.28, 76.99
aos  AP:90.34, 88.81, 87.66

PointPillars on KITTI(val) Dataset

car  AP@0.70,  0.70,  0.70:
bbox AP:90.63, 88.86, 87.35
bev  AP:89.75, 86.15, 83.00
3d   AP:85.75, 75.68, 68.93
aos  AP:90.48, 88.36, 86.58
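
In the result blocks above, the three values on each row follow the KITTI evaluation convention of Easy, Moderate, and Hard difficulty, in that order. A minimal sketch for turning such a block into a dictionary is given below; `parse_kitti_result` is a hypothetical helper written for this README, not a function from Det3D.

```python
import re

DIFFICULTIES = ("easy", "moderate", "hard")


def parse_kitti_result(text):
    """Parse rows such as 'bbox AP:90.54, 89.35, 88.43' into a nested dict."""
    results = {}
    for line in text.splitlines():
        match = re.match(r"\s*(\w+)\s+AP:\s*(.+)", line)
        if match is None:
            continue  # skip header rows such as the 'car AP@0.70 ...' line
        metric, values = match.groups()
        scores = [float(v) for v in values.split(",")]
        results[metric] = dict(zip(DIFFICULTIES, scores))
    return results


# SECOND on KITTI (val), copied from the block above.
second_val = """
car  AP @0.70, 0.70,  0.70:
bbox AP:90.54, 89.35, 88.43
bev  AP:89.89, 87.75, 86.81
3d   AP:87.96, 78.28, 76.99
aos  AP:90.34, 88.81, 87.66
"""
parsed = parse_kitti_result(second_val)
print(parsed["3d"]["moderate"])  # 78.28
```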

4.3 Lyft

Lyft Config

4.4 Waymo

5. Functionality

Models
- VoxelNet
- SECOND
- PointPillars
Features
- Multi-task learning
- Distributed Training and Validation
- SyncBN
- Flexible anchor dimensions
- TensorboardX
- Checkpointing & resuming from a breakpoint
- Self-contained visualization
- Fine-tuning
- Multiscale Training & Validation
- Rotated RoI Align

6. TODO List

To Be Released
- CBGS on Lyft (val) Dataset
Models
- PointRCNN
- PIXOR

7. Call for Contributions

Support Waymo Dataset.
Add other 3D detection / segmentation models, such as VoteNet, STD, etc.

8. Developers

Benjin Zhu, Bingqi Ma

9. License

Det3D is released under the Apache license.

10. Citation

Det3D is a derivative codebase of CBGS. If you find this work useful in your research, please consider citing:

@article{zhu2019class,
  title={Class-balanced Grouping and Sampling for Point Cloud 3D Object Detection},
  author={Zhu, Benjin and Jiang, Zhengkai and Zhou, Xiangxin and Li, Zeming and Yu, Gang},
  journal={arXiv preprint arXiv:1908.09492},
  year={2019}
}
