Behind the Curtain: Learning Occluded Shapes for 3D Object Detection

Last update: Dec 19, 2022

Related tags

Deep Learning BtcDet

Overview

Acknowledgement

We implement our model, BtcDet, based on [OpenPCDet 0.3.0].

Installation

Requirements

All code was tested in the following environment:

Linux (tested on Ubuntu 14.04/16.04)
Python 3.6+
PyTorch 1.1 or higher (tested on PyTorch 1.7, 1.8.1, 1.9, 1.10)
CUDA 9.0 or higher (PyTorch 1.3+ needs CUDA 9.2+; tested on CUDA 10.2)
spconv v1.2.1 (commit fad3000249d27ca918f2655ff73c41f39b0f3127)
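
The version requirements above can be checked before building anything. The following is a minimal sketch and not part of BtcDet: the helper names parse_version and meets_minimum are hypothetical, and the minimum versions are copied from the list above. PyTorch is only inspected if it is already importable.

```python
import importlib.util
import re
import sys


def parse_version(text):
    """Turn a version string such as '1.8.1+cu102' into a tuple like (1, 8, 1)."""
    numbers = re.findall(r"\d+", text.split("+")[0])
    return tuple(int(n) for n in numbers[:3])


def meets_minimum(found, minimum):
    """Return True if version string `found` is at least `minimum`."""
    return parse_version(found) >= parse_version(minimum)


if __name__ == "__main__":
    python_version = "{}.{}".format(*sys.version_info[:2])
    print("Python", python_version, "ok:", meets_minimum(python_version, "3.6"))

    # torch is optional here: report on it only if it is installed.
    if importlib.util.find_spec("torch") is not None:
        import torch
        print("PyTorch", torch.__version__, "ok:",
              meets_minimum(torch.__version__, "1.1"))
        print("CUDA (as seen by PyTorch):", torch.version.cuda)
    else:
        print("PyTorch is not installed yet")
```

Versions are compared as integer tuples rather than strings, so that 1.10 is correctly treated as newer than 1.9.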

Install

b. Install the dependent libraries as follows:

Install the dependent Python libraries:

pip install -r requirements.txt

Install the SparseConv library. We use the implementation from [spconv].
- If you use PyTorch 1.1, make sure you install spconv v1.0 (commit 8da6f96) instead of the latest version.
- If you use PyTorch 1.3+, you need to install spconv v1.2. Following the spconv author's instructions, build it from source:
```
git clone -b v1.2.1 https://github.com/traveller59/spconv.git --recursive
cd spconv
sudo apt-get install libboost-all-dev
python setup.py bdist_wheel
cd ./dist
```
Then use pip to install the generated wheel file:
```
pip install spconv-1.2.1-{your system info}.whl
```
After that, first leave the spconv directory, then run import spconv in Python to check that it was installed correctly.
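
The import check can also be scripted. This is a minimal sketch, assuming only the standard library; check_import is a hypothetical helper name, not something BtcDet or spconv provides. It reports a failure instead of raising, which helps when the compiled extension fails to load.

```python
import importlib
import importlib.util


def check_import(module_name):
    """Try to import `module_name`; return (ok, detail) instead of raising."""
    if importlib.util.find_spec(module_name) is None:
        return False, "module not found"
    try:
        module = importlib.import_module(module_name)
    except Exception as exc:  # e.g. a compiled extension that fails to load
        return False, "{}: {}".format(type(exc).__name__, exc)
    # On success, report where the module was loaded from.
    return True, getattr(module, "__file__", "built-in")


if __name__ == "__main__":
    ok, detail = check_import("spconv")
    print("spconv importable:", ok, "-", detail)
```

On success the detail is the path of the imported module, which shows whether Python picked up the installed wheel or a local source directory.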

c. Install this btcdet library by running the following command:

cd btcdet
python setup.py develop

Preparation

KITTI Dataset

Please download the official KITTI 3D object detection dataset and organize the downloaded files as follows (the road planes can be downloaded from [road plane]; they are optional and used for data augmentation during training):

BtcDet
├── data
│   ├── kitti
│   │   ├── detection3d
│   │   │   ├── ImageSets
│   │   │   ├── training
│   │   │   │   ├── calib & velodyne & label_2 & image_2 & (optional: planes) & (optional: depth_2)
│   │   │   ├── testing
│   │   │   │   ├── calib & velodyne & image_2
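
Before generating the data infos, it can help to confirm that the layout above is in place. The sketch below is illustrative and not part of BtcDet: missing_kitti_dirs and REQUIRED are hypothetical names, and the path data/kitti/detection3d assumes the script is run from the BtcDet root directory. The optional planes and depth_2 directories are not checked.

```python
from pathlib import Path

# Sub-directories taken from the layout above; planes and depth_2 are optional.
REQUIRED = {
    "training": ["calib", "velodyne", "label_2", "image_2"],
    "testing": ["calib", "velodyne", "image_2"],
}


def missing_kitti_dirs(detection3d_root):
    """Return the required directories that are absent under detection3d_root."""
    root = Path(detection3d_root)
    expected = [root / "ImageSets"]
    for split, names in REQUIRED.items():
        expected.extend(root / split / name for name in names)
    return [str(path) for path in expected if not path.is_dir()]


if __name__ == "__main__":
    missing = missing_kitti_dirs("data/kitti/detection3d")
    if missing:
        print("Missing directories:")
        for path in missing:
            print("  " + path)
    else:
        print("KITTI layout looks complete")
```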

Generate the data infos by running the following command:

python -m btcdet.datasets.kitti.kitti_dataset create_kitti_infos tools/cfgs/dataset_configs/kitti_dataset.yaml

Generate approximated complete object points:

(From the btcdet directory, run:)

python -m btcdet.datasets.multifindbestfit

Alternatively, you can use our preprocessed KITTI data, which includes the generated complete object points. Download it [here (about 31 GB)], put the zip file inside data/kitti/, and unzip it as the detection3d directory.

Run training:

cd tools/

Single-GPU training

mkdir -p ../output/kitti_car

python train.py --cfg_file ./cfgs/model_configs/btcdet_kitti_car.yaml --output_dir ../output/kitti_car/ --batch_size 2

Multi-GPU training

bash scripts/dist_train.sh 4 --batch_size 8 --gpu_str "0,1,2,3" --cfg_file ./cfgs/model_configs/btcdet_kitti_car.yaml --output_dir ../output/kitti_car/
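
The two commands use different batch sizes (2 on one GPU, 8 on four). The sketch below works through that arithmetic under an explicit assumption: that --batch_size is the total across all GPUs and is divided evenly among them, which is the convention in OpenPCDet's training script. This README does not confirm that BtcDet behaves the same way, so check tools/train.py before relying on it. per_gpu_batch_size is a hypothetical helper.

```python
def per_gpu_batch_size(total_batch_size, num_gpus):
    """Split a total batch size evenly across GPUs.

    ASSUMPTION: --batch_size is the total across all GPUs, as in OpenPCDet's
    distributed training. Verify this against tools/train.py.
    """
    if num_gpus <= 0:
        raise ValueError("num_gpus must be positive")
    if total_batch_size % num_gpus != 0:
        raise ValueError("batch size must be divisible by the number of GPUs")
    return total_batch_size // num_gpus


if __name__ == "__main__":
    # Values from the multi-GPU command above: 4 GPUs, --batch_size 8.
    print(per_gpu_batch_size(8, 4))  # prints 2
```

Under that assumption, each GPU in the multi-GPU run processes 2 samples per step, the same as the single-GPU command.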


Owner

Qiangeng Xu
