Delving into Localization Errors for Monocular 3D Object Detection, CVPR'2021

Last update: Jan 04, 2023

Overview

Delving into Localization Errors for Monocular 3D Detection

By Xinzhu Ma, Yinmin Zhang, Dan Xu, Dongzhan Zhou, Shuai Yi, Haojie Li, Wanli Ouyang.

Introduction

This repository is the official implementation of the paper 'Delving into Localization Errors for Monocular 3D Object Detection'. Through intensive diagnostic experiments, we quantify the impact introduced by each sub-task and find that 'localization error' is the vital factor restricting monocular 3D detection. We also investigate the underlying reasons behind localization errors, analyze the issues they might bring, and propose three strategies.

Usage

Installation

This repo is tested on our local environment (python=3.6, cuda=9.0, pytorch=1.1), and we recommend using anaconda to create a virtual environment:

conda create -n monodle python=3.6

Then, activate the environment:

conda activate monodle

Install PyTorch:

conda install pytorch==1.1.0 torchvision==0.3.0 cudatoolkit=9.0 -c pytorch

and other requirements:

pip install -r requirements.txt
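
As a quick sanity check after installation, a short script along the following lines can report the interpreter and PyTorch versions so they can be compared with the tested setup (python=3.6, cuda=9.0, pytorch=1.1). This is a minimal sketch and not part of the repository; the function name `report_environment` is hypothetical.

```python
import sys


def report_environment():
    """Collect interpreter and PyTorch details (hypothetical helper, not part of the repo)."""
    info = {"python": "{}.{}".format(*sys.version_info[:2])}
    try:
        import torch  # only importable once the PyTorch install step has succeeded
    except ImportError:
        info["torch"] = None  # PyTorch is not installed in this environment
        return info
    info["torch"] = torch.__version__
    info["cuda_build"] = torch.version.cuda  # CUDA version PyTorch was built with (None for CPU-only builds)
    info["cuda_available"] = torch.cuda.is_available()
    return info


if __name__ == "__main__":
    for key, value in report_environment().items():
        print(key, "=", value)
```

Run it inside the activated monodle environment. A `torch` value of `None` means the PyTorch install step did not take effect.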

Data Preparation

Please download KITTI dataset and organize the data as follows:

#ROOT
  |data/
    |KITTI/
      |ImageSets/ [already provided in this repo]
      |object/			
        |training/
          |calib/
          |image_2/
          |label/
        |testing/
          |calib/
          |image_2/
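
Before training, it can help to confirm that the folders match the layout above. The following is a minimal sketch that only checks that each directory exists; it does not validate the files inside them. The names `EXPECTED_DIRS` and `missing_kitti_dirs` are hypothetical and not part of the repository.

```python
from pathlib import Path

# Directories taken from the layout above, relative to #ROOT.
EXPECTED_DIRS = [
    "data/KITTI/ImageSets",
    "data/KITTI/object/training/calib",
    "data/KITTI/object/training/image_2",
    "data/KITTI/object/training/label",
    "data/KITTI/object/testing/calib",
    "data/KITTI/object/testing/image_2",
]


def missing_kitti_dirs(root):
    """Return the expected directories that do not exist under `root` (hypothetical helper)."""
    root = Path(root)
    return [d for d in EXPECTED_DIRS if not (root / d).is_dir()]


if __name__ == "__main__":
    missing = missing_kitti_dirs(".")  # assumes the script is run from #ROOT
    if missing:
        print("Missing directories:")
        for d in missing:
            print("  " + d)
    else:
        print("KITTI layout looks complete.")
```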

Training & Evaluation

Move to the working directory and train the network:

 cd #ROOT
 cd experiments/example
 python ../../tools/train_val.py --config config_patchnet.yaml

The model will be evaluated automatically once training completes. If you only want to evaluate your trained model (or the provided pre-trained model), modify the test part of the configuration in the .yaml file and use the following command:

python ../../tools/train_val.py --config config_patchnet.yaml --e
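
If you prefer to launch these runs from a Python script, for example to queue several experiments, the two commands above can be wrapped as in the sketch below. It uses only the script path and flags shown above; the helper names `build_command` and `run`, and the assumption that the script is started from #ROOT, are illustrative and not part of the repository.

```python
import subprocess
import sys


def build_command(config="config_patchnet.yaml", evaluate_only=False):
    """Assemble the train_val.py invocation shown above as an argument list."""
    cmd = [sys.executable, "../../tools/train_val.py", "--config", config]
    if evaluate_only:
        cmd.append("--e")  # evaluation-only flag, as in the command above
    return cmd


def run(evaluate_only=False, workdir="experiments/example"):
    """Run the command from the experiment directory (assumes the current directory is #ROOT)."""
    # check=True raises subprocess.CalledProcessError if the script exits with an error.
    subprocess.run(build_command(evaluate_only=evaluate_only), cwd=workdir, check=True)
```

Calling `run()` starts training followed by the automatic evaluation, and `run(evaluate_only=True)` only evaluates.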

For ease of use, we also provide a pre-trained checkpoint, which can be used for evaluation directly. See the table below for its performance.

	AP40@Easy	AP40@Mod.	AP40@Hard
In original paper	17.45	13.66	11.68
In this repo	17.94	13.72	12.10

Citation

If you find our work useful in your research, please consider citing:

@InProceedings{Ma_2021_CVPR,
author = {Ma, Xinzhu and Zhang, Yinmin and Xu, Dan and Zhou, Dongzhan and Yi, Shuai and Li, Haojie and Ouyang, Wanli},
title = {Delving into Localization Errors for Monocular 3D Object Detection},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month = {June},
year = {2021}}

Acknowledgment

This repo benefits from the excellent work CenterNet. Please also consider citing it.

License

This project is released under the MIT License.

Contact

If you have any questions about this project, please feel free to contact [email protected].
