MonoRCNN is a monocular 3D object detection method for automonous driving

Last update: Dec 27, 2022

Related tags

Overview

MonoRCNN

MonoRCNN is a monocular 3D object detection method for automonous driving, published at ICCV 2021. This project is an implementation of MonoRCNN.

Visualization

Methodology

Installation

Python 3.6
PyTorch 1.5.0
Detectron2 0.1.3

Please use the Detectron2 included in this project. To ignore fully occluded objects during training, build.py, rpn.py, and roi_heads.py have been modified.

Dataset Preparation

KITTI

Model & Log

KITTI val1 split

Organize the downloaded files as follows:

├── projects
│   ├── MonoRCNN
│   │   ├── output
│   │   │   ├── model
│   │   │   ├── log.txt
│   │   │   ├── ...

Test

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1 --resume --eval-only

Set VISUALIZE as True to visualize 3D object detection results (saved in output/evaluation/test/visualization).

Training

cd projects/MonoRCNN
./main.py --config-file config/MonoRCNN_KITTI.yaml --num-gpus 1

Citation

If you find this project useful in your research, please cite:

@inproceedings{MonoRCNN_ICCV21,
    title = {Geometry-based Distance Decomposition for Monocular 3D Object Detection},
    author = {Xuepeng Shi and Qi Ye and 
              Xiaozhi Chen and Chuangrong Chen and 
              Zhixiang Chen and Tae-Kyun Kim},
    booktitle = {ICCV},
    year = {2021},
}

Contact

[email protected]

MonoRCNN is a monocular 3D object detection method for automonous driving

Related tags

Overview

MonoRCNN

Visualization

Methodology

Related Link

Installation

Dataset Preparation

Model & Log

Test

Training

Citation

Contact

Acknowledgement

Owner

I-SECRET: Importance-guided fundus image enhancement via semi-supervised contrastive constraining

Tensorflow 2.x implementation of Panoramic BlitzNet for object detection and semantic segmentation on indoor panoramic images.

Official PyTorch implementation of "Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition" in AAAI2022.

gtfs2vec - Learning GTFS Embeddings for comparing PublicTransport Offer in Microregions

Fast Style Transfer in TensorFlow

VOLO: Vision Outlooker for Visual Recognition

Python code for loading the Aschaffenburg Pose Dataset.

DGL-TreeSearch and the Gurobi-MWIS interface

Code for the SIGGRAPH 2021 paper "Consistent Depth of Moving Objects in Video".

Awesome Human Pose Estimation

Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Part-Aware Data Augmentation for 3D Object Detection in Point Cloud

A novel pipeline framework for multi-hop complex KGQA task. About the paper title: Improving Multi-hop Embedded Knowledge Graph Question Answering by Introducing Relational Chain Reasoning

A PyTorch Implementation of Single Shot Scale-invariant Face Detector.

PaddleBoBo是基于PaddlePaddle和PaddleSpeech、PaddleGAN等开发套件的虚拟主播快速生成项目

This is the official pytorch implementation of AutoDebias, an automatic debiasing method for recommendation.

Best Practices on Recommendation Systems

graph-theoretic framework for robust pairwise data association

pytorch implementation of trDesign

Immortal tracker