Monocular 3D Object Detection: An Extrinsic Parameter Free Approach (CVPR2021)

Yunsong Zhou, Yuan He, Hongzi Zhu, Cheng Wang, Hongyang Li, Qinhong Jiang

Our paper is now available on CVPR 2021 open access.

Introduction

Our framework is implemented and tested with Ubuntu 16.04, CUDA 8.0/9.0, Python 3, PyTorch 0.4/1.0/1.1, and NVIDIA Tesla V100/TITAN X GPUs.

If you find our work useful in your research, please consider citing our paper:

@InProceedings{Zhou_2021_CVPR,
author    = {Zhou, Yunsong and He, Yuan and Zhu, Hongzi and Wang, Cheng and Li, Hongyang and Jiang, Qinhong},
title     = {Monocular 3D Object Detection: An Extrinsic Parameter Free Approach},
booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
month     = {June},
year      = {2021},
pages     = {7556-7566}
}

Requirements

CUDA, cuDNN, Python, and PyTorch

This project is tested with CUDA 8.0/9.0, Python 3, PyTorch 0.4/1.0/1.1, and NVIDIA Tesla V100/TITAN X GPUs. Almost all of the packages we use are included with Anaconda.

Please install compatible versions of CUDA and cuDNN, and then install Anaconda3 and PyTorch.
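After installation, a quick check can confirm which versions are actually visible to Python. The snippet below is a minimal sketch and not part of this repository: the function name `describe_environment` is hypothetical, and it only reports what it finds, without enforcing the tested versions listed above.

```python
import importlib
import sys


def describe_environment():
    """Report Python, PyTorch, CUDA and cuDNN versions (None if unavailable)."""
    info = {
        "python": sys.version.split()[0],
        "torch": None,
        "cuda_available": None,
        "cuda": None,
        "cudnn": None,
    }
    try:
        torch = importlib.import_module("torch")
    except (ImportError, OSError):
        # PyTorch is not installed (or failed to load); leave its fields as None.
        return info
    info["torch"] = torch.__version__
    info["cuda_available"] = torch.cuda.is_available()
    info["cuda"] = torch.version.cuda                # CUDA version PyTorch was built with
    info["cudnn"] = torch.backends.cudnn.version()   # None when cuDNN is not available
    return info


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

If `torch` prints as `None`, PyTorch is not importable from the active Anaconda environment.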

Data preparation

Download and unzip the full KITTI detection dataset.
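To check that the archives were unzipped completely, the sketch below looks for the folders that the standard KITTI object detection archives (left color images, labels, calibration) normally produce. This layout is an assumption about the stock KITTI download, not a statement of what this repository's code expects, and `missing_kitti_dirs` is a hypothetical helper name.

```python
from pathlib import Path

# Folders created by the standard KITTI object detection archives (assumed layout).
EXPECTED_SUBDIRS = (
    "training/image_2",
    "training/label_2",
    "training/calib",
    "testing/image_2",
    "testing/calib",
)


def missing_kitti_dirs(root):
    """Return the expected sub-folders that do not exist under `root`."""
    root = Path(root)
    return [sub for sub in EXPECTED_SUBDIRS if not (root / sub).is_dir()]


if __name__ == "__main__":
    import sys

    kitti_root = sys.argv[1] if len(sys.argv) > 1 else "."
    missing = missing_kitti_dirs(kitti_root)
    if missing:
        print("Missing folders:", ", ".join(missing))
    else:
        print("KITTI folder layout looks complete.")
```

Run it with the path to the unzipped dataset as the only argument; an empty result means all five folders were found.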

Training

I am currently busy with my own courses and will sort out the remaining work in the near future. The relevant code and models will be available soon.
