Object DGCNN & DETR3D

This repo contains the implementations of Object DGCNN (https://arxiv.org/abs/2110.06923) and DETR3D (https://arxiv.org/abs/2110.06922). Our implementations are built on top of MMdetection3D.

Prerequisite

mmcv (https://github.com/open-mmlab/mmcv)
mmdet (https://github.com/open-mmlab/mmdetection)
mmseg (https://github.com/open-mmlab/mmsegmentation)
mmdet3d (https://github.com/open-mmlab/mmdetection3d)

Data

Follow the mmdet3d to process the data.

Train

Downloads the pretrained backbone weights to pretrained/
For example, to train Object-DGCNN with pillar on 8 GPUs, please use

tools/dist_train.sh projects/configs/obj_dgcnn/pillar.py 8

Evaluation using pretrained models

Download the weights accordingly.

Backbone	mAP	NDS	Download
DETR3D, ResNet101 w/ DCN	34.7	42.2	model \| log
above, + CBGS	34.9	43.4	model \| log
DETR3D, VoVNet on trainval, evaluation on test set	41.2	47.9	model \| log

Backbone	mAP	NDS	Download
Object DGCNN, pillar	53.2	62.8	model \| log
Object DGCNN, voxel	58.6	66.0	model \| log

To test, use
tools/dist_test.sh projects/configs/obj_dgcnn/pillar_cosine.py /path/to/ckpt 8 --eval=bbox

If you find this repo useful for your research, please consider citing the papers

@inproceedings{
   obj-dgcnn,
   title={Object DGCNN: 3D Object Detection using Dynamic Graphs},
   author={Wang, Yue and Solomon, Justin M.},
   booktitle={2021 Conference on Neural Information Processing Systems ({NeurIPS})},
   year={2021}
}

@inproceedings{
   detr3d,
   title={DETR3D: 3D Object Detection from Multi-view Images via 3D-to-2D Queries},
   author={Wang, Yue and Guizilini, Vitor and Zhang, Tianyuan and Wang, Yilun and Zhao, Hang and and Solomon, Justin M.},
   booktitle={The Conference on Robot Learning ({CoRL})},
   year={2021}
}

Object DGCNN and DETR3D, Our implementations are built on top of MMdetection3D.

Related tags

Overview

Object DGCNN & DETR3D

Prerequisite

Data

Train

Evaluation using pretrained models

Owner

Wang, Yue

Using NumPy to solve the equations of fluid mechanics together with Finite Differences, explicit time stepping and Chorin's Projection methods

StarGAN - Official PyTorch Implementation (CVPR 2018)

Fluency ENhanced Sentence-bert Evaluation (FENSE), metric for audio caption evaluation. And Benchmark dataset AudioCaps-Eval, Clotho-Eval.

NAACL2021 - COIL Contextualized Lexical Retriever

Moment-DETR code and QVHighlights dataset

PyTorch reimplementation of hand-biomechanical-constraints (ECCV2020)

Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes

Code base for reproducing results of I.Schubert, D.Driess, O.Oguz, and M.Toussaint: Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics. NeurIPS (2021)

This is the repository for our paper Ditch the Gold Standard: Re-evaluating Conversational Question Answering

Official PyTorch implementation of MAAD: A Model and Dataset for Attended Awareness

Continuous Conditional Random Field Convolution for Point Cloud Segmentation

Building blocks for uncertainty-aware cycle consistency presented at NeurIPS'21.

performing moving objects segmentation using image processing techniques with opencv and numpy

Scalable, event-driven, deep-learning-friendly backtesting library

Breaking the Dilemma of Medical Image-to-image Translation

Image Matching Evaluation

[CVPR'22] Weakly Supervised Semantic Segmentation by Pixel-to-Prototype Contrast

Project page of the paper 'Analyzing Perception-Distortion Tradeoff using Enhanced Perceptual Super-resolution Network' (ECCVW 2018)

Official repository for "Orthogonal Projection Loss" (ICCV'21)

Step by Step on how to create an vision recognition model using LOBE.ai, export the model and run the model in an Azure Function