SimpleDepthEstimation - A unified codebase for NN-based monocular depth estimation methods

Last update: Dec 13, 2022

Introduction

This is a unified codebase for NN-based monocular depth estimation methods. The framework is based on detectron2 (with a lot of modifications) and supports both supervised and self-supervised monocular depth estimation. The main goal of this repository is to help readers understand popular depth estimation papers, so I tried my best to keep the code simple.

Environment:

clone this repo

SDE_ROOT=/path/to/SimpleDepthEstimation
git clone https://github.com/zzzxxxttt/SimpleDepthEstimation $SDE_ROOT
cd $SDE_ROOT

create a new conda environment and activate it

conda create -n sde python=3.6 
conda activate sde

install torch==1.8.0 and torchvision==0.9.0 following the official instructions (I haven't tried other PyTorch versions)
install other requirements
```
pip install -r requirements.txt
```

Data preparation

KITTI:

Download and extract the KITTI raw dataset, the refined KITTI depth ground truth, and the Eigen split files, then modify the data path in the config file to point to them.

Training

python path/to/project/train.py --num-gpus 2 --cfg path/to/config RUN_NAME run_name

Evaluation

python path/to/project/train.py --num-gpus 2 --cfg path/to/config --eval MODEL.WEIGHTS /path/to/checkpoint_file
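
The trailing `RUN_NAME run_name` and `MODEL.WEIGHTS /path/to/checkpoint_file` arguments in the commands above look like detectron2-style config overrides: a flat list of `KEY VALUE` pairs, where dots in a key address nested config sections. The sketch below illustrates that convention on a plain dictionary. It is not this repository's code; the `apply_overrides` helper and the default values are hypothetical and only show how such pairs can be folded into a nested config.

```python
def apply_overrides(cfg, opts):
    """Fold a flat [KEY, VALUE, KEY, VALUE, ...] list into a nested dict.

    Hypothetical helper for illustration only; the real framework may
    validate keys and convert value types differently.
    """
    if len(opts) % 2 != 0:
        raise ValueError("overrides must come in KEY VALUE pairs")
    for key, value in zip(opts[0::2], opts[1::2]):
        *parents, leaf = key.split(".")
        node = cfg
        # Walk (or create) the nested sections named by the dotted key.
        for name in parents:
            node = node.setdefault(name, {})
        node[leaf] = value
    return cfg


# Assumed defaults, standing in for the values loaded from the config file.
cfg = {"RUN_NAME": "default", "MODEL": {"WEIGHTS": ""}}
apply_overrides(
    cfg,
    ["RUN_NAME", "run_name", "MODEL.WEIGHTS", "/path/to/checkpoint_file"],
)
print(cfg)
```

Under this convention, any value in the config file can be changed per run from the command line without editing the file itself.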

Results:

KITTI:

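| model | type | config | abs rel err | sq rel err | rms | log rms | d1 | d2 | d3 |
| --- | --- | --- | --- | --- | --- | --- | --- | --- | --- |
| ResNet-18 | supervised | link | 0.076 | 0.306 | 3.066 | 0.116 | 0.936 | 0.990 | 0.998 |
| BTSNet (ResNet-50) | supervised | link | 0.062 | 0.259 | 2.859 | 0.100 | 0.950 | 0.992 | 0.998 |
| MonoDepth2 (ResNet-18) | self-supervised | link | 0.118 | 0.735 | 4.517 | 0.163 | 0.860 | 0.974 | 0.994 |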
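
For readers unfamiliar with the column names, the sketch below computes the metrics as they are commonly defined in the monocular depth estimation literature: absolute relative error, squared relative error, RMSE, RMSE in log space, and the threshold accuracies d1, d2, d3 (the fraction of pixels whose ratio max(pred/gt, gt/pred) is below 1.25, 1.25², and 1.25³). This is an assumption about what the columns mean, not a copy of this repository's evaluation code, which may also apply depth caps, crops, or masks. The `depth_metrics` function is a hypothetical name, and plain Python lists are used so the example has no dependencies.

```python
import math


def depth_metrics(pred, gt):
    """Standard depth metrics over paired, strictly positive depth values."""
    if len(pred) != len(gt) or not gt:
        raise ValueError("pred and gt must be non-empty and the same length")
    n = len(gt)
    pairs = list(zip(pred, gt))

    abs_rel = sum(abs(p - g) / g for p, g in pairs) / n
    sq_rel = sum((p - g) ** 2 / g for p, g in pairs) / n
    rms = math.sqrt(sum((p - g) ** 2 for p, g in pairs) / n)
    log_rms = math.sqrt(
        sum((math.log(p) - math.log(g)) ** 2 for p, g in pairs) / n
    )

    # Threshold accuracy: share of points whose worst-case ratio is small.
    ratios = [max(p / g, g / p) for p, g in pairs]
    d1 = sum(r < 1.25 for r in ratios) / n
    d2 = sum(r < 1.25 ** 2 for r in ratios) / n
    d3 = sum(r < 1.25 ** 3 for r in ratios) / n

    return {
        "abs_rel": abs_rel, "sq_rel": sq_rel, "rms": rms,
        "log_rms": log_rms, "d1": d1, "d2": d2, "d3": d3,
    }


# Toy example: one prediction is off by a factor of two, one is exact.
metrics = depth_metrics(pred=[2.0, 4.0], gt=[1.0, 4.0])
print(metrics)
```

Lower is better for the four error columns, and higher is better for d1, d2, and d3.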

Demo:

python tools/demo.py --cfg path/to/config --input path/to/image --output path/to/output_dir MODEL.WEIGHTS /path/to/checkpoint_file

Todo

add PackNet (added, but its performance still needs verification)
add Dynamic Motion Learning (implemented, but still buggy; help welcome!)
support more datasets

