Code repo for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

Last update: Aug 12, 2022

Related tags

Deep Learning InterpretableMDE

Overview

InterpretableMDE

A PyTorch implementation for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

arXiv link: https://arxiv.org/abs/2108.05312

Data and Model

For MFF models, we use the dataset they released here, and you can download their models as the baselines here. For BTS models, they use a different set of NYUv2 training images (24,231 instead of 50,688), and you download it here. We put all of our models here.

Evaluation

In this project we use yacs to manage the configurations. To evaluate the performance of a model, for example, the MFF model with SENet backbone using our assigning method, simply run

python eval.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn]

from the root directory.

To evaluate the depth selectivity, run

python dissect.py MODEL_WEIGHTS_FILE [PATH_TO_MODEL/mff_senet_asn] LAYERS D_MFF ON_TRAINING_DATA True

then get the depth selectivity and the dissection result of each unit. Layers' names are seperated by _.

Training

To train a model from scratch, run

python train.py MODEL_NAME MFF_resnet

We currently provide four options for MODEL_NAME, and the training scheme will automatically be switched to align with the original ones when using BTS models.

Acknowledgement

The model part of our code is adapted from Revisiting_Single_Depth_Estimation and bts. Some snippets are adapted from monodepth2.

Bibtex

@inproceedings{you2021iccv,
 title = {Towards Interpretable Deep Networks for Monocular Depth Estimation},
 author = {Zunzhi You and Yi-Hsuan Tsai and Wei-Chen Chiu and Guanbin Li},
 booktitle = {International Conference on Computer Vision (ICCV)},
 year = {2021}
}

Code repo for "Towards Interpretable Deep Networks for Monocular Depth Estimation" paper.

Related tags

Overview

InterpretableMDE

Data and Model

Evaluation

Training

Acknowledgement

Bibtex

Owner

Zunzhi You

Feup-csr - Repository holding my group's submission to the CSR project competition

Implementation of Barlow Twins paper

This is the source code for our ICLR2021 paper: Adaptive Universal Generalized PageRank Graph Neural Network.

Federated learning on graph, especially on graph neural networks (GNNs), knowledge graph, and private GNN.

Pytorch Implementation of Auto-Compressing Subset Pruning for Semantic Image Segmentation

Implementation for paper "Towards the Generalization of Contrastive Self-Supervised Learning"

Learning to Map Large-scale Sparse Graphs on Memristive Crossbar

"SOLQ: Segmenting Objects by Learning Queries", SOLQ is an end-to-end instance segmentation framework with Transformer.

Curriculum Domain Adaptation for Semantic Segmentation of Urban Scenes, ICCV 2017

This repository gives an example on how to preprocess the data of the HECKTOR challenge

PyTorch implementation of the method described in the paper VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop.

A-ESRGAN aims to provide better super-resolution images by using multi-scale attention U-net discriminators.

Implementation of the ALPHAMEPOL algorithm, presented in Unsupervised Reinforcement Learning in Multiple Environments.

KITTI-360 Annotation Tool is a framework that developed based on python(cherrypy + jinja2 + sqlite3) as the server end and javascript + WebGL as the front end.

Decorator for PyMC3

https://sites.google.com/cornell.edu/recsys2021tutorial

Python Interview Questions

Ankou: Guiding Grey-box Fuzzing towards Combinatorial Difference

PyTorch Code for the paper "VSE++: Improving Visual-Semantic Embeddings with Hard Negatives"

Predicting the duration of arrival delays for commercial flights.