the official code for ICRA 2021 Paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Last update: Jul 27, 2022

Related tags

Deep Learning G2S

Overview

G2S

This is the official code for ICRA 2021 Paper: Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation by Hemang Chawla, Arnav Varma, Elahe Arani and Bahram Zonooz.

G2S (GPS-to-Scale) Loss is a dynamically-weighted loss that can be added to the appearance-based losses to train any monocular self-supervised depth estimation architecture to get scale-consistant and scale-aware depth estimates at inference.

Here, we provide helper GPS dataloader and the G2S loss classes for using this loss with any model.

For details, please see the Paper and Presentation.

KITTI GPS

The GPS files containing geodesic gps information of raw kitti dataset in local coordinates for training with the g2s loss can be found in the assets folder as kitti_gps_raw.zip.
Unzip the file at /path/to/KITTI/raw_data/sync to merge the GPS files in the expected directory tree structure.

Usage

You can use the G2S class in lossG2S.py within your project for scale-consistent and -aware predictions. This requires using the copresent GPS modality along with images. To load the GPS, please adopt the GPSDataloader class within dataloaderGPS.py into your images dataloader.

Cite Our Work

If you find the code useful in your research, please consider citing our paper:

@inproceedings{chawlavarma2021multimodal,
	author={H. {Chawla} and A. {Varma} and E. {Arani} and B. {Zonooz}},
	booktitle={2021 IEEE International Conference on Robotics and Automation (ICRA)},
	title={Multimodal Scale Consistency and Awareness for Monocular Self-Supervised
	Depth Estimation},
	location={Xi’an, China},
	publisher={IEEE (in press)},
	year={2021}
}

License

This project is licensed under the terms of the MIT license.

the official code for ICRA 2021 Paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

Related tags

Overview

G2S

KITTI GPS

Usage

Cite Our Work

License

Owner

NeurAI

Bachelor's Thesis in Computer Science: Privacy-Preserving Federated Learning Applied to Decentralized Data

Code, Data and Demo for Paper: Controllable Generation from Pre-trained Language Models via Inverse Prompting

GndNet: Fast ground plane estimation and point cloud segmentation for autonomous vehicles using deep neural networks.

Machine Learning Model deployment for Container (TensorFlow Serving)

[CVPR 2021] Official PyTorch Implementation for "Iterative Filter Adaptive Network for Single Image Defocus Deblurring"

Unofficial implement with paper SpeakerGAN: Speaker identification with conditional generative adversarial network

Distributed DataLoader For Pytorch Based On Ray

A library for preparing, training, and evaluating scalable deep learning hybrid recommender systems using PyTorch.

SAAVN - Sound Adversarial Audio-Visual Navigation,ICLR2022 (In PyTorch)

Born-Infeld (BI) for AI: Energy-Conserving Descent (ECD) for Optimization

Y. Zhang, Q. Yao, W. Dai, L. Chen. AutoSF: Searching Scoring Functions for Knowledge Graph Embedding. IEEE International Conference on Data Engineering (ICDE). 2020

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Existing Literature about Machine Unlearning

This repository contain code on Novelty-Driven Binary Particle Swarm Optimisation for Truss Optimisation Problems.

Group Fisher Pruning for Practical Network Compression(ICML2021)

ConvMAE: Masked Convolution Meets Masked Autoencoders

Official implementation for paper Render In-between: Motion Guided Video Synthesis for Action Interpolation

A PyTorch implementation of the Transformer model in "Attention is All You Need".

This program uses trial auth token of Azure Cognitive Services to do speech synthesis for you.

A gesture recognition system powered by OpenPose, k-nearest neighbours, and local outlier factor.