Code and models for "Pano3D: A Holistic Benchmark and a Solid Baseline for 360 Depth Estimation", OmniCV Workshop @ CVPR21.

Last update: Dec 29, 2022

Overview

Pano3D

A Holistic Benchmark and a Solid Baseline for 360° Depth Estimation

Pano3D is a new benchmark for depth estimation from spherical panoramas. We generate a dataset (using GibsonV2) and provide baselines for holistic performance assessment, offering:

Primary and secondary trait metrics:
- Direct depth performance:
  - (w)RMSE
  - (w)RMSLE
  - AbsRel
  - SqRel
  - (w)Relative accuracy (δ) @ {1.05, 1.1, 1.25, 1.25², 1.25³}
- Boundary discontinuity preservation:
  - Precision @ {0.25, 0.5, 1.0}m
  - Recall @ {0.25, 0.5, 1.0}m
  - Depth boundary errors of accuracy and completeness
- Surface smoothness:
  - RMSE°
  - Relative accuracy (α) @ {11.25°, 22.5°, 30°}

Out-of-distribution & zero-shot cross-dataset transfer:
- Different depth distribution test set
- Varying scene context test set
- Shifted camera domain test set
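
The direct depth metrics listed above can be sketched in NumPy as follows. This is a minimal illustration, not the benchmark's reference implementation: the function names, the validity mask (`gt > 0` and `pred > 0`), and the choice of cosine-of-latitude weights for the "(w)" variants on an equirectangular grid are all assumptions made for this example.

```python
import numpy as np

def spherical_weights(height, width):
    """Hypothetical helper: weights proportional to the solid angle of each
    equirectangular pixel (cosine of the row latitude), normalised to sum to 1."""
    lat = (0.5 - (np.arange(height) + 0.5) / height) * np.pi  # row-centre latitudes
    w = np.cos(lat)[:, None] * np.ones((1, width))
    return w / w.sum()

def direct_depth_metrics(pred, gt, weights=None,
                         thresholds=(1.05, 1.1, 1.25, 1.25 ** 2, 1.25 ** 3)):
    """Compute RMSE, RMSLE, AbsRel, SqRel and delta accuracies.

    Passing `weights` yields the weighted (w) variants of RMSE, RMSLE and
    delta; AbsRel and SqRel stay plain means, mirroring the list above.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)
    valid = (gt > 0) & (pred > 0)          # assumed validity criterion
    if weights is None:
        weights = np.ones(gt.shape)
    w = weights[valid]
    w = w / w.sum()                        # renormalise over valid pixels
    p, g = pred[valid], gt[valid]

    diff = p - g
    log_diff = np.log(p) - np.log(g)
    ratio = np.maximum(p / g, g / p)       # symmetric ratio used by delta

    return {
        "rmse": float(np.sqrt(np.sum(w * diff ** 2))),
        "rmsle": float(np.sqrt(np.sum(w * log_diff ** 2))),
        "abs_rel": float(np.mean(np.abs(diff) / g)),
        "sq_rel": float(np.mean(diff ** 2 / g)),
        "delta": {t: float(np.sum(w * (ratio < t))) for t in thresholds},
    }
```

Calling `direct_depth_metrics(pred, gt)` gives the unweighted metrics, while `direct_depth_metrics(pred, gt, weights=spherical_weights(*gt.shape))` down-weights the over-sampled rows near the poles of the panorama.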

By disentangling generalization and assessing all depth properties, Pano3D aspires to drive progress in benchmarking 360° depth estimation.

Using Pano3D to search for a solid baseline highlights the benefit of exploiting complementary error terms, adding encoder-decoder skip connections, and using photometric augmentations.

TODO

Demo

A publicly hosted demo of the baseline models can be found here. Using the web app, it is possible to upload a panorama and download a 3D reconstructed mesh of the scene using the derived depth map.

Note that due to the external host's caching issues, it might be necessary to refresh your browser's cache in between runs to update the 3D models.
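
The step from a predicted depth map to 3D geometry, as mentioned for the demo above, can be illustrated by unprojecting an equirectangular depth map to a point per pixel. This is only a sketch under stated assumptions: the function name is hypothetical, depth is taken to be the distance along each viewing ray, and the axis convention (x right, y up, z forward) is chosen for the example. It does not describe how the hosted web app is actually implemented.

```python
import numpy as np

def equirect_depth_to_points(depth):
    """Unproject an (H, W) equirectangular depth map to an (H, W, 3) point map.

    Assumes `depth` stores the distance along each pixel's viewing ray.
    """
    depth = np.asarray(depth, dtype=np.float64)
    h, w = depth.shape
    # Longitude in (-pi, pi) and latitude in (-pi/2, pi/2) at pixel centres.
    lon = ((np.arange(w) + 0.5) / w - 0.5) * 2.0 * np.pi
    lat = (0.5 - (np.arange(h) + 0.5) / h) * np.pi
    lon, lat = np.meshgrid(lon, lat)
    # Unit ray directions (assumed convention: x right, y up, z forward).
    rays = np.stack([np.cos(lat) * np.sin(lon),
                     np.sin(lat),
                     np.cos(lat) * np.cos(lon)], axis=-1)
    # Scale each unit ray by its depth to get a 3D point.
    return rays * depth[..., None]
```

A mesh could then be formed by connecting the points of neighbouring pixels into triangles, which is one common way to turn such a point map into a surface.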

Data

Download

To download the data, follow the instructions at vcl3d.github.io/Pano3D/download/.

Please note that getting access to the data download links is a two-step process: the dataset is a derivative, so compliance with the original dataset's terms and usage agreements is required. Therefore:

First, fill in this Google Form.
Then, submit an access request at each of the Zenodo repositories for the dataset partitions you need.

After both these steps are completed, you will soon receive the download links for each dataset partition.

Owner

Visual Computing Lab, Information Technologies Institute, Centre for Research and Technology Hellas
