Repo for Photon-Starved Scene Inference using Single Photon Cameras, ICCV 2021

Last update: Nov 15, 2022

Overview

Photon-Starved Scene Inference using Single Photon Cameras

ICCV 2021
Arxiv Project Video

Bhavya Goyal, Mohit Gupta

University of Wisconsin-Madison

Abstract

Scene understanding under low-light conditions is a challenging problem. This is due to the small number of photons captured by the camera and the resulting low signal-to-noise ratio (SNR). Single-photon cameras (SPCs) are an emerging sensing modality capable of capturing images with high sensitivity. Despite having minimal read noise, images captured by SPCs in photon-starved conditions still suffer from strong shot noise, preventing reliable scene inference. We propose photon scale-space, a collection of high-SNR images spanning a wide range of photons-per-pixel (PPP) levels (but with the same scene content), as guides to train inference models on low photon flux images. We develop training techniques that push images with different illumination levels closer to each other in feature representation space. The key idea is that having a spectrum of different brightness levels during training enables effective guidance and increases robustness to shot noise, even in extreme noise cases. Based on the proposed approach, we demonstrate, via simulations and real experiments with a SPAD camera, high performance on various inference tasks such as image classification and monocular depth estimation under ultra-low light, down to < 1 PPP.

Code Structure

.
├── classification          # Code for image classification using Photon Net training
├── monodepth               # Code for monocular depth estimation using Photon Net training
├── simulation              # Scripts for simulating noisy SPAD images
├── figures                 # figures used for results
└── README.md

Requirements/Installation

Install PyTorch (pytorch.org)
pip install -r requirements.txt

How to Use

Download the datasets (CUB/CARS/NYUV2/others) from the official sources and use the scripts in simulation to simulate noisy SPAD images.
Use the classification and monodepth code for image classification and monocular depth estimation using Photon Net.
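
As a rough illustration of the simulation step, the sketch below generates noisy photon-count images from a clean image at several PPP levels. It is not the code in simulation: the function names (scale_to_ppp, simulate_binary_frames, simulate_ppp_levels) are hypothetical, and it assumes a simple, commonly used binary-frame model in which each pixel fires in a frame with probability 1 - exp(-flux), ignoring dark counts and quantum efficiency. It uses only the Python standard library and nested lists for clarity; a practical version would likely be vectorized with NumPy or PyTorch.

```python
import math
import random


def scale_to_ppp(clean, ppp):
    """Scale a clean image (2D list, values >= 0) so its mean equals `ppp`."""
    pixels = [v for row in clean for v in row]
    mean = sum(pixels) / len(pixels)
    if mean <= 0:
        raise ValueError("clean image must have a positive mean intensity")
    return [[v * ppp / mean for v in row] for row in clean]


def simulate_binary_frames(flux, num_frames, rng):
    """Return per-pixel detection counts summed over `num_frames` binary frames.

    Assumed model: the total flux is split evenly across frames, and in each
    frame a pixel detects at least one photon with p = 1 - exp(-flux / num_frames).
    """
    counts = []
    for row in flux:
        out_row = []
        for f in row:
            p = 1.0 - math.exp(-f / num_frames)
            out_row.append(sum(rng.random() < p for _ in range(num_frames)))
        counts.append(out_row)
    return counts


def simulate_ppp_levels(clean, ppp_levels, num_frames=64, seed=0):
    """Simulate one noisy count image per PPP level (hypothetical helper)."""
    rng = random.Random(seed)
    return {
        ppp: simulate_binary_frames(scale_to_ppp(clean, ppp), num_frames, rng)
        for ppp in ppp_levels
    }


# Example: a tiny 2x2 "image" simulated at three brightness levels.
clean_image = [[0.0, 1.0], [0.5, 0.5]]
noisy = simulate_ppp_levels(clean_image, ppp_levels=[0.5, 2.0, 8.0])
for level, counts in noisy.items():
    print(level, counts)
```

Lower PPP levels yield sparser, noisier count images, while higher levels approach the clean image up to scale. Under the binary-frame assumption, counts saturate at num_frames for very bright pixels.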

Citation

@InProceedings{Goyal_2021_ICCV,
    author    = {Goyal, Bhavya and Gupta, Mohit},
    title     = {Photon-Starved Scene Inference Using Single Photon Cameras},
    booktitle = {Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)},
    month     = {October},
    year      = {2021},
    pages     = {2512-2521}
}
