Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image

This repository is an implementation of the method described in the following paper:

Shumian Xin, Neal Wadhwa, Tianfan Xue, Jonathan T. Barron, Pratul P. Srinivasan, Jiawen Chen, Ioannis Gkioulekas, and Rahul Garg. "Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image", ICCV 2021.

If you use our code or dataset, please cite our paper:

@article{Xin_2021_ICCV_dual_pixel,
    author    = {Xin, Shumian and Wadhwa, Neal and Xue, Tianfan and Barron, Jonathan T. and Srinivasan, Pratul P. and Chen, Jiawen and Gkioulekas, Ioannis and Garg, Rahul},
    title     = {Defocus Map Estimation and Deblurring From a Single Dual-Pixel Image},
    journal   = {IEEE/CVF International Conference on Computer Vision (ICCV)},
    year      = {2021}
}

Dataset

We captured a new dataset of 17 indoor and outdoor scenes using a Google Pixel 4 smartphone camera. Data can be found in ./DP_data_pixel_4. Google Pixel 4 camera provides dual-pixel (DP) images in the green channel. These DP images are 14-bit, with a black level of 1024. Please refer to this GitHub Repo for more details about Google Pixel's DP data.

We also provide calibrated blur kernels and vignetting patterns of our device in ./DP_data_pixel_4/calibration.

Code

Code implementation is in ./code. It is written in Python, with autograd package Jax. Note: When installing Jax, make sure to install with GPU support.

To reproduce results in the paper (in the paper, results are postprocessed for better visualization), run:

cd ./code; python ./run.py

Each optimization runs for 10,000 iterations with an Adam optimizer, and takes about 2 hours on an Nvidia Titan RTX GPU.

Code has been tested with:

Python 3.7.8
Jax 0.2.19
OpenCV 4.4.0

Note: To run the code on your own Google Pixel 4 data, please adjust the preprocessing step in ./code/util.py/load_data_and_calibration if needed, such that the input dual pixel images are normalized to the range of [0, 1].

Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image

Related tags

Overview

Defocus Map Estimation and Deblurring from a Single Dual-Pixel Image

Dataset

Code

Owner

Computer Vision is an elective course of MSAI, SCSE, NTU, Singapore

Video Representation Learning by Recognizing Temporal Transformations. In ECCV, 2020.

Self-attentive task GAN for space domain awareness data augmentation.

Implementation for our ICCV 2021 paper: Dual-Camera Super-Resolution with Aligned Attention Modules

Finite-temperature variational Monte Carlo calculation of uniform electron gas using neural canonical transformation.

PyTorch implementation of EfficientNetV2

PyTorch implementation of the NIPS-17 paper "Poincaré Embeddings for Learning Hierarchical Representations"

Display, filter and search log messages in your terminal

This repository contains code and data for "On the Multimodal Person Verification Using Audio-Visual-Thermal Data"

TCTrack: Temporal Contexts for Aerial Tracking (CVPR2022)

WORD: Revisiting Organs Segmentation in the Whole Abdominal Region

Differential rendering based motion capture blender project.

Code for the paper "Reinforcement Learning as One Big Sequence Modeling Problem"

Code for One-shot Talking Face Generation from Single-speaker Audio-Visual Correlation Learning (AAAI 2022)

PyTorch implementation for "Mining Latent Structures with Contrastive Modality Fusion for Multimedia Recommendation"

Mesh TensorFlow: Model Parallelism Made Easier

A simple and lightweight genetic algorithm for optimization of any machine learning model

IJCAI2020 & IJCV 2020 :city_sunrise: Unsupervised Scene Adaptation with Memory Regularization in vivo

Permeability Prediction Via Multi Scale 3D CNN

Official Repository for the paper "Improving Baselines in the Wild".