Official code for our CVPR 2022 paper: A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

Last update: Dec 15, 2022

TwoStageAlign

Paper | Supp

Abstract

Denoising and demosaicking are two essential steps to reconstruct a clean full-color image from the raw data. Recently, joint denoising and demosaicking (JDD) for burst images, namely JDD-B, has attracted much attention by using multiple raw images captured in a short time to reconstruct a single high-quality image. One key challenge of JDD-B lies in the robust alignment of image frames. State-of-the-art alignment methods in feature domain cannot effectively utilize the temporal information of burst images, where large shifts commonly exist due to camera and object motion. In addition, the higher resolution (e.g., 4K) of modern imaging devices results in larger displacement between frames. To address these challenges, we design a differentiable two-stage alignment scheme sequentially in patch and pixel level for effective JDD-B. The input burst images are firstly aligned in the patch level by using a differentiable progressive block matching method, which can estimate the offset between distant frames with small computational cost. Then we perform implicit pixel-wise alignment in full-resolution feature domain to refine the alignment results. The two stages are jointly trained in an end-to-end manner. Extensive experiments demonstrate the significant improvement of our method over existing JDD-B methods.
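To make the patch-level stage more concrete, here is a minimal NumPy sketch of block matching with a soft-argmin, which is one common way to make the offset estimate differentiable. It is an illustration only, not the method implemented in this repo: the function name `block_match_offsets`, its parameters (`patch`, `radius`, `temperature`), the mean-absolute-difference cost, and the single-scale exhaustive search are all assumptions, and the progressive (coarse-to-fine) search described in the paper is not reproduced.

```python
import numpy as np


def block_match_offsets(ref, src, patch=8, radius=4, temperature=0.01):
    """Estimate one (dy, dx) offset per patch of `ref` by searching in `src`.

    Hypothetical helper for illustration. `ref` and `src` are 2-D float
    arrays of the same shape. Returns an array of shape
    (H // patch, W // patch, 2) holding soft-argmin offsets.
    """
    h, w = ref.shape
    # Pad the source so every candidate window stays inside the array.
    padded = np.pad(src, radius, mode="edge")
    # All candidate integer shifts inside the search window.
    shifts = [(dy, dx)
              for dy in range(-radius, radius + 1)
              for dx in range(-radius, radius + 1)]
    shift_arr = np.array(shifts, dtype=np.float64)  # shape (K, 2)

    ny, nx = h // patch, w // patch
    offsets = np.zeros((ny, nx, 2))
    for by in range(ny):
        for bx in range(nx):
            y0, x0 = by * patch, bx * patch
            ref_blk = ref[y0:y0 + patch, x0:x0 + patch]
            costs = np.empty(len(shifts))
            for k, (dy, dx) in enumerate(shifts):
                ys, xs = y0 + dy + radius, x0 + dx + radius
                cand = padded[ys:ys + patch, xs:xs + patch]
                # Mean absolute difference as the matching cost.
                costs[k] = np.mean(np.abs(ref_blk - cand))
            # Soft-argmin: softmax over negative costs, then the expected
            # shift. A hard argmin would not be differentiable.
            weights = np.exp(-(costs - costs.min()) / temperature)
            weights /= weights.sum()
            offsets[by, bx] = weights @ shift_arr
    return offsets
```

With a low `temperature` the result approaches the hard best-match offset; raising it smooths the estimate across candidates. In a trainable pipeline the same computation would be written with an autograd framework so that gradients flow through the weights.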

Framework

Test

Pretrained models

REDS4

Only one example from REDS4 is included in the dataset folder; please download the full test set from the official website, REDS.
For more details, refer to data preparation.

python /codes/test_Vid4_REDS4_joint_2stage_REDS4.py

Videezy

To evaluate performance on 4K burst images and video, we collected several clips from the Videezy website.
Dataset: Google Drive

python /codes/test_Vid4_REDS4_joint_2stage_Videezy4K.py

SC_burst (Smartphone burst) Dataset

Please refer to GCP-Net.
Whole dataset: BaiduYun with password d8u8.

python /codes/test_Vid4_REDS4_joint_2stage_RealCaptured.py

Train

Training data preparation: please refer to the "Video Super-Resolution" part of data preparation. To create the LMDB dataset, run create_lmdb.py.
Change the training options in train_burst_JDD_2stage.yml.

python -m torch.distributed.launch --nproc_per_node=2 --master_port=4540 train.py -opt options/train/train_GCP_Net.yml --launcher pytorch

Environment

See requirement.txt for the dependencies.
We use PyTorch 1.2. The bundled deformable convolution implementation does not support PyTorch > 1.3, so if you use a newer PyTorch, please replace it with a newer implementation (refer to BasicSR).
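A quick way to check which case applies is to compare the installed PyTorch version against the 1.3 limit stated above. The sketch below is a hypothetical helper, not part of this repo; the names `parse_version` and `bundled_dcn_supported` are assumptions, and it treats every 1.3.x release as supported, which is one reading of "> 1.3".

```python
import re


def parse_version(version):
    """Return (major, minor) from strings such as '1.2.0' or '1.13.1+cu117'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def bundled_dcn_supported(version):
    """True if the version is at most 1.3 (assumed limit for the bundled code)."""
    return parse_version(version) <= (1, 3)


if __name__ == "__main__":
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed.")
    else:
        if bundled_dcn_supported(torch.__version__):
            print(f"PyTorch {torch.__version__}: bundled deformable conv should work.")
        else:
            print(f"PyTorch {torch.__version__}: replace the deformable conv "
                  "with a newer implementation (see BasicSR).")
```

Comparing parsed integer tuples avoids the string-comparison pitfall where "1.13" sorts before "1.3".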

Citation

@inproceedings{guo2022differentiable,
  title={A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift},
  author={Guo, Shi and Yang, Xi and Ma, Jianqi and Ren, Gaofeng and Zhang, Lei},
  booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition},
  year={2022}
}

Acknowledgement

This repo is built upon the framework of EDVR, and we borrow some code from Unprocessing denoising. Thanks to the authors for their excellent work!

Owner

Shi Guo
