OcclusionFusion: realtime dynamic 3D reconstruction based on single-view RGB-D

Last update: Dec 15, 2022

Overview

OcclusionFusion (CVPR'2022)

Project Page | Paper | Video

Overview

This repository contains the code for the CVPR 2022 paper OcclusionFusion, where we introduce a novel method to calculate occlusion-aware 3D motion to guide dynamic 3D reconstruction.

In our technique, the motion of visible regions is first estimated and combined with temporal information to infer the motion of the occluded regions through an LSTM-involved graph neural network.

Currently, we provide a pretrained model and a demo. Code for data pre-processing, network training and evaluation will be available soon.

Setup

We use python 3.8.10, pytorch-1.8.0 and pytorch-geometric-1.7.2.

conda create -n occlusionfu python==3.8.10
conda activate occlusionfu
pip install -r requirements.txt
conda install pytorch==1.8.0 torchvision==0.9.0 torchaudio==0.8.0 cudatoolkit=10.2 -c pytorch
pip install torch-scatter==2.0.8 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-sparse==0.6.12 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-cluster==1.5.9 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-spline-conv==1.2.1 -f https://pytorch-geometric.com/whl/torch-1.8.0+cu102.html
pip install torch-geometric==1.7.2

Running the demo

Run the demo with the pretrained model and prepared inputs:

python demo.py

Visualize the input and output:

python visualize.py

The defualt setting of visualize.py will render the network's input and output to a video as follow. You can also change the setting to view the network's input and output with Open3D viewer.

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{lin2022occlusionfusion,
    title={OcclusionFusion: Occlusion-aware Motion Estimation for Real-time Dynamic 3D Reconstruction}, 
    author={Wenbin Lin, Chengwei Zheng, Jun-Hai Yong, Feng Xu}, 
    journal={Conference on Computer Vision and Pattern Recognition (CVPR)}, 
    year={2022}
}

OcclusionFusion: realtime dynamic 3D reconstruction based on single-view RGB-D

Related tags

Overview

OcclusionFusion (CVPR'2022)

Project Page | Paper | Video

Overview

Setup

Running the demo

Citation

Owner

Wenbin Lin

Code image classification of MNIST dataset using different architectures: simple linear NN, autoencoder, and highway network

Official PyTorch implementation for "Low Precision Decentralized Distributed Training with Heterogenous Data"

[arXiv'22] Panoptic NeRF: 3D-to-2D Label Transfer for Panoptic Urban Scene Segmentation

ROS support for Velodyne 3D LIDARs

Tidy interface to polars

Source code for CVPR2022 paper "Abandoning the Bayer-Filter to See in the Dark"

Code for "FGR: Frustum-Aware Geometric Reasoning for Weakly Supervised 3D Vehicle Detection", ICRA 2021

Accelerated NLP pipelines for fast inference on CPU and GPU. Built with Transformers, Optimum and ONNX Runtime.

AnimationKit: AI Upscaling & Interpolation using Real-ESRGAN+RIFE

Source code for CVPR 2020 paper "Learning to Forget for Meta-Learning"

HDMapNet: A Local Semantic Map Learning and Evaluation Framework

Public Code for NIPS submission SimiGrad: Fine-Grained Adaptive Batching for Large ScaleTraining using Gradient Similarity Measurement

Official implementation of the paper: "LDNet: Unified Listener Dependent Modeling in MOS Prediction for Synthetic Speech"

Object-Centric Learning with Slot Attention

The repo for reproducing Seed-driven Document Ranking for Systematic Reviews: A Reproducibility Study

PyTorch implementation of "Learn to Dance with AIST++: Music Conditioned 3D Dance Generation."

Classify bird species based on their songs using SIamese Networks and 1D dilated convolutions.

Official code for "EagerMOT: 3D Multi-Object Tracking via Sensor Fusion" [ICRA 2021]

GNN-based Recommendation Benchmark

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow