Python code to fuse multiple RGB-D images into a TSDF voxel volume.

Last update: Jan 03, 2023

Overview

Volumetric TSDF Fusion of RGB-D Images in Python

This is a lightweight Python script that fuses multiple registered color and depth images into a projective truncated signed distance function (TSDF) volume, which can then be used to create high-quality 3D surface meshes and point clouds. Tested on Ubuntu 16.04.
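To make the idea of a projective TSDF concrete, here is a minimal NumPy sketch of fusing a single depth frame into a voxel volume with a weighted running average. It is an illustration under stated assumptions, not the code shipped in this repository: the function name `integrate_depth`, its parameters, the unit weight per observation, and the exact truncation rule are all hypothetical choices, and color fusion is left out.

```python
import numpy as np

def integrate_depth(tsdf, weight, vol_origin, voxel_size,
                    depth_m, cam_intr, cam_pose, trunc_margin):
    """Fuse one depth frame into a TSDF volume, in place (hypothetical helper).

    tsdf, weight : float arrays of shape (X, Y, Z)
    vol_origin   : world position (meters) of voxel (0, 0, 0)
    depth_m      : (H, W) depth image in meters, 0 marks invalid pixels
    cam_intr     : 3x3 pinhole intrinsics
    cam_pose     : 4x4 camera-to-world pose
    """
    h, w = depth_m.shape
    # Integer index and world position of every voxel.
    grid = np.meshgrid(*[np.arange(n) for n in tsdf.shape], indexing="ij")
    vox = np.stack(grid, axis=-1).reshape(-1, 3)
    world = np.asarray(vol_origin, dtype=float) + vox * voxel_size

    # Transform voxel positions from world to camera coordinates.
    world_to_cam = np.linalg.inv(cam_pose)
    cam = world @ world_to_cam[:3, :3].T + world_to_cam[:3, 3]
    z = cam[:, 2]

    # Project into the image with the pinhole model.
    in_front = z > 1e-6
    safe_z = np.where(in_front, z, 1.0)  # avoid dividing by zero
    u = np.round(cam_intr[0, 0] * cam[:, 0] / safe_z + cam_intr[0, 2]).astype(int)
    v = np.round(cam_intr[1, 1] * cam[:, 1] / safe_z + cam_intr[1, 2]).astype(int)
    valid = in_front & (u >= 0) & (u < w) & (v >= 0) & (v < h)

    # Projective signed distance: measured depth minus voxel depth.
    d = np.zeros_like(z)
    d[valid] = depth_m[v[valid], u[valid]]
    sdf = d - z
    # Skip invalid pixels and voxels far behind the observed surface.
    valid &= (d > 0) & (sdf >= -trunc_margin)
    dist = np.minimum(1.0, sdf / trunc_margin)  # truncate to at most 1

    # Weighted running average, one unit of weight per observation (assumed).
    idx = tuple(vox[valid].T)
    w_old = weight[idx]
    w_new = w_old + 1.0
    tsdf[idx] = (w_old * tsdf[idx] + dist[valid]) / w_new
    weight[idx] = w_new

# Toy usage: a flat wall 1 m in front of a camera at the world origin.
tsdf = np.ones((5, 5, 9))
weight = np.zeros((5, 5, 9))
cam_intr = np.array([[100.0, 0.0, 32.0], [0.0, 100.0, 32.0], [0.0, 0.0, 1.0]])
depth = np.full((64, 64), 1.0)
integrate_depth(tsdf, weight, [-0.1, -0.1, 0.8], 0.05,
                depth, cam_intr, np.eye(4), trunc_margin=0.1)
```

In the toy volume, voxels in front of the wall end up with positive values, voxels just behind it with negative values, and the surface lies at the zero crossing. A mesh extractor such as marching cubes would then be run on the `tsdf` array.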

An older CUDA/C++ version can be found here.

Requirements

Python 2.7+ with NumPy, OpenCV, scikit-image, and Numba. These can be installed or updated by running the following:
```
pip install --user numpy opencv-python scikit-image numba
```
[Optional] GPU acceleration requires an NVIDIA GPU with CUDA and PyCUDA:
```
pip install --user pycuda
```

Demo

This demo fuses 1000 RGB-D images from the 7-scenes dataset into a 405 x 264 x 289 projective TSDF voxel volume with 2 cm resolution at about 30 FPS in GPU mode (0.4 FPS in CPU mode), and outputs a 3D mesh, mesh.ply, which can be visualized with a 3D viewer like MeshLab.
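The figures quoted above can be turned into a quick back-of-the-envelope estimate of the volume's physical size, voxel count, and total fusion time. The sketch below uses only the numbers stated in the demo description; the 4 bytes per voxel (one float32 grid) is an assumption made for illustration, not something this README specifies.

```python
# Numbers taken from the demo description.
vol_dim = (405, 264, 289)   # voxels along x, y, z
voxel_size_m = 0.02         # 2 cm resolution
n_frames = 1000
gpu_fps, cpu_fps = 30.0, 0.4

# Physical extent of the volume in meters.
extent_m = tuple(n * voxel_size_m for n in vol_dim)

# Total voxel count and memory for one grid, assuming float32 (4 bytes).
n_voxels = vol_dim[0] * vol_dim[1] * vol_dim[2]
bytes_per_grid = n_voxels * 4

# Approximate time to fuse all frames.
gpu_seconds = n_frames / gpu_fps
cpu_seconds = n_frames / cpu_fps

print("extent (m):", [round(e, 2) for e in extent_m])
print("voxels:", n_voxels)
print("MB per float32 grid:", round(bytes_per_grid / 1e6, 1))
print("GPU: %.0f s, CPU: %.0f min" % (gpu_seconds, cpu_seconds / 60))
```

This works out to a volume of roughly 8.1 m x 5.3 m x 5.8 m holding about 30.9 million voxels, with the full sequence taking on the order of half a minute in GPU mode versus roughly 40 minutes in CPU mode.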

Note: color images are saved as 24-bit PNG RGB, depth images are saved as 16-bit PNG in millimeters.

python demo.py
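For readers preparing their own data in the image formats noted above, the following sketch shows one way to load a color/depth pair with OpenCV and convert the 16-bit millimeter depth to meters. The helper names and file paths are hypothetical, and treating the value 65535 as "no measurement" is an assumption about the sensor data rather than something stated in this README.

```python
import numpy as np

def depth_png_to_meters(depth_raw, invalid_value=65535):
    """Convert a 16-bit depth image in millimeters to float meters.

    Pixels equal to `invalid_value` are set to 0 (assumed to mean "invalid").
    """
    depth_m = depth_raw.astype(np.float32) / 1000.0
    depth_m[depth_raw == invalid_value] = 0.0
    return depth_m

def load_rgbd(color_path, depth_path):
    """Load a 24-bit color PNG and a 16-bit depth PNG (hypothetical helper)."""
    import cv2  # opencv-python; imported here so the converter above needs only NumPy
    color_bgr = cv2.imread(color_path, cv2.IMREAD_COLOR)
    # IMREAD_UNCHANGED keeps the 16-bit values instead of converting to 8-bit.
    depth_raw = cv2.imread(depth_path, cv2.IMREAD_UNCHANGED)
    if color_bgr is None or depth_raw is None:
        raise FileNotFoundError("could not read %s or %s" % (color_path, depth_path))
    color_rgb = cv2.cvtColor(color_bgr, cv2.COLOR_BGR2RGB)  # OpenCV loads BGR
    return color_rgb, depth_png_to_meters(depth_raw)

# Small in-memory example of the conversion.
sample_raw = np.array([[0, 1000, 65535, 2500]], dtype=np.uint16)
sample_m = depth_png_to_meters(sample_raw)
```

Reading the depth file without `cv2.IMREAD_UNCHANGED` would silently reduce it to 8 bits, which is a common source of badly scaled reconstructions.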

Citing

This repository is a part of 3DMatch Toolbox. If you find this code useful in your work, please consider citing:

@inproceedings{zeng20163dmatch,
    title={3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions},
    author={Zeng, Andy and Song, Shuran and Nie{\ss}ner, Matthias and Fisher, Matthew and Xiao, Jianxiong and Funkhouser, Thomas},
    booktitle={CVPR},
    year={2017}
}

Owner

Andy Zeng