KinectFusion implemented in Python with PyTorch

Last update: Jan 03, 2023

Overview

KinectFusion implemented in Python with PyTorch

This is a lightweight Python implementation of KinectFusion. All the core functions (TSDF volume, frame-to-model tracking, point-to-plane ICP, raycasting, TSDF fusion, etc.) are implemented using pure PyTorch, i.e. no custom CUDA kernels.

Although without any custom CUDA functions, the system could still run at a fairly fast speed: The demo reconstructs the TUM fr1_desk sequence into a 225 x 171 x 111 TSDF volume with 2cm resolution at round 17 FPS with a single RTX-2080 GPU (~1.5 FPS in CPU mode)

Note that this project is mainly for study purpose, and is not fully optimized for accurate camera tracking.

Requirements

The core functionalities were implemented in PyTorch (1.10). Open3D (0.14.0) is used for visualisation. Other important dependancies include:

numpy==1.21.2
opencv-python==4.5.5
imageio==2.14.1
scikit-image==0.19.1
trimesh==3.9.43

You can create an anaconda environment called kinfu with the required dependencies by running:

conda env create -f environment.yml
conda activate kinfu

Data Preparation

The code was tested on TUM dataset. After downloading the raw sequences, you will need to run the pre-processing script under dataset/. For example:

python dataset/preprocess.py --config configs/fr1_desk.yaml

There are some example config files under configs/ which correspond to different sequences. You need to replace data_root to your own sequence directory before running the script. After running the script a new directory processed/ will appear under your sequence directory.

Run

After obtaining the processed sequence, you can simply run kinfu.py. For example:

python kinfu.py --config configs/fr1_desk.yaml --save_dir reconstruct/fr1_desk

which will perform the tracking and mapping headlessly and save the results. Or you could run:

python kinfu_gui.py --config configs/fr1_desk.yaml

If you want to visualize the tracking and reconstruction process on-the-fly.

Acknowledgement

Part of the tracking code was borrowed and modified from DeepIC. Also thank Binbin Xu for implementing part of the TSDF volume code which is inspired by Andy Zeng's tsdf-fusion-python.

KinectFusion implemented in Python with PyTorch

Related tags

Overview

KinectFusion implemented in Python with PyTorch

Requirements

Data Preparation

Run

Acknowledgement

Owner

Jingwen Wang

Line-level Handwritten Text Recognition (HTR) system implemented with TensorFlow.

WSDM‘2022: Knowledge Enhanced Sports Game Summarization

You Only 👀 One Sequence

[ICCV 2021 Oral] PoinTr: Diverse Point Cloud Completion with Geometry-Aware Transformers

RGBD-Net - This repository contains a pytorch lightning implementation for the 3DV 2021 RGBD-Net paper.

EDCNN: Edge enhancement-based Densely Connected Network with Compound Loss for Low-Dose CT Denoising

Official repository for the ICCV 2021 paper: UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model.

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

The official implementation of paper Siamese Transformer Pyramid Networks for Real-Time UAV Tracking, accepted by WACV22

The end-to-end platform for building voice products at scale

Exploring Cross-Image Pixel Contrast for Semantic Segmentation

Keras-tensorflow implementation of Fully Convolutional Networks for Semantic Segmentation（Unfinished）

Python Auto-ML Package for Tabular Datasets

FIRA: Fine-Grained Graph-Based Code Change Representation for Automated Commit Message Generation

Code for ICDM2020 full paper: "Sub-graph Contrast for Scalable Self-Supervised Graph Representation Learning"

Course about deep learning for computer vision and graphics co-developed by YSDA and Skoltech.

🔅 Shapash makes Machine Learning models transparent and understandable by everyone

PyTorch implementations of neural network models for keyword spotting

Company clustering with K-means/GMM and visualization with PCA, t-SNE, using SSAN relation extraction

Official code for the ICCV 2021 paper "DECA: Deep viewpoint-Equivariant human pose estimation using Capsule Autoencoders"