Multiview Dataset Toolkit

Using multi-view cameras is a natural way to obtain a complete point cloud. However, to date there is only one multi-view 3D hand pose dataset, NYU. Furthermore, NYU is primarily used as a depth map dataset; although it also provides RGB images, they are of low resolution and quality. FreiHand also records data using a multi-view setup, but the released images are not from corresponding viewpoints. In that sense, it can be regarded only as a single-view dataset containing multiple views rather than a true multi-view dataset.
To fill this gap, we present a new multi-view RGB-D 3D hand pose dataset. We use four RealSense D415 cameras placed at different viewpoints to record 4 RGB-D sequences from 4 subjects at a resolution of 640 × 480. We use a 21-joint model to annotate the hand pose. Additionally, we provide hand masks, 2D and 3D joint locations, hand meshes in the form of MANO parameters, real complete hand point clouds, and full camera parameters. In particular, we provide the extrinsic camera parameters, so users can easily combine information across views.
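
To illustrate how extrinsic parameters let one view be related to another, here is a minimal sketch in NumPy. It is not the toolkit's own code: the helper names, the intrinsic values and the camera-to-world convention for the 4x4 matrices are all assumptions made for the example, so check the convention and units of the released calibration files before reusing it.

```python
import numpy as np

def transform_points(points, transform):
    """Apply a 4x4 rigid transform to an (N, 3) array of 3D points."""
    ones = np.ones((points.shape[0], 1))
    homogeneous = np.hstack([points, ones])  # (N, 4) homogeneous coordinates
    return (homogeneous @ transform.T)[:, :3]

def project_points(points_cam, intrinsic):
    """Project (N, 3) camera-frame points to (N, 2) pixels with a pinhole model."""
    uvw = points_cam @ intrinsic.T
    return uvw[:, :2] / uvw[:, 2:3]  # divide by depth

# Placeholder intrinsics for a 640 x 480 image (NOT the dataset's calibration).
K = np.array([[600.0, 0.0, 320.0],
              [0.0, 600.0, 240.0],
              [0.0, 0.0, 1.0]])

# Assumed convention: each extrinsic maps camera coordinates to a shared world frame.
cam_a_to_world = np.eye(4)
cam_b_to_world = np.eye(4)
cam_b_to_world[:3, 3] = [0.1, 0.0, 0.0]  # camera b shifted 0.1 units along x

# Chain the two extrinsics to move points from camera a into camera b.
a_to_b = np.linalg.inv(cam_b_to_world) @ cam_a_to_world

point_in_a = np.array([[0.0, 0.0, 1.0]])  # one 3D point seen by camera a
point_in_b = transform_points(point_in_a, a_to_b)

pixel_in_a = project_points(point_in_a, K)
pixel_in_b = project_points(point_in_b, K)
```

The same two helpers can be used to reproject annotated 3D joints into each of the four views, provided the matrices are loaded in the convention the dataset actually uses.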

Basic setup

download the data
install the basic requirements

pip install numpy matplotlib scikit-image transforms3d tqdm opencv-python trimesh pyrender

run the example code

python toolkit.py
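
Before running the example, it can help to confirm that the requirements are importable. The sketch below is a convenience check and not part of the toolkit; `missing_packages` is a hypothetical helper name. The mapping reflects the fact that some pip names differ from their import names (`scikit-image` imports as `skimage`, `opencv-python` as `cv2`).

```python
import importlib.util

# pip package name -> Python import name
REQUIREMENTS = {
    "numpy": "numpy",
    "matplotlib": "matplotlib",
    "scikit-image": "skimage",
    "transforms3d": "transforms3d",
    "tqdm": "tqdm",
    "opencv-python": "cv2",
    "trimesh": "trimesh",
    "pyrender": "pyrender",
}

def missing_packages(requirements=REQUIREMENTS):
    """Return the pip names of packages whose import name cannot be found."""
    return [pip_name for pip_name, module in requirements.items()
            if importlib.util.find_spec(module) is None]

if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages: pip install " + " ".join(missing))
    else:
        print("All requirements are installed.")
```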

Provided data

color images from four views
depth images from four views
intrinsic and extrinsic camera parameters
21 hand joints, indexed as follows:
- 0 wrist
- 1 mcp index, 2 pip index, 3 dip index, 4 tip index
- 5 mcp middle, 6 pip middle, 7 dip middle, 8 tip middle
- 9 mcp ring, 10 pip ring, 11 dip ring, 12 tip ring
- 13 mcp pinky, 14 pip pinky, 15 dip pinky, 16 tip pinky
- 17 mcp thumb, 18 pip thumb, 19 dip thumb, 20 tip thumb
MANO parameters
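
The joint ordering listed above can be written down as a lookup table. The following is a sketch rather than code from toolkit.py: the names `JOINT_NAMES`, `PARENTS` and `bone_lengths` are hypothetical, and the connectivity (each MCP joint attached to the wrist, every other joint attached to the previous index) is an assumed convention that the list itself does not state.

```python
import numpy as np

FINGERS = ("index", "middle", "ring", "pinky", "thumb")
PARTS = ("mcp", "pip", "dip", "tip")

# Index 0 is the wrist, followed by four joints per finger in the listed order.
JOINT_NAMES = ["wrist"] + [f"{part}_{finger}"
                           for finger in FINGERS for part in PARTS]

# Assumed kinematic tree: MCP joints hang off the wrist, the rest off the
# joint just before them. The wrist has no parent (-1).
PARENTS = [-1] + [0 if (i - 1) % 4 == 0 else i - 1 for i in range(1, 21)]

def bone_lengths(joints):
    """Return the 20 bone lengths for a (21, 3) array of 3D joint positions."""
    joints = np.asarray(joints, dtype=float)
    children = np.arange(1, 21)
    parents = np.array(PARENTS[1:])
    return np.linalg.norm(joints[children] - joints[parents], axis=1)
```

Checking that bone lengths stay roughly constant across frames of a sequence is a quick sanity test for loaded annotations.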

Access the dataset

data usage in toolkit.py
- drawMesh
- drawPose4view
- getBetterDepth

Info for our camera calibration

here

Terms of use

@InProceedings{Local2021,
  author    = {Ziwei Yu and Linlin Yang and Shicheng Chen and Angela Yao},
  title     = {Local and Global Point Cloud Reconstruction for 3D Hand Pose Estimation},
  booktitle = {British Machine Vision Conference (BMVC)},
  year      = {2021},
  url       = {https://github.com/ShichengChen/multiviewDataset}
}
