Real-time 3D multi-person detection made easy with OpenPose and the ZED

Last update: Nov 06, 2020

Overview

OpenPose ZED

This sample shows how to use the ZED with OpenPose, the deep learning framework that detects human skeletons from a single 2D image. The 3D information provided by the ZED is used to place the detected joints in space. The output is a 3D view of the skeletons.
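
To make the "place the joints in space" step concrete, here is a minimal sketch of the underlying geometry: back-projecting one 2D keypoint into camera coordinates with a pinhole model. It is not the sample's code. The function name `keypoint_to_3d` and the intrinsics values are hypothetical, and a real application could just as well read the XYZ value of the pixel from the camera's point cloud.

```python
import math
from typing import Optional, Tuple


def keypoint_to_3d(u: float, v: float, depth: float,
                   fx: float, fy: float,
                   cx: float, cy: float) -> Optional[Tuple[float, float, float]]:
    """Back-project pixel (u, v) with metric depth into camera coordinates.

    Hypothetical helper: fx, fy are focal lengths in pixels and
    (cx, cy) is the principal point, all assumed to be known.
    """
    # Depth maps usually hold NaN or infinity where no measurement exists.
    if not math.isfinite(depth) or depth <= 0:
        return None
    x = (u - cx) * depth / fx
    y = (v - cy) * depth / fy
    return (x, y, depth)


# Assumed intrinsics for a 1280x720 image; real values come from calibration.
joint = keypoint_to_3d(740.0, 360.0, 2.0, fx=700.0, fy=700.0, cx=640.0, cy=360.0)
print(joint)
```

Returning `None` for invalid depth lets the caller skip joints that fall on holes in the depth map instead of drawing them at a wrong position.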

Installation

OpenPose

This sample can be placed in the examples/user_code/ folder of OpenPose. Preferably, build and install OpenPose with CMake, then compile this sample anywhere.

Installing OpenPose with CMake is straightforward.

Clone the repository:

    git clone https://github.com/CMU-Perceptual-Computing-Lab/openpose/

Build and install it:

    cd openpose
    mkdir build
    cd build
    cmake .. # This can take a while
    make -j8
    sudo make install

ZED SDK

The ZED SDK is also required for this sample. Download the ZED SDK and follow the installation instructions.

The floor plane detection requires ZED SDK 2.4. It can easily be disabled to use the sample with an older ZED SDK version.

Build the program

Open a terminal in the sample directory and execute the following commands:

    mkdir build
    cd build
    cmake ..
    make -j8

Then, from the build directory, create a symbolic link to the OpenPose models folder so the program can load the models:

    ln -s ~/path/to/openpose/models "$(pwd)"

A models folder should now be present in the build folder.

Run the program

Open a terminal in the build directory and run the sample:
```
  ./zed_openpose
```

Options

In addition to the OpenPose options, several more were added, mainly:

| Option | Description |
|---|---|
| svo_path | SVO file path to load instead of opening the ZED |
| ogl_ptcloud | Boolean to show the point cloud in the OpenGL window |
| estimate_floor_plane | Boolean to align the point cloud on the floor plane |
| opencv_display | Enable the 2D view of the OpenPose output |
| depth_display | Display the depth map with OpenCV |

Example:

    ./zed_openpose -net_resolution 320x240 -ogl_ptcloud true -svo_path ~/foo/bar.svo
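
When the sample is launched from a script, for instance to replay several SVO files, the same command line can be assembled programmatically. The sketch below only mirrors the flag syntax of the example above; `build_command` is a hypothetical helper, not part of the sample, and the SVO path is the placeholder from that example.

```python
import os
from typing import Dict, List


def build_command(executable: str, options: Dict[str, str]) -> List[str]:
    """Turn an option mapping into an argument list: -name value -name value ..."""
    args = [executable]
    for name, value in options.items():
        # Same "-name value" form as the command line shown above.
        args += ["-" + name, value]
    return args


cmd = build_command("./zed_openpose", {
    "net_resolution": "320x240",
    "ogl_ptcloud": "true",
    "svo_path": os.path.expanduser("~/foo/bar.svo"),
})
print(" ".join(cmd))
# To actually launch it from the build directory:
#   import subprocess; subprocess.run(cmd, check=True)
```

Passing the arguments as a list rather than a single shell string avoids quoting problems when a path contains spaces.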

Notes

This sample is a proof of concept and might not be robust in every situation. Floor plane detection in particular can fail if the environment is cluttered.
This sample was only tested on Linux but should be easy to run on Windows.
This sample requires both OpenPose and the ZED SDK, which both rely heavily on the GPU.
Only the body keypoints are currently used. The same approach could be applied to hand and facial keypoints, though the precision required might be a limiting factor.
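
As a rough illustration of why a floor plane is useful once the joints are in 3D: given plane coefficients, the signed distance of a joint to the plane gives its height above the floor. This is a sketch under stated assumptions, not the sample's code. The function name, the plane coefficients, and the Y-up coordinate frame are all hypothetical; the sample itself relies on the ZED SDK for floor plane detection.

```python
import math
from typing import Sequence


def height_above_plane(point: Sequence[float], plane: Sequence[float]) -> float:
    """Signed distance from a 3D point to the plane a*x + b*y + c*z + d = 0.

    The result is positive on the side the normal (a, b, c) points to.
    """
    a, b, c, d = plane
    norm = math.sqrt(a * a + b * b + c * c)
    if norm == 0.0:
        raise ValueError("plane normal must be non-zero")
    x, y, z = point
    return (a * x + b * y + c * z + d) / norm


# Assumed floor: horizontal plane 1.5 m below the camera, normal pointing up (+Y).
floor = (0.0, 1.0, 0.0, 1.5)
head = (0.2, 0.1, 2.0)  # hypothetical joint position in meters
print(height_above_plane(head, floor))
```

A joint whose height comes out clearly negative is below the estimated floor, which is a cheap hint that either the depth value or the plane estimate is unreliable.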


Owner

blanktec
