Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Last update: Dec 31, 2022

Overview

ONNX-Mobile-Human-Pose-3D

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model.

Original image for inference: (https://static2.diariovasco.com/www/pre2017/multimedia/noticias/201412/01/media/DF0N5391.jpg)

❗ ⚠️ Known issues

The models works well when the person is looking forward and without occlusions, it will start to fail as soon as the person is occluded.
The model is fast, but the 3D representation is slow due to matplotlib, this will be fixed. The 3d representation can be ommitted for faster inference by setting draw_3dpose to False

Requirements

OpenCV, imread-from-url, scipy, onnx and onnxruntime. Also, pafy and youtube-dl are required for youtube video inference.

Installation

pip install -r requirements.txt
pip install pafy youtube-dl

ONNX model

The original models were converted to different formats (including .onnx) by PINTO0309, download the models from his repository and save them into the models folder.

YOLOv5s: You will also need an object detector to first detect the people in the image. Download the model from the model zoo and save the .onnx version into the models folder.

Original model

The original model was taken from the original repository.

Examples

Image inference:

python imagePoseEstimation.py

Video inference:

python videoPoseEstimation.py

Webcam inference:

python webcamPoseEstimation.py

Inference video Example

References:

Mobile human pose model: https://github.com/SangbumChoi/MobileHumanPose
PINTO0309's model zoo: https://github.com/PINTO0309/PINTO_model_zoo
PINTO0309's model conversion tool: https://github.com/PINTO0309/openvino2tensorflow
3DMPPE_POSENET_RELEASE repository: https://github.com/mks0601/3DMPPE_POSENET_RELEASE
Original YOLOv5 repository: https://github.com/ultralytics/yolov5
Original paper: https://openaccess.thecvf.com/content/CVPR2021W/MAI/html/Choi_MobileHumanPose_Toward_Real-Time_3D_Human_Pose_Estimation_in_Mobile_Devices_CVPRW_2021_paper.html

Python scripts for performing 3D human pose estimation using the Mobile Human Pose model in ONNX.

Related tags

Overview

ONNX-Mobile-Human-Pose-3D

❗ ⚠️ Known issues

Requirements

Installation

ONNX model

Original model

Examples

Inference video Example

References:

Owner

Ibai Gorordo

This code is a near-infrared spectrum modeling method based on PCA and pls

Convolutional Neural Networks

The code for paper "Contrastive Spatio-Temporal Pretext Learning for Self-supervised Video Representation" which is accepted by AAAI 2022

Neural Koopman Lyapunov Control

Deep Image Search is an AI-based image search engine that includes deep transfor learning features Extraction and tree-based vectorized search.

Official implementation of the paper 'High-Resolution Photorealistic Image Translation in Real-Time: A Laplacian Pyramid Translation Network' in CVPR 2021

Controlling a game using mediapipe hand tracking

A privacy-focused, intelligent security camera system.

AI pipelines for Nvidia Jetson Platform

Pytorch code for ICRA'21 paper: "Hierarchical Cross-Modal Agent for Robotics Vision-and-Language Navigation"

Learning What and Where to Draw

This Deep Learning Model Predicts that from which disease you are suffering.

docTR by Mindee (Document Text Recognition) - a seamless, high-performing & accessible library for OCR-related tasks powered by Deep Learning.

Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.

An Active Automata Learning Library Written in Python

PyBrain - Another Python Machine Learning Library.

This repository focus on Image Captioning & Video Captioning & Seq-to-Seq Learning & NLP

Source-to-Source Debuggable Derivatives in Pure Python

NCNN implementation of Real-ESRGAN. Real-ESRGAN aims at developing Practical Algorithms for General Image Restoration.

A toy project using OpenCV and PyMunk