Yoga Pose Identification and Icon Matching

Project Goal

Detect yoga poses performed by a user and overlay a corresponding icon image. Running the main script starts the video stream with automatic pose detection.

Part 1: Pose Detection

I use the 33 body landmarks provided by MediaPipe to measure joint angles, then identify each yoga pose from the joint angles that characterize it. For example, in the star pose, the angle formed by the shoulder, elbow, and wrist landmarks (elbow flexion) is below 20 degrees, and the angle formed by the elbow, shoulder, and opposite shoulder (shoulder flexion) is also below 20 degrees.
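
The angle check described above can be sketched in a few lines. This is a minimal illustration, not the project's actual code: it assumes landmarks are already available as (x, y) pairs, it interprets "flexion" as the deviation from a straight line (180 degrees minus the interior angle at the joint), and the dictionary keys and the helper name `arm_matches_star_pose` are hypothetical.

```python
import math


def joint_angle(a, b, c):
    """Return the interior angle at vertex b, in degrees, for points a-b-c."""
    v1 = (a[0] - b[0], a[1] - b[1])
    v2 = (c[0] - b[0], c[1] - b[1])
    norm = math.hypot(*v1) * math.hypot(*v2)
    if norm == 0:
        raise ValueError("landmarks must be distinct points")
    cos_angle = (v1[0] * v2[0] + v1[1] * v2[1]) / norm
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_angle = max(-1.0, min(1.0, cos_angle))
    return math.degrees(math.acos(cos_angle))


def flexion(a, b, c):
    """Assumed definition: deviation from a straight line through b (0 = straight)."""
    return 180.0 - joint_angle(a, b, c)


def arm_matches_star_pose(lm, threshold=20.0):
    """Check one arm against the star-pose thresholds (hypothetical landmark keys)."""
    elbow_flexion = flexion(lm["shoulder"], lm["elbow"], lm["wrist"])
    shoulder_flexion = flexion(lm["elbow"], lm["shoulder"], lm["opposite_shoulder"])
    return elbow_flexion < threshold and shoulder_flexion < threshold


# Illustrative normalized coordinates for an arm held straight out to the side.
straight_arm = {
    "opposite_shoulder": (0.40, 0.30),
    "shoulder": (0.60, 0.30),
    "elbow": (0.75, 0.30),
    "wrist": (0.90, 0.30),
}
# Same arm with the forearm bent upward at the elbow.
bent_arm = dict(straight_arm, wrist=(0.75, 0.10))
```

With these made-up coordinates, `arm_matches_star_pose(straight_arm)` passes both thresholds, while `bent_arm` fails the elbow check because the forearm is at a right angle to the upper arm.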

Part 2: Icon Image Transformation

To transform the icon image that will be overlaid on the user, I first preprocess the icon and then apply an affine transform. To preprocess the icon, I resize it to roughly the same height as the user, a measurement also calculated from MediaPipe's landmarks. I then add a border to the icon so that its image array has the same dimensions as the video stream frames. These steps help make the affine transform more effective. For the transform itself, I select three key landmarks for each pose, then find three key points on the icon that should match them. For example, I chose to match the nose and ankles of the person with the top tip and bottom two tips of the star.
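
Three point pairs are exactly enough to determine an affine transform, which is why three landmarks are matched per pose. The sketch below solves for the 2x3 matrix directly with NumPy so the math is visible. All coordinates are invented for illustration, and the function names are hypothetical. In OpenCV, `cv2.getAffineTransform` computes this kind of matrix from three point pairs and `cv2.warpAffine` applies it to an image; whether the project uses exactly those calls is an assumption.

```python
import numpy as np


def affine_from_three_points(src, dst):
    """Solve for the 2x3 matrix M with M @ [x, y, 1] == dst point for each src point."""
    src = np.asarray(src, dtype=float)  # shape (3, 2): points on the icon
    dst = np.asarray(dst, dtype=float)  # shape (3, 2): matching body landmarks
    # Homogeneous source coordinates: one row [x, y, 1] per point.
    A = np.hstack([src, np.ones((3, 1))])
    # A @ M.T == dst, so solve the 3x3 system for both output coordinates at once.
    # This fails if the three source points are collinear (singular system).
    return np.linalg.solve(A, dst).T


def apply_affine(M, point):
    """Map a single (x, y) point through the 2x3 affine matrix M."""
    x, y = point
    return M @ np.array([x, y, 1.0])


# Hypothetical pixel coordinates: star tips on the icon ...
icon_points = [(100, 10), (40, 190), (160, 190)]   # top tip, bottom-left, bottom-right
# ... and the matching nose and ankle landmarks in the video frame.
body_points = [(320, 120), (250, 430), (390, 430)]  # nose, left ankle, right ankle

M = affine_from_three_points(icon_points, body_points)
mapped_top_tip = apply_affine(M, icon_points[0])
```

Here `mapped_top_tip` lands on the nose coordinate, and the two bottom tips map onto the ankles in the same way.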

Part 3: Image Overlay

I overlay only the icon pixels (the icon background is ignored) by summing 0.5 of the icon pixel value with 0.5 of the video frame value, which produces a semi-transparent overlay of just the icon.
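
A minimal NumPy sketch of this masked 50/50 blend follows. It assumes the icon has already been transformed to the frame's dimensions and that background pixels are pure black (value 0); the source does not say how the background is identified, so that choice and the function name `overlay_icon` are assumptions.

```python
import numpy as np


def overlay_icon(frame, icon, background_value=0, alpha=0.5):
    """Blend icon foreground pixels into frame; leave frame untouched elsewhere."""
    if frame.shape != icon.shape:
        raise ValueError("icon must have the same shape as the frame")
    # Assumption: a pixel is foreground if any channel differs from the background value.
    mask = np.any(icon != background_value, axis=-1)
    out = frame.astype(np.float32)
    # Weighted sum on foreground pixels only: alpha * icon + (1 - alpha) * frame.
    out[mask] = alpha * icon[mask].astype(np.float32) + (1.0 - alpha) * out[mask]
    return np.clip(np.rint(out), 0, 255).astype(np.uint8)


# Tiny synthetic example: a uniform gray frame and an icon with one bright pixel.
frame = np.full((4, 4, 3), 100, dtype=np.uint8)
icon = np.zeros((4, 4, 3), dtype=np.uint8)
icon[1, 1] = (200, 200, 200)

blended = overlay_icon(frame, icon)
```

In this example only the pixel at row 1, column 1 changes (to the midpoint of 100 and 200); every other pixel keeps the original frame value because the icon is background there.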

OpenCV, MediaPipe Pose Estimation, Affine Transform for Icon Overlay

Results

Star Pose

Tree Pose

Chair Pose

Owner

Anna Garverick
