3D HourGlass Networks for Human Pose Estimation Through Videos

Last update: Jan 02, 2023

Overview

3D-HourGlass-Network

A 3D-CNN-based hourglass network for 3D human pose estimation from videos. This was my summer 2018 research project.

Discussion

In this work I try to extend the idea of 3D CNN inflation from Carreira et al. (CVPR'17), proposed there for action recognition from videos, to human pose estimation from videos. We start from a pretrained hourglass network with a fully connected depth regressor, inflate its 2D convolutions to 3D convolutions, and perform temporal 3D human pose estimation. This inflation helps the network learn features from nearby frames and use them to refine its predictions. A similar idea was used in Girdhar et al. (CVPR'18), at about the same time, where they perform multi-person human pose estimation from videos using an inflated Mask R-CNN.
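The inflation step described above can be sketched roughly as follows. This is a minimal illustration, not the code from this repository: the helper name `inflate_conv2d` and the temporal kernel size of 3 are assumptions, and the weight initialisation (repeat the 2D kernel along time and divide by the temporal extent) follows the bootstrapping scheme commonly associated with Carreira et al. It also assumes the 2D layer uses numeric padding.

```python
import torch
import torch.nn as nn


def inflate_conv2d(conv2d: nn.Conv2d, time_kernel: int = 3) -> nn.Conv3d:
    """Hypothetical helper: build a Conv3d initialised from a pretrained Conv2d."""
    conv3d = nn.Conv3d(
        conv2d.in_channels,
        conv2d.out_channels,
        kernel_size=(time_kernel, *conv2d.kernel_size),
        stride=(1, *conv2d.stride),
        # Pad in time so the number of frames is preserved (odd time_kernel).
        padding=(time_kernel // 2, *conv2d.padding),
        dilation=(1, *conv2d.dilation),
        groups=conv2d.groups,
        bias=conv2d.bias is not None,
    )
    with torch.no_grad():
        # (out, in, kH, kW) -> (out, in, T, kH, kW); dividing by T keeps the
        # response on a clip of identical frames equal to the 2D response.
        weight = conv2d.weight.unsqueeze(2).repeat(1, 1, time_kernel, 1, 1)
        conv3d.weight.copy_(weight / time_kernel)
        if conv2d.bias is not None:
            conv3d.bias.copy_(conv2d.bias)
    return conv3d


# Usage: a clip made of one repeated frame, laid out as (N, C, T, H, W).
torch.manual_seed(0)
conv2d = nn.Conv2d(3, 8, kernel_size=3, padding=1)
conv3d = inflate_conv2d(conv2d, time_kernel=3)
frame = torch.randn(1, 3, 16, 16)
clip = frame.unsqueeze(2).repeat(1, 1, 5, 1, 1)
out2d = conv2d(frame)
out3d = conv3d(clip)
```

With this initialisation the inflated layer starts out reproducing the pretrained 2D features on interior frames of a static clip (the first and last frames differ because of zero padding in time), and training then lets it pick up genuinely temporal patterns.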

Requirements

python 3.6
pytorch 0.4
torchvision
progress

Datasets

We used the Human3.6M dataset for this project.

Instructions to run

python main.py -expID [EXP-NAME] -nFramesReg [NUM-FRAMES]

Results

We improved on the baseline hourglass network, reducing MPJPE (mean per-joint position error; lower is better) from 64 to 62.8, which shows the value of temporal features in real-world problems. The same idea could easily be extended to other tasks such as semantic segmentation and object detection.
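For readers unfamiliar with the metric, MPJPE is the Euclidean distance between each predicted joint and its ground-truth position, averaged over joints. The sketch below is a plain-Python illustration under that standard definition; the function name `mpjpe` and the toy coordinates are made up for the example, and the evaluation code in this repository may apply extra steps (such as root alignment) that are not shown here.

```python
import math


def mpjpe(pred, target):
    """Mean per-joint position error for one pose.

    pred, target: sequences of (x, y, z) joint coordinates in the same unit.
    """
    if len(pred) != len(target) or not pred:
        raise ValueError("pred and target must be non-empty and equally long")
    distances = [
        math.sqrt(sum((p - t) ** 2 for p, t in zip(pred_joint, target_joint)))
        for pred_joint, target_joint in zip(pred, target)
    ]
    return sum(distances) / len(distances)


# Toy example (hypothetical values): one joint is exact, the other is off
# by a 3-4-5 triangle, so the per-joint errors are 0 and 5.
predicted = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0)]
ground_truth = [(0.0, 0.0, 0.0), (0.0, 0.0, 0.0)]
error = mpjpe(predicted, ground_truth)
```

The dataset-level figure is then the average of this per-pose value over all evaluated frames.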

Owner

Naman Jain
