MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Last update: Jan 07, 2023

Related tags

Overview

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

This repo is the official implementation of "MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation, Wenhao Li, Hong Liu, Hao Tang, Pichao Wang, Luc Van Gool" in PyTorch.

Dependencies

Cuda 11.1
Python 3.6
Pytorch 1.7.1

Dataset setup

Please download the dataset from Human3.6m website and refer to VideoPose3D to set up the Human3.6M dataset ('./dataset' directory).

${POSE_ROOT}/
|-- dataset
|   |-- data_3d_h36m.npz
|   |-- data_2d_h36m_cpn_ft_h36m_dbb.npz

Download pretrained model

The pretrained model can be found in Google_Drive, please download it and put in the './checkpoint' dictory.

Test the model

To test on pretrained model on Human3.6M:

python main.py --reload --previous_dir 'checkpoint/pretrained'

Here, we compare our MHFormer with recent state-of-the-art methods on Human3.6M dataset. Evaluation metric is Mean Per Joint Position Error (MPJPE) in mm.

Models	MPJPE
VideoPose3D	46.8
PoseFormer	44.3
MHFormer	43.0

Train the model

To train on Human3.6M:

python main.py --train

Citation

If you find our work useful in your research, please consider citing:

@article{li2021mhformer,
  title={MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation},
  author={Li, Wenhao and Liu, Hong and Tang, Hao and Wang, Pichao and Van Gool, Luc},
  journal={arXiv preprint},
  year={2021}
}

Acknowledgement

Our code is extended from the following repositories. We thank the authors for releasing the codes.

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Related tags

Overview

MHFormer: Multi-Hypothesis Transformer for 3D Human Pose Estimation

Dependencies

Dataset setup

Download pretrained model

Test the model

Train the model

Citation

Acknowledgement

Owner

Vegetabird

This repository provides an unified frameworks to train and test the state-of-the-art few-shot font generation (FFG) models.

A Comprehensive Analysis of Weakly-Supervised Semantic Segmentation in Different Image Domains (IJCV submission)

atmaCup #11 の Public 4th / Pricvate 5th Solution のリポジトリです。

This folder contains the implementation of the multi-relational attribute propagation algorithm.

Using Self-Supervised Pretext Tasks for Active Learning - Official Pytorch Implementation

Eye-Blink-Counter - Python based Computer Vision project which counts how many time a person blinks

Marvis is Mastouri's Jarvis version of the AI-powered Python personal assistant.

A PyTorch Implementation of SphereFace.

code for Fast Point Cloud Registration with Optimal Transport

[ICCV 2021 Oral] Deep Evidential Action Recognition

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

Code for a real-time distributed cooperative slam(RDC-SLAM) system for ROS compatible platforms.

FTIR-Deep Learning - FTIR Deep Learning With Python

NDE: Climate Modeling with Neural Diffusion Equation, ICDM'21

This is a project based on retinaface face detection, including ghostnet and mobilenetv3

[ICML 2020] DrRepair: Learning to Repair Programs from Error Messages

DC3: A Learning Method for Optimization with Hard Constraints

Codes for TIM2021 paper "Anchor-Based Spatio-Temporal Attention 3-D Convolutional Networks for Dynamic 3-D Point Cloud Sequences"

Reproduction of Vision Transformer in Tensorflow2. Train from scratch and Finetune.

Unofficial implementation of One-Shot Free-View Neural Talking Head Synthesis