PRTR: Pose Recognition with Cascade Transformers

Last update: Dec 30, 2022

Introduction

This repository is the official implementation of Pose Recognition with Cascade Transformers. The paper proposes two types of cascade Transformers for pose recognition, described below.

Two-stage Transformers

Please refer to README.md for detailed usage of the two-stage model variant.

Sequential Transformers

Please refer to README.md for detailed usage of the sequential (end-to-end) model variant.

For more details, please see Pose Recognition with Cascade Transformers by Ke Li*, Shijie Wang*, Xiang Zhang*, Yifan Xu, Weijian Xu, and Zhuowen Tu.

Updates

Code and pretrained models will be released soon.

Citation

@misc{li2021pose,
      title={Pose Recognition with Cascade Transformers}, 
      author={Ke Li and Shijie Wang and Xiang Zhang and Yifan Xu and Weijian Xu and Zhuowen Tu},
      year={2021},
      eprint={2104.06976},
      archivePrefix={arXiv},
      primaryClass={cs.CV}
}

License

This repository is released under the Apache License 2.0. The license can be found in the LICENSE file.

Acknowledgments

This project is based on the following open source repositories, which greatly facilitated our research.

Thanks to DETR for the implementation of Detection Transformer
Thanks to HRNet-Human-Pose-Estimation for the training and evaluation pipeline
Thanks to HRNet-Image-Classification for the HRNet backbone implementation

Owner

mlpc-ucsd
