Synthesizing Long-Term 3D Human Motion and Interaction in 3D in CVPR2021

Last update: Dec 13, 2022

Overview

Long-term-Motion-in-3D-Scenes

This is an implementation of the CVPR'21 paper "Synthesizing Long-Term 3D Human Motion and Interaction in 3D".

Please check our paper and the project webpage for more details.

Citation

If you use our code or paper, please consider citing:

@article{wang2020synthesizing,
  title={Synthesizing Long-Term 3D Human Motion and Interaction in 3D Scenes},
  author={Wang, Jiashun and Xu, Huazhe and Xu, Jingwei and Liu, Sifei and Wang, Xiaolong},
  journal={arXiv preprint arXiv:2012.05522},
  year={2020}
}

Dependencies

Requirements:

python3.6
pytorch==1.1.0
trimesh
open3d
Chamfer Pytorch
Human Body Prior
SMPL-X

Datasets

We use PROX and PROXE datasets as our training data. After downloading them, please put them in './data/'. We provide generate_routepose_data.ipynb and generate_sub_data.ipynb for data generation. Note in PROX, the human meshes and the scene meshes are not in the same area in the world coordinates. Different from PROX and PROXE, we apply the inverse of the camera extrinsics to the scene mesh. Since the scene is the input and we need it to be aligned with the human bodies. This is done in the data generation code. Thus for contact calculating, you do not need to apply transformation to them. While for collision calculating, you still need to apply the transformation to the human bodies similar to PROXE to make it be aligned with SDF. Please be careful with this during training or testing, especially if you want to test on other scenes such as Matterport3D. Please put body_segments data in './data/' as well.

Demo

We provide demo.ipynb to help you play with our method. Before running, please put a downsampled MPH16.ply mesh and the SDF data of this scene in './demo_data/'. You can download them from PROX and PROXE. Still, please be careful with the camera extrinsics when you want to test other scenes, make sure the human body is in the scene. This code will also show you how to optimize the whole motion.

Models

We use SMPL-X to represent human bodies. Please download the SMPL-X models and put them in './models/' and it may look like './models/smplx/SMPLX_NEUTRAL.npz'. Please download vposer model and put it in './' ('./vposer_v1_0/').

We also provide our pretrained model here

Training

After you generate the data. You can train the networks directly,

python train_subgoal.py

python train_route.py

Please train the posenet after you finished training routenet with your own pretrained routenet model,

python train_pose.py

Acknowledgement

This work was supported, in part, by grants from DARPA LwLL, NSF 1730158 CI-New: Cognitive Hardware and Software Ecosystem Community Infrastructure (CHASE-CI), NSF ACI-1541349 CC*DNI Pacific Research Platform, and gifts from Qualcomm and TuSimple. Part of our code is based on PROXE and it may help you with the dependencies and dataset parts as well. Many thanks!

License

Apache-2.0 License

Synthesizing Long-Term 3D Human Motion and Interaction in 3D in CVPR2021

Related tags

Overview

Long-term-Motion-in-3D-Scenes

Citation

Dependencies

Datasets

Demo

Models

Training

Acknowledgement

License

Owner

Jiashun Wang

PyZebrascope - an open-source Python platform for brain-wide neural activity imaging in behaving zebrafish

Code for PackNet: Adding Multiple Tasks to a Single Network by Iterative Pruning

An implementation of DeepMind's Relational Recurrent Neural Networks in PyTorch.

Official implementation of "Generating 3D Molecules for Target Protein Binding"

Point detection through multi-instance deep heatmap regression for sutures in endoscopy

PyTorch-Multi-Style-Transfer - Neural Style and MSG-Net

Self-Supervised Generative Style Transfer for One-Shot Medical Image Segmentation

A curated list of programmatic weak supervision papers and resources

A repository for generating stylized talking 3D and 3D face

Alignment Attention Fusion framework for Few-Shot Object Detection

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

CPT: A Pre-Trained Unbalanced Transformer for Both Chinese Language Understanding and Generation

ByteTrack: Multi-Object Tracking by Associating Every Detection Box

Algo-burn - Script to configure an Algorand address as a "burn" address for one or more ASA tokens

Agent-based model simulator for air quality and pandemic risk assessment in architectural spaces

A flexible and extensible framework for gait recognition.

Face Recognition and Emotion Detector Device

Apply our monocular depth boosting to your own network!

Code for Max-Margin Contrastive Learning - AAAI 2022

Pytorch implemenation of Stochastic Multi-Label Image-to-image Translation (SMIT)