This repository contains the code for the paper "Hierarchical Motion Understanding via Motion Programs"

Last update: Dec 05, 2022

Overview

Hierarchical Motion Understanding via Motion Programs (CVPR 2021)

This repository contains the official implementation of:

Hierarchical Motion Understanding via Motion Programs

full paper | short talk | long talk | project webpage

Running motion2prog

0. Starting from a video file, first prepare the input data:

$ mkdir -p ${video_dir}/frames   # ffmpeg does not create the output directory
$ ffmpeg -i ${video_dir}/video.mp4 ${video_dir}/frames/%05d.jpg
$ python AlphaPose/scripts/demo_inference.py \
    --cfg AlphaPose/pretrained_models/256x192_res50_lr1e-3_1x.yaml \
    --checkpoint AlphaPose/pretrained_models/halpe26_fast_res50_256x192.pth \
    --indir ${video_dir}/frames --outdir ${video_dir}/pose_mpii_track \
    --pose_track --showbox --flip --qsize 256
$ mv ${video_dir}/pose_mpii_track/alphapose-results.json \
    ${video_dir}/alphapose-results-halpe26-posetrack.json
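Before moving on, it can help to confirm that the commands above left the expected files in place. The following is a minimal sketch, not part of the repository: the helper name `missing_inputs` is hypothetical, and the only assumption is the directory layout produced by the commands above (a `frames/` folder of `.jpg` files and the renamed AlphaPose JSON next to it).

```python
from pathlib import Path

# File name taken from the `mv` command above.
POSE_FILE = "alphapose-results-halpe26-posetrack.json"


def missing_inputs(video_dir):
    """Return a list of problems found in video_dir; an empty list means
    both the extracted frames and the pose results are present."""
    root = Path(video_dir)
    problems = []

    frames_dir = root / "frames"
    # ffmpeg writes frames as 00001.jpg, 00002.jpg, ... into frames/
    if not frames_dir.is_dir() or not any(frames_dir.glob("*.jpg")):
        problems.append("no extracted frames in frames/")

    if not (root / POSE_FILE).is_file():
        problems.append(f"missing {POSE_FILE}")

    return problems
```

Calling `missing_inputs("data/demo/276_reg")` on a prepared directory should return an empty list, provided the directory follows the layout above.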

We packaged a demo video with the necessary inputs for quickly testing our code:

$ wget https://sumith1896.github.io/motion2prog/static/demo.zip
$ mv demo.zip data/  && cd data/ && unzip demo.zip && cd ..

Synthesis needs 2D pose detection results and the extracted video frames (the frames are used for visualization).
The load function in lkeypoints.py supports loading from several pose detector formats.
We used AlphaPose with the commands above for all pose detection results.
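To make the input format concrete, here is a rough sketch of reading AlphaPose results into per-person tracks. It is not the loader in lkeypoints.py. It assumes AlphaPose's usual JSON layout: a list of detections, each with an `image_id`, a flat `keypoints` list of (x, y, score) triples, and, when `--pose_track` is enabled, an integer `idx` track id. The function names `group_by_track` and `load_tracks` are hypothetical.

```python
import json
from collections import defaultdict


def group_by_track(detections):
    """Group AlphaPose-style detections by track id.

    Returns {track_id: {image_id: [(x, y, score), ...]}} with the frames
    of each track sorted by image_id.
    """
    tracks = defaultdict(dict)
    for det in detections:
        flat = det["keypoints"]
        if len(flat) % 3 != 0:
            raise ValueError("keypoints must be a flat list of (x, y, score) triples")
        # Split the flat list into one (x, y, score) tuple per joint.
        joints = [tuple(flat[i:i + 3]) for i in range(0, len(flat), 3)]
        # Assumption: without tracking there is no "idx"; use a single track 0.
        track_id = det.get("idx", 0)
        tracks[track_id][det["image_id"]] = joints
    return {tid: dict(sorted(frames.items())) for tid, frames in tracks.items()}


def load_tracks(path):
    """Read an AlphaPose results file and group its detections by track."""
    with open(path) as f:
        return group_by_track(json.load(f))
```

With the Halpe26 model used above, each detection would carry 26 joints, i.e. 78 values in `keypoints`. Sorting by `image_id` relies on the zero-padded frame names produced by the `%05d.jpg` pattern.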

Run motion program synthesis pipeline

1. With the data prepared, you can run the synthesis with the following command:

$ python fit.py -d data/demo/276_reg -k coco -a -x -c -p 1 -w 20 --no-acc \
--stat-thres 5 --span-thres 5 --cores 9 -r 1600 -o ./visualization/static/data/demo

The options and their descriptions are documented in fit.py.
The results can be found under ./visualization/static/data/demo.

Visualizing the synthesized programs

2. We package a visualization server for browsing the generated programs:

$ cd visualization/
$ bash deploy.sh p

Open the webpage you are directed to and browse the results interactively.

Citations

If you find our code or paper useful to your research, please consider citing:

@inproceedings{motion2prog2021,
    author = {Sumith Kulal and Jiayuan Mao and Alex Aiken and Jiajun Wu},
    title = {Hierarchical Motion Understanding via Motion Programs},
    booktitle = {CVPR},
    year = {2021},
}

Checklist

Please open a GitHub issue or contact the authors by email for any issues or questions!

Upload pre-processed data used in paper.
Add for-loop synthesis layer.

Acknowledgements

We thank Karan Chadha, Shivam Garg and Shubham Goel for helpful discussions. This work is supported in part by a Magic Grant from the Brown Institute for Media Innovation, the Samsung Global Research Outreach (GRO) Program, Autodesk, Amazon Web Services, and Stanford HAI for AWS Cloud Credits.

Parts of this repo use materials from SCANimate and fit.

Owner

Sumith Kulal