Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.

Last update: Dec 30, 2022

Related tags

Deep Learning mint

Overview

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++ [ICCV-2021].

Overview

This package contains the model implementation and training infrastructure of our AI Choreographer.

Get started

Pull the code

git clone https://github.com/liruilong940607/mint --recursive

Note here --recursive is important as it will automatically clone the submodule (orbit) as well.

Install dependencies

conda create -n mint python=3.7
conda activate mint
conda install protobuf numpy
pip install tensorflow absl-py tensorflow-datasets librosa

sudo apt-get install libopenexr-dev
pip install --upgrade OpenEXR
pip install tensorflow-graphics tensorflow-graphics-gpu

git clone https://github.com/arogozhnikov/einops /tmp/einops
cd /tmp/einops/ && pip install . -U

git clone https://github.com/google/aistplusplus_api /tmp/aistplusplus_api
cd /tmp/aistplusplus_api && pip install -r requirements.txt && pip install . -U

Note if you meet environment conflicts about numpy, you can try with pip install numpy==1.20.

Get the data

See the website

Get the checkpoint

Download from google drive here, and put them to the folder ./checkpoints/

Run the code

complie protocols

protoc ./mint/protos/*.proto

preprocess dataset into tfrecord

python tools/preprocessing.py \
    --anno_dir="/mnt/data/aist_plusplus_final/" \
    --audio_dir="/mnt/data/AIST/music/" \
    --split=train
python tools/preprocessing.py \
    --anno_dir="/mnt/data/aist_plusplus_final/" \
    --audio_dir="/mnt/data/AIST/music/" \
    --split=testval

run training

python trainer.py --config_path ./configs/fact_v5_deeper_t10_cm12.config --model_dir ./checkpoints

Note you might want to change the batch_size in the config file if you meet OUT-OF-MEMORY issue.

run testing and evaluation

# caching the generated motions (seed included) to `./outputs`
python evaluator.py --config_path ./configs/fact_v5_deeper_t10_cm12.config --model_dir ./checkpoints
# calculate FIDs
python tools/calculate_scores.py

Citation

@inproceedings{li2021dance,
  title={AI Choreographer: Music Conditioned 3D Dance Generation with AIST++},
  author={Ruilong Li and Shan Yang and David A. Ross and Angjoo Kanazawa},
  booktitle = {The IEEE International Conference on Computer Vision (ICCV)},
  year = {2021}
}

Multi-modal Content Creation Model Training Infrastructure including the FACT model (AI Choreographer) implementation.

Related tags

Overview

AI Choreographer: Music Conditioned 3D Dance Generation with AIST++ [ICCV-2021].

Overview

Get started

Pull the code

Install dependencies

Get the data

Get the checkpoint

Run the code

Citation

Owner

Google Research

Python Algorithm Interview Book Review

Deep motion transfer

SBINN: Systems-biology informed neural network

Federated_learning codes used for the the paper "Evaluation of Federated Learning Aggregation Algorithms" and "A Federated Learning Aggregation Algorithm for Pervasive Computing: Evaluation and Comparison"

Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper

The InterScript dataset contains interactive user feedback on scripts generated by a T5-XXL model.

ImageNet Adversarial Image Evaluation

Pytorch implementations of the paper Value Functions Factorization with Latent State Information Sharing in Decentralized Multi-Agent Policy Gradients

A library for graph deep learning research

This is an example of a reproducible modelling project

Language models are open knowledge graphs ( non official implementation )

Saliency - Framework-agnostic implementation for state-of-the-art saliency methods (XRAI, BlurIG, SmoothGrad, and more).

Full body anonymization - Realistic Full-Body Anonymization with Surface-Guided GANs

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

A simple pytorch pipeline for semantic segmentation.

An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.

Task Transformer Network for Joint MRI Reconstruction and Super-Resolution (MICCAI 2021)

Spectrum is an AI that uses machine learning to generate Rap song lyrics

graph-theoretic framework for robust pairwise data association