Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Last update: Jan 02, 2023

Overview

RAVE: Realtime Audio Variational autoEncoder

Official implementation of RAVE: A variational autoencoder for fast and high-quality neural audio synthesis (article link)

Installation

RAVE needs python 3.9. Install the dependencies using

pip install -r requirements.txt

Training

Both RAVE and the prior model are available in this repo. For most users we recommand to use the cli_helper.py script, since it will generate a set of instructions allowing the training and export of both RAVE and the prior model on a specific dataset.

python cli_helper.py

However, if you want to customize even more your training, you can use the provided train_{rave, prior}.py and export_{rave, prior}.py scripts manually.

Realtime usage

[NOT AVAILABLE YET]

RAVE and the prior model can be used in realtime inside max/msp, allowing creative interactions with both models. Code and details about this part of the project are not available yet, we are currently working on the corresponding article !

An audio example of the prior sampling patch is available in the docs/ folder.

Official implementation of the RAVE model: a Realtime Audio Variational autoEncoder

Related tags

Overview

RAVE: Realtime Audio Variational autoEncoder

Installation

Training

Realtime usage

Owner

Antoine Caillon

Implementation of SETR model, Original paper: Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers.

Official repository for GCR rerank, a GCN-based reranking method for both image and video re-ID

City Surfaces: City-scale Semantic Segmentation of Sidewalk Surfaces

An implementation of quantum convolutional neural network with MindQuantum. Huawei, classifying MNIST dataset

Resources complimenting the Machine Learning Course led in the Faculty of mathematics and informatics part of Sofia University.

Fast and Simple Neural Vocoder, the Multiband RNNMS

PyTorch implementation of federated learning framework based on the acceleration of global momentum

Training vision models with full-batch gradient descent and regularization

Code for reproducing our analysis in the paper titled: Image Cropping on Twitter: Fairness Metrics, their Limitations, and the Importance of Representation, Design, and Agency

PyTorch implementation of a collections of scalable Video Transformer Benchmarks.

Job-Recommend-Competition - Vectorwise Interpretable Attentions for Multimodal Tabular Data

[NeurIPS 2021] Garment4D: Garment Reconstruction from Point Cloud Sequences

A Closer Look at Invalid Action Masking in Policy Gradient Algorithms

Simple, efficient and flexible vision toolbox for mxnet framework.

Segmentation-Aware Convolutional Networks Using Local Attention Masks

《Fst Lerning of Temporl Action Proposl vi Dense Boundry Genertor》(AAAI 2020)

Yoloxkeypointsegment - An anchor-free version of YOLO, with a simpler design but better performance

Torch implementation of SegNet and deconvolutional network

nextPARS, a novel Illumina-based implementation of in-vitro parallel probing of RNA structures.

[Pedestron] Generalizable Pedestrian Detection: The Elephant In The Room. @ CVPR2021