PyTorch implementation of the popular Improv RNN model originally proposed by the Magenta team.

Last update: Nov 11, 2022

PyTorch Implementation of Improv RNN

Overview

This code is a PyTorch implementation of the popular Improv RNN model originally implemented by the Magenta team. The model generates melodies conditioned on a given chord progression.
The specific model implemented in this repository is the Chord Pitches Improv model, which encodes each chord as the concatenation of the following three length-12 vectors:

a one-hot encoding of the chord root pitch class, e.g. [0, 0, 1, 0, 0, 0, 0, 0, 0, 0, 0, 0] for a D major (or minor, etc.) chord
a binary vector indicating presence or absence of each pitch class, e.g. [1, 0, 0, 1, 1, 0, 0, 1, 0, 0, 1, 0] for a C7#9 chord
a one-hot encoding of the chord bass pitch class, which is usually the same as the chord root pitch class except in the case of "slash chords" like C/E
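
The three vectors above can be sketched in plain Python as follows. This is only an illustration of the encoding as described here, not the code used in this repository: the helper names (pitch_class, one_hot, encode_chord) are hypothetical, and the chord tones are passed in by hand rather than derived from a chord symbol.

```python
# Pitch classes of the natural note names; accidentals shift them by a semitone.
NOTE_TO_PITCH_CLASS = {"C": 0, "D": 2, "E": 4, "F": 5, "G": 7, "A": 9, "B": 11}


def pitch_class(name):
    """Convert a note name such as 'C', 'F#' or 'Bb' to a pitch class in 0..11."""
    pc = NOTE_TO_PITCH_CLASS[name[0].upper()]
    for accidental in name[1:]:
        if accidental == "#":
            pc += 1
        elif accidental == "b":
            pc -= 1
        else:
            raise ValueError(f"unsupported accidental in note name: {name!r}")
    return pc % 12


def one_hot(index, size=12):
    """Return a list of zeros with a single 1 at position `index`."""
    vec = [0] * size
    vec[index] = 1
    return vec


def encode_chord(root, pitches, bass=None):
    """Hypothetical helper: build the 36-dimensional chord vector.

    Layout: one-hot root (12) + pitch-class presence (12) + one-hot bass (12).
    The bass defaults to the root, which covers everything except slash chords.
    """
    root_pc = pitch_class(root)
    bass_pc = pitch_class(bass) if bass is not None else root_pc
    present = [0] * 12
    for note in pitches:
        present[pitch_class(note)] = 1
    return one_hot(root_pc) + present + one_hot(bass_pc)


# C7#9 contains C, E, G, Bb and D#.
c7_sharp9 = encode_chord("C", ["C", "E", "G", "Bb", "D#"])
# The slash chord C/E has root C but bass E.
c_over_e = encode_chord("C", ["C", "E", "G"], bass="E")

print(c7_sharp9[12:24])
```

The middle slice of c7_sharp9 is the pitch-class presence vector from the C7#9 example above, and the last 12 entries of c_over_e mark E rather than C.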

You can either use a pre-trained checkpoint of the model or train your own using the steps below.

Installation

Install Required Libraries

pip install -r requirements.txt

Generate a Melody Given Backing Chords

A pretrained checkpoint of the model can be found in the "checkpoints" folder. The checkpoint has been trained for 1000 epochs on the OpenEWLD dataset.

python 003_generate_melody.py --backing_chords "C G Am F C G F C" --output out.mid

This generates a melody that starts on middle C and follows the chord progression C G Am F C G F C, where each chord lasts one bar. Change the progression with the backing_chords parameter, and set the path of the generated MIDI file with the output parameter.
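
To make "each chord lasts one bar" concrete, here is a minimal sketch of how a backing_chords string could be expanded into one chord label per time step. The function name expand_backing_chords and the resolution of 16 steps per bar (4/4 time, four steps per quarter note) are assumptions for illustration; the script in this repository may use a different resolution.

```python
STEPS_PER_BAR = 16  # assumption: 4/4 time with four steps per quarter note
MIDDLE_C = 60       # MIDI pitch number of middle C, the first melody note


def expand_backing_chords(backing_chords, steps_per_bar=STEPS_PER_BAR):
    """Hypothetical helper: repeat each chord symbol for one full bar of steps."""
    chords = backing_chords.split()
    return [chord for chord in chords for _ in range(steps_per_bar)]


steps = expand_backing_chords("C G Am F C G F C")

# Eight chords at one bar each give eight bars of time steps.
print(len(steps) // STEPS_PER_BAR, "bars")
```

Under these assumptions the model would see the same chord for every step of a bar while it picks the melody note for that step.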

[Figure: example visualization of the generated RNN features]

Train Your Own Model

Download OpenEWLD Dataset

The model is trained on the OpenEWLD dataset. OpenEWLD is a subset of the Wikifonia Leadsheet Dataset reduced to copyright-free songs only. A lead sheet is a musical score that notates the melody and the underlying chord progression of a song.
The songs are stored in the compressed MusicXML (*.MXL) format, which can be parsed into sequences of note events using the note-seq library.
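
As background on the file format, independent of note-seq: a compressed MusicXML file is a ZIP archive whose META-INF/container.xml names the score file inside it. The sketch below reads that score with the Python standard library only. The function name read_mxl_score is hypothetical, and the in-memory archive exists purely to demonstrate the layout; for actual feature extraction the note-seq parser is the tool to use.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def read_mxl_score(source):
    """Return the uncompressed MusicXML text stored in a .mxl archive.

    `source` may be a file path or a binary file-like object.
    """
    with zipfile.ZipFile(source) as archive:
        container = ET.fromstring(archive.read("META-INF/container.xml"))
        for element in container.iter():
            # Ignore any XML namespace prefix when matching the tag name.
            if element.tag.split("}")[-1] == "rootfile":
                return archive.read(element.attrib["full-path"]).decode("utf-8")
    raise ValueError("no rootfile entry found in META-INF/container.xml")


# Build a tiny in-memory archive to demonstrate the container layout.
demo = io.BytesIO()
with zipfile.ZipFile(demo, "w") as archive:
    archive.writestr(
        "META-INF/container.xml",
        '<container><rootfiles><rootfile full-path="score.xml"/></rootfiles></container>',
    )
    archive.writestr("score.xml", "<score-partwise version='3.1'/>")
demo.seek(0)

score_xml = read_mxl_score(demo)
print(score_xml)
```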

Dataset Preparation

Extract features from the MusicXML files and store them in an HDF5 (.h5) file.

python 001_create_dataset.py --input C:/Datasets/OpenEWLD/dataset

Training

Train the model, setting the number of epochs with num_epochs

python 002_train.py --num_epochs 1000

Track metrics using TensorBoard

tensorboard --logdir ./logs/

TensorBoard shows the loss and accuracy curves over the training epochs.

Owner

Sebastian Murgul