A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Last update: May 25, 2022

Related tags

Text Data & NLP midi_language

Overview

MIDI Language

Introduction

Reference

Paper: Pop Music Transformer: Beat-based Modeling and Generation of Expressive Pop Piano Compositions: code

This is a modified version with an extension of multi-instrumental support.

Function

Convert Midi into event sequence, and represented by mapped integer array.

This could send to NLP models for AI auto music composition.

Due to this project considers more about music structures as well as its chord and melody on higher level, including note, drum, tempo, musical instrument (program in midi) and its expressions (tempo and velocity), rather than digging into too much details like sound source & direction, instrumental performing techniques (such as, bend sound, piano sustain pedal, violin overtones), the language of MIDI is design this way (see chapter Details below).

Usage

See language.py, it contains procedures:

load w2i (word to integer) and i2w (integer to word), for not calculating it every time;
encode midi to iteger array, each object handle one mid file;
decode integer array to midi, each object handle many results and export to mid files;

The code language.py has arguments:

input: input file of audio file to encode/decode;
output: output file of audio file to encode;
train: if have, it will switch to training mode with variations (data augmentation);

MidiEncoder data augmentation:

pitch_variation_range: a random pitch shift within a range for whole midi;
velocity_scale_variation_range: a random note/drum velocity scale for whole midi;
velocity_noise_scale_variation_range: a random note/drum velocity scale for each element within midi;
tempo_scale_variation_range: a random tempo change for whole midi;

MidiDecoder needs numerator and denominator time signatures for reconstructing midi files.

Details

Event Structure

Required:

Bar
Position (0~split-1)

Optional:

note:
- Note
- Program (0~127)
- Pitch (0~127)
- Velocity (0~127)
- Duration (0~split*bar_scale-1)
drum:
- Drum
- Program (0~127)
- Pitch (0~127)
- Velocity (0~127)
- Duration (0~split*bar_scale-1)
chord:
- Chord (chroma_name:chord_name)
tempo:
- Tempo_Class (T0~Ti)
- Tempo_Value (0~59)

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Related tags

Overview

MIDI Language

Introduction

Reference

Function

Usage

Details

Event Structure

Required:

Optional:

Owner

Robert Bogan Kang

Ray-based parallel data preprocessing for NLP and ML.

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

A library for end-to-end learning of embedding index and retrieval model

A simple version of DeTR

DeepPavlov Tutorials

Task-based datasets, preprocessing, and evaluation for sequence models.

Unofficial Python library for using the Polish Wordnet (plWordNet / Słowosieć)

Final Project for the Intel AI Readiness Boot Camp NLP (Jan)

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

This project aims to conduct a text information retrieval and text mining on medical research publication regarding Covid19 - treatments and vaccinations.

MASS: Masked Sequence to Sequence Pre-training for Language Generation

An open source library for deep learning end-to-end dialog systems and chatbots.

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

Prompt tuning toolkit for GPT-2 and GPT-Neo

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

NSFW A chatbot based on GPT2-chitchat

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Scikit-learn style model finetuning for NLP

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

A design of MIDI language for music generation task, specifically for Natural Language Processing (NLP) models.

Related tags

Overview

MIDI Language

Introduction

Reference

Function

Usage

Details

Event Structure

Required:

Optional:

Owner

Robert Bogan Kang

Ray-based parallel data preprocessing for NLP and ML.

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

A library for end-to-end learning of embedding index and retrieval model

A simple version of DeTR

DeepPavlov Tutorials

Task-based datasets, preprocessing, and evaluation for sequence models.

Unofficial Python library for using the Polish Wordnet (plWordNet / Słowosieć)

Final Project for the Intel AI Readiness Boot Camp NLP (Jan)

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

This project aims to conduct a text information retrieval and text mining on medical research publication regarding Covid19 - treatments and vaccinations.

MASS: Masked Sequence to Sequence Pre-training for Language Generation

An open source library for deep learning end-to-end dialog systems and chatbots.

WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

Prompt tuning toolkit for GPT-2 and GPT-Neo

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

**NSFW** A chatbot based on GPT2-chitchat

XLNet: Generalized Autoregressive Pretraining for Language Understanding

Scikit-learn style model finetuning for NLP

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Contains analysis of trends from Fitbit Dataset (source: Kaggle) to see how the trends can be applied to Bellabeat customers and Bellabeat products

NSFW A chatbot based on GPT2-chitchat