E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Last update: Dec 15, 2022

Overview

End-to-end Music Remastering System

This repository includes source code and pre-trained models of the work End-to-end Music Remastering System Using Self-supervised and Adversarial Training by Junghyun Koo, Seungryeol Paik, and Kyogu Lee.

We provide inference code of the proposed system, which targets to alter the mastering style of a song to desired reference track.

Pre-trained Models

Model	Number of Epochs Trained	Details
Music Effects Encoder	1000	Trained with MTG-Jamendo Dataset
Mastering Cloner	1000	Trained with the above pre-trained Music Effects Encoder and Projection Discriminator

Inference

To run the inference code,

Download pre-trained models above and place them under the folder named 'model_checkpoints' (default)
Prepare input and reference tracks under the folder named 'inference_samples' (default).
Target files should be organized as follow:

    "path_to_data_directory"/"song_name_#1"/input.wav
    "path_to_data_directory"/"song_name_#1"/reference.wav
    ...
    "path_to_data_directory"/"song_name_#n"/input.wav
    "path_to_data_directory"/"song_name_#n"/reference.wav

Run 'inference.py'

python inference.py \
    --ckpt_dir "path_to_checkpoint_directory" \
    --data_dir_test "path_to_directory_containing_inference_samples"

Outputs will be stored under the folder 'inference_samples' (default)

Note: The system accepts WAV files of stereo-channeled, 44.1kHZ, and 16-bit rate. Target files shold be named "input.wav" and "reference.wav".

Configurations of each sub-networks

A detailed configuration of each sub-networks can also be found at

Self_Supervised_Music_Remastering_System/configs.yaml

E2e music remastering system - End-to-end Music Remastering System Using Self-supervised and Adversarial Training

Related tags

Overview

End-to-end Music Remastering System

Pre-trained Models

Inference

Configurations of each sub-networks

Owner

Junghyun (Tony) Koo

Code release for "Self-Tuning for Data-Efficient Deep Learning" (ICML 2021)

Pmapper is a super-resolution and deconvolution toolkit for python 3.6+

Dynamic hair modeling from monocular videos using deep neural networks

Pytorch implementation of our paper accepted by NeurIPS 2021 -- Revisiting Discriminator in GAN Compression: A Generator-discriminator Cooperative Compression Scheme

Learning Super-Features for Image Retrieval

Official PyTorch implementation of Spatial Dependency Networks.

Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

Automatic Differentiation Multipole Moment Molecular Forcefield

GemNet model in PyTorch, as proposed in "GemNet: Universal Directional Graph Neural Networks for Molecules" (NeurIPS 2021)

This repository includes the code of the sequence-to-sequence model for discontinuous constituent parsing described in paper Discontinuous Grammar as a Foreign Language.

Face Identity Disentanglement via Latent Space Mapping [SIGGRAPH ASIA 2020]

Temporal-Relational CrossTransformers

Label-Free Model Evaluation with Semi-Structured Dataset Representations

An self sufficient AI that crawls the web to learn how to generate art from keywords

Code for Towards Streaming Perception (ECCV 2020) :car:

Mae segmentation - Reproduction of semantic segmentation using masked autoencoder (mae)

Joint deep network for feature line detection and description

MultiMix: Sparingly Supervised, Extreme Multitask Learning From Medical Images (ISBI 2021, MELBA 2021)

Generative Models as a Data Source for Multiview Representation Learning