Music source separation is a task to separate audio recordings into individual sources

Last update: Jan 03, 2023

Related tags

Overview

Music Source Separation

Music source separation is a task to separate audio recordings into individual sources. This repository is an PyTorch implmementation of music source separation. Users can separate their favorite songs into different sources by installing this repository. In addition, users can train their own music source separation systems using this repository. This repository also includes speech enhancement, instruments separation, etc.

Demos

Vocals and accompaniment separation: https://www.youtube.com/watch?v=WH4m5HYzHsg

Separation

Users can easily separate their favorite audio recordings into vocals and accompaniment using the pretrained checkpoints.

Method 1. Separate by installing the package

python3 setup.py install

python3 separate_scripts/separate.py 
    --audio_path="./resources/vocals_accompaniment_10s.mp3" 
    --source_type="accompaniment"

Method 2. Separate by using the source code

1. Install dependencies

pip install -r requirements.txt

2. Download checkpoints

./separate_scripts/download_checkpoints.sh

3. Separate vocals and accompaniment

./separate_scripts/separate_vocals.sh "resources/vocals_accompaniment_10s.mp3" "sep_vocals.mp3"
./separate_scripts/separate_accompaniment.sh "resources/vocals_accompaniment_10s.mp3" "sep_accompaniment.mp3"

Train a music source separation system from scratch

1. Download dataset

We use the MUSDB18 dataset to train music source separation systems. The trained system can be used to separate vocals, accompaniments, bass, and other sources. Execute the following script to download and decompress the MUSDB18 dataset:

./scripts/0_download_datasets/musdb18.sh

The dataset looks like:

./datasets/musdb18
├── train (100 files)
│   ├── 'A Classic Education - NightOwl.stem.mp4'
│   └── ...
├── test (50 files)
│   ├── 'Al James - Schoolboy Facination.stem.mp4'
│   └── ...
└── README.md

2. Pack audio files into hdf5 files

We pack audio waveforms into hdf5 files to speed up training.

."/scripts/1_pack_audios_to_hdf5s/musdb18/sr=44100,chn=2.sh"

3. Create indexes for training

./scripts/2_create_indexes/musdb18/create_indexes.sh

3. Create evaluation audios

./scripts/3_create_evaluation_audios/musdb18/create_evaluation_audios.sh

4. Train & evaluate & save checkpoints

./scripts/4_train/musdb18/train.sh

5. Inference

./scripts/5_inference/musdb18/inference.sh

##

Results

Model	Size (MB)	SDR (dB)	process 1s time (GPU Tesla V100)	process 1s time (CPU Core i7)
ResUNet143 vocals	461	8.9	0.036	2.513
ResUNet143 acc.	461	16.8	0.036	2.513
ResUNet143 Subband vocals	414	8.8	0.012	0.614
ResUNet143 Subband acc.	414	16.4	0.012	0.614

Reference

[1] Qiuqiang Kong, Yin Cao, Haohe Liu, Keunwoo Choi, Yuxuan Wang, Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation, International Society for Music Information Retrieval (ISMIR), 2021.

@inproceedings{kong2021decoupling,
  title={Decoupling Magnitude and Phase Estimation with Deep ResUNet for Music Source Separation.},
  author={Kong, Qiuqiang and Cao, Yin and Liu, Haohe and Choi, Keunwoo and Wang, Yuxuan },
  booktitle={ISMIR},
  year={2021},
  organization={Citeseer}
}

FAQ

On Mac OSX, if users met "ModuleNotFoundError: No module named ..." error, then execute the following commands:

PYTHONPATH="./"
export PYTHONPATH

Music source separation is a task to separate audio recordings into individual sources

Related tags

Overview

Music Source Separation

Demos

Separation

Method 1. Separate by installing the package

Method 2. Separate by using the source code

1. Install dependencies

2. Download checkpoints

3. Separate vocals and accompaniment

Train a music source separation system from scratch

1. Download dataset

2. Pack audio files into hdf5 files

3. Create indexes for training

3. Create evaluation audios

4. Train & evaluate & save checkpoints

5. Inference

Results

Reference

FAQ

Owner

Bytedance Inc.

Employee-Managment - Company employee registration software in the face recognition system

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

Pytorch implementation of the unsupervised object discovery method LOST.

Eff video representation - Efficient video representation through neural fields

PyTorch implementation of Asymmetric Siamese (https://arxiv.org/abs/2204.00613)

Drone detection using YOLOv5

AI assistant built in python.the features are it can display time,say weather,open-google,youtube,instagram.

discovering subdomains, hidden paths, extracting unique links

Trash Sorter Extraordinaire is a software which efficiently detects the different types of waste in a pile of random trash through feeding it pictures or videos.

The code written during my Bachelor Thesis "Classification of Human Whole-Body Motion using Hidden Markov Models".

某学校选课系统GIF验证码数据集 + Baseline模型 + 上下游相关工具

Implementation of H-UCRL Algorithm

Differentiable molecular simulation of proteins with a coarse-grained potential

Official PyTorch Implementation of paper "Deep 3D Mask Volume for View Synthesis of Dynamic Scenes", ICCV 2021.

This repository includes the code of the sequence-to-sequence model for discontinuous constituent parsing described in paper Discontinuous Grammar as a Foreign Language.

Tensorflow implementation of Human-Level Control through Deep Reinforcement Learning

PyTorch implementation of the supervised learning experiments from the paper Model-Agnostic Meta-Learning (MAML)

Package for extracting emotions from social media text. Tailored for financial data.

an Evolutionary Algorithm assisted GAN

A hybrid framework (neural mass model + ML) for SC-to-FC prediction