A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"

Last update: Jun 04, 2022

Related tags

Deep Learning LASAFT-Net-v2

Overview

LASAFT-Net-v2

Listen, Attend and Separate by Attentively aggregating Frequency Transformation

Woosung Choi, Yeong-Seok Jeong, Jinsung Kim, Jaehwa Chung, Soonyoung Jung, and Joshua D. Reiss

Demonstration (under construction)

Experimental Results

Musdb 18

model	vocals	drums	bass	other	AVG
Meta-TasNet	6.40	5.91	5.58	4.19	5.52
AMSS-Net	6.78	5.92	5.10	4.51	5.58
LaSAFT-Net-v1	7.33	5.68	5.63	4.87	5.88
LASAFT-Net-v2	7.57	6.13	5.28	4.87	5.96

MDX Challenge (Leaderboard A)

model	model type	vocals	drums	bass	other	AVG
KUILAB-MDX-Net	dedicated (1 source/ 1 model)	8.901	7.173	7.232	5.636	7.236
LaSAFT-Net-v1 (light)	conditioned (4 sources/ 1 model)	7.275	5.935	5.823	4.557	5.897
LASAFT-Net-v2 (light)	conditioned (4 sources/ 1 model)	7.324	5.976	5.884	4.642	5.957

How to reproduce

1. Environment

Ubuntu 20.04
wandb for logging

You must create .env file by copying .env.sample to set environmental variables.

wandb_api_key=[Your Key] # "xxxxxxxxxxxxxxxxxxxxxxxx"
data_dir=[Your Path] # "/home/ielab/repos/musdbHQ"

about wandb_api_key
- we currently only support wandb for logging.
- for wandb_api_key, visit wandb, go to setting, and then copy your api key
about data_dir
- the absolute path where datasets are stored

2. Installation (cuda)

conda env create -f environment.yaml -n lasaftv2
conda activate lasaftv2
pip install -r requirements.txt

A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"

Related tags

Overview

LASAFT-Net-v2

Listen, Attend and Separate by Attentively aggregating Frequency Transformation

Experimental Results

How to reproduce

1. Environment

2. Installation (cuda)

Owner

Woosung Choi

InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal Artifact Reduction in CT Images

Official Repository for the ICCV 2021 paper "PixelSynth: Generating a 3D-Consistent Experience from a Single Image"

A treasure chest for visual recognition powered by PaddlePaddle

Boosted neural network for tabular data

Semi-supervised Stance Detection of Tweets Via Distant Network Supervision

FedJAX is a library for developing custom Federated Learning (FL) algorithms in JAX.

Politecnico of Turin Thesis: "Implementation and Evaluation of an Educational Chatbot based on NLP Techniques"

UniFormer - official implementation of UniFormer

[IJCAI'21] Deep Automatic Natural Image Matting

Pytorch implementation of Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors

Train SN-GAN with AdaBelief

Deduplicating Training Data Makes Language Models Better

A PyTorch implementation of "Multi-Scale Contrastive Siamese Networks for Self-Supervised Graph Representation Learning", IJCAI-21

FG-transformer-TTS Fine-grained style control in transformer-based text-to-speech synthesis

Text Extraction Formulation + Feedback Loop for state-of-the-art WSD (EMNLP 2021)

Fast, differentiable sorting and ranking in PyTorch

CellRank's reproducibility repository.

MINOS: Multimodal Indoor Simulator

FSL-Mate: A collection of resources for few-shot learning (FSL).

A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.