Zalo AI challenge 2021 task hum to song

Last update: Dec 16, 2022

Related tags

Deep Learning hum2song

Overview

Zalo AI challenge 2021 task Hum to Song

pipeline:

Chuẩn bị dữ liệu cho quá trình train:

Sửa các file đường dẫn trong config/preprocess.yaml
- raw_path: đường dẫn đến data thô
- preprocessed_path: đường dẫn đầu ra của quá trình rút trích mel
- temp_dir: đường dẫn chứa dữ liệu mp3 được chuẩn hóa
- Chạy lần lượt các lệnh sau:

        python preprocessing.py

        python utils/split_train_val_by_id.py
   
        python utils/augment_mp3.py
   
        python utils/preprocess_augment.py

Train model:

Sửa các file đường dẫn trong config/config.py
- meta_train: đường dẫn đến file train_meta.csv trong preprocessed_path
- train_root: đường dẫn đến dữ liệu mel đã tiền xử lý
- train_list = 'full_data_train.txt'
- val_list = 'full_data_val.txt'
Chạy lần lượt các lệnh sau:

        python convert_data.py

        python train.py

Infer public test:

Đặt dữ liệu mp3 thô ở địa chỉ /data/public_test (bên trong chứa 2 thư mục full_song và hum)
Chạy lần lượt các lệnh sau:

./predict.sh

Infer private test:

Đặt dữ liệu mp3 thô ở địa chỉ /data/private_test (bên trong chứa 2 thư mục full_song và hum)

Chạy lần lượt các lệnh sau:

./predict_private_test.sh

Team:

Võ Văn Phúc

Nguyễn Văn Thiều

Lâm Bá Thịnh

Zalo AI challenge 2021 task hum to song

Related tags

Overview

Zalo AI challenge 2021 task Hum to Song

pipeline:

Chuẩn bị dữ liệu cho quá trình train:

Train model:

Infer public test:

Infer private test:

Team:

Owner

Vo Van Phuc

Repository for the AugmentedPCA Python package.

A demonstration of using a live Tensorflow session to create an interactive face-GAN explorer.

Set of methods to ensemble boxes from different object detection models, including implementation of "Weighted boxes fusion (WBF)" method.

DCT-Mask: Discrete Cosine Transform Mask Representation for Instance Segmentation

Self-Supervised Contrastive Learning of Music Spectrograms

DeepVoxels is an object-specific, persistent 3D feature embedding.

Monk is a low code Deep Learning tool and a unified wrapper for Computer Vision.

A colab notebook for training Stylegan2-ada on colab, transfer learning onto your own dataset.

The code for "Deep Level Set for Box-supervised Instance Segmentation in Aerial Images".

Deep Learning tutorials in jupyter notebooks.

This repository contains the code for TACL2021 paper: SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Real-time analysis of intracranial neurophysiology recordings.

Research code of ICCV 2021 paper "Mesh Graphormer"

Open source Python module for computer vision

UA-GEC: Grammatical Error Correction and Fluency Corpus for the Ukrainian Language

The official implementation of A Unified Game-Theoretic Interpretation of Adversarial Robustness.

Code for our paper "SimCLS: A Simple Framework for Contrastive Learning of Abstractive Summarization", ACL 2021

A graph-to-sequence model for one-step retrosynthesis and reaction outcome prediction.

TensorFlow GNN is a library to build Graph Neural Networks on the TensorFlow platform.

Hough Transform and Hough Line Transform Using OpenCV