EmoTag helps you train emotion detection model for Chinese audios

Last update: Sep 07, 2022

Overview

emoTag

emoTag helps you train emotion detection model for Chinese audios.

Environment

pip install -r requirement.txt

Data

We used Emotional Speech Dataset (ESD) for Speech Synthesis and Voice Conversion from HLT Singapore.

Train Emotion Classifier

Use this command to train a classifier. Adjust training setups in conf/logfbank_train-emo.json.

python train.py --config conf/logfbank_train-emo.json --name task_trial_1

Models and logs will be find in exp/.

usage: train.py [-h] [-c CONFIG] [-r RESUME] [-n NAME] [--lr LR] [--bs BS]
                [--train_utt2wav TRAIN_UTT2WAV] [--val_utt2wav VAL_UTT2WAV]
                [--blocks BLOCKS] [--optimizer OPTIMIZER]
                [--train_pad0 TRAIN_PAD0] [--devel_pad0 DEVEL_PAD0]
                [--pretrain PRETRAIN]

PyTorch Template

optional arguments:
  -h, --help            show this help message and exit
  -c CONFIG, --config CONFIG
                        config file path (default: None)
  -r RESUME, --resume RESUME
                        path to latest checkpoint (default: None)
  -n NAME, --name NAME
  --lr LR, --learning_rate LR
  --bs BS, --batch_size BS
  --train_utt2wav TRAIN_UTT2WAV
  --val_utt2wav VAL_UTT2WAV
  --blocks BLOCKS
  --optimizer OPTIMIZER
  --train_pad0 TRAIN_PAD0
  --devel_pad0 DEVEL_PAD0
  --pretrain PRETRAIN

Infer labels

python infer_label.py

Adjust the vad_file param and code if necessary to adapt to new tasks. infer_label.py adopted multiprocessing, increased cpu utilities rate and inference efficiency. See usage details below.

usage: infer_label.py [-h] [--vad_file VAD_FILE] [--model_dir MODEL_DIR]
                      [--output_dir OUTPUT_DIR]

parse model info

optional arguments:
  -h, --help            show this help message and exit
  --vad_file VAD_FILE
  --model_dir MODEL_DIR
  --output_dir OUTPUT_DIR

EmoTag helps you train emotion detection model for Chinese audios

Related tags

Overview

emoTag

Environment

Data

Train Emotion Classifier

Infer labels

Owner

_zza

URIE: Universal Image Enhancementfor Visual Recognition in the Wild

Pytorch implementation of the paper DocEnTr: An End-to-End Document Image Enhancement Transformer.

Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering (NAACL 2021)

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.

Python implementation of MULTIseq barcode alignment using fuzzy string matching and GMM barcode assignment

A library to inspect itermediate layers of PyTorch models.

Official implementation for the paper: Permutation Invariant Graph Generation via Score-Based Generative Modeling

Voice Conversion Using Speech-to-Speech Neuro-Style Transfer

9th place solution

SemEval2022 Patronizing and Condescending Language (PCL) Detection

Swin-Transformer is basically a hierarchical Transformer whose representation is computed with shifted windows.

This is the repository for Learning to Generate Piano Music With Sustain Pedals

Out-of-Town Recommendation with Travel Intention Modeling (AAAI2021)

Object tracking implemented with YOLOv4, DeepSort, and TensorFlow.

Torch implementation of various types of GAN (e.g. DCGAN, ALI, Context-encoder, DiscoGAN, CycleGAN, EBGAN, LSGAN)

Implementation of Deep Deterministic Policy Gradiet Algorithm in Tensorflow

PyTorch implementation for MINE: Continuous-Depth MPI with Neural Radiance Fields

Code for the paper: Adversarial Training Against Location-Optimized Adversarial Patches. ECCV-W 2020.

验证码识别深度学习 tensorflow 神经网络

Hierarchical Time Series Forecasting with a familiar API

EmoTag helps you train emotion detection model for Chinese audios

Related tags

Overview

emoTag

Environment

Data

Train Emotion Classifier

Infer labels

Owner

_zza

URIE: Universal Image Enhancementfor Visual Recognition in the Wild

Pytorch implementation of the paper DocEnTr: An End-to-End Document Image Enhancement Transformer.

Designing a Minimal Retrieve-and-Read System for Open-Domain Question Answering (NAACL 2021)

This is a project based on ConvNets used to identify whether a road is clean or dirty. We have used MobileNet as our base architecture and the weights are based on imagenet.

Python implementation of MULTIseq barcode alignment using fuzzy string matching and GMM barcode assignment

A library to inspect itermediate layers of PyTorch models.

Official implementation for the paper: Permutation Invariant Graph Generation via Score-Based Generative Modeling

Voice Conversion Using Speech-to-Speech Neuro-Style Transfer

9th place solution

SemEval2022 Patronizing and Condescending Language (PCL) Detection

Swin-Transformer is basically a hierarchical Transformer whose representation is computed with shifted windows.

This is the repository for Learning to Generate Piano Music With Sustain Pedals

Out-of-Town Recommendation with Travel Intention Modeling (AAAI2021)

Object tracking implemented with YOLOv4, DeepSort, and TensorFlow.

Torch implementation of various types of GAN (e.g. DCGAN, ALI, Context-encoder, DiscoGAN, CycleGAN, EBGAN, LSGAN)

Implementation of Deep Deterministic Policy Gradiet Algorithm in Tensorflow

PyTorch implementation for MINE: Continuous-Depth MPI with Neural Radiance Fields

Code for the paper: Adversarial Training Against Location-Optimized Adversarial Patches. ECCV-W 2020.

验证码识别 深度学习 tensorflow 神经网络

Hierarchical Time Series Forecasting with a familiar API

验证码识别深度学习 tensorflow 神经网络