ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Last update: Dec 02, 2022

Related tags

Text Data & NLP ConferencingSpeech2022

Overview

ConferencingSpeech 2022 challenge

This repository contains the datasets list and scripts required for the ConferencingSpeech 2022 challenge. For more details about the challenge, please see our website.

Details

baseline, this folder contains baseline system include inference model exported by inference scripts;
eval, this folder contains evaluation scripts to calculate PLCC, RMSE and SRCC;
data-sets, this folder contains training and development test data-sets provied to the participant;
- Tencent Corpus, this dataset includes about 14,000 speech chinese speech clips with simulated (e.g. codecs, packet-loss, background noise) and live conditions.
- NISQA Corpus, the NISQA Corpus includes more than 14,000 speech samples with simulated (e.g. codecs, packet-loss, background noise) and live (e.g. mobile phone, Zoom, Skype, WhatsApp) conditions.
- IU Bloomington Corpus, there are 10,000 speech signals extracted from COSINE and VOiCESdatasets, each truncated between 3 to 6 seconds long.
- PSTN Corpus, there are about 80,000 speech clips through classic public switched telephone networks, each truncated 10 seconds long.

Requirements

To install requirements install Anaconda and then use:

conda env create -f envs.yml

This will create a new environment with the name "conferencingSpeech". Activate this environment to go on:

conda activate conferencingSpeech

Code license

Apache 2.0

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

Related tags

Overview

ConferencingSpeech 2022 challenge

Details

Requirements

Code license

Owner

Transformation spoken text to written text

MMDA - multimodal document analysis

Trex is a tool to match semantically similar functions based on transfer learning.

Code for papers "Generation-Augmented Retrieval for Open-Domain Question Answering" and "Reader-Guided Passage Reranking for Open-Domain Question Answering", ACL 2021

A Facebook Messenger Chatbot using NLP

中文空间语义理解评测

Grading tools for Advanced NLP (11-711)Grading tools for Advanced NLP (11-711)

Generating new names based on trends in data using GPT2 (Transformer network)

Train and use generative text models in a few lines of code.

Pytorch-version BERT-flow: One can apply BERT-flow to any PLM within Pytorch framework.

End-to-End Speech Processing Toolkit

Beyond Accuracy: Behavioral Testing of NLP models with CheckList

NLP codes implemented with Pytorch (w/o library such as huggingface)

A fast hierarchical dimensionality reduction algorithm.

Intent parsing and slot filling in PyTorch with seq2seq + attention

translate using your voice

Flaxformer: transformer architectures in JAX/Flax

Neural network sequence labeling model

Deep Learning Topics with Computer Vision & NLP

SASE : Self-Adaptive noise distribution network for Speech Enhancement with heterogeneous data of Cross-Silo Federated learning