Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

Last update: Aug 28, 2022

Related tags

Deep Learning AequeVox

Overview

AequeVox

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

README under development.

Python Packages Required

numpy
scipy
math
librosa
random
time
json
threading
re
nltk

ASR Specific Packages

Google Cloud

speech
Storage

Microsoft Azure

Azure.cognitiveservices.speech

IBM Cloud

ibm_watson
ibm_watson.websocket
Ibm_cloud_sdk_core.authenticators

The code is separated into 2 sections, Generation and Analysis.

Generation:

transGen.py

Lists all transformation types and magnitudes to be used. Can be modified as necessary.
Requires the specification of file names of all the original speech files.

Generates transformed speech files with form {Original File Name}{Transformation Type Abbreviation}{Magnitude of Transformation Parameter, theta}.wav

List of Abbreviations.

A - Amplitude
C - Clipping
D - Drop
F - Frame
HP - Highpass
LP - LP
N - Noise
S - Scale

GCP_Recog.py

Requires Google cloud client libraries and associated keys.

Takes a group name and the list of all original files in the group to generate transcripts.

MS_Recog.py

Requires Microsoft Azure client libraries and associated key and region.

Takes a group name and the list of all original files in the group to generate transcripts.

IBM_Recog.py

Requires IBM client libraries and associated key and service URL..

Takes a group name and the list of all original files in the group to generate transcripts.

compASR.py

Takes the names of two ASR systems and group names to generate a distance metric. Result yields text files with distance metrics for specified groups.

Users are requested to use the distance metrics to calculate the D values for each transformation.

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

Related tags

Overview

AequeVox

Owner

Sai Sathiesh

A Python package to process & model ChEMBL data.

Official implementation of the network presented in the paper "M4Depth: A motion-based approach for monocular depth estimation on video sequences"

Code and data (Incidents Dataset) for ECCV 2020 Paper "Detecting natural disasters, damage, and incidents in the wild".

The world's largest toxicity dataset.

Cross-modal Retrieval using Transformer Encoder Reasoning Networks (TERN). With use of Metric Learning and FAISS for fast similarity search on GPU

Learning hierarchical attention for weakly-supervised chest X-ray abnormality localization and diagnosis

The official code repository for examples in the O'Reilly book 'Generative Deep Learning'

Official implementation of "SinIR: Efficient General Image Manipulation with Single Image Reconstruction" (ICML 2021)

The mini-MusicNet dataset

On-device wake word detection powered by deep learning.

Few-shot Learning of GPT-3

Visualization toolkit for neural networks in PyTorch! Demo -->

This is an open source python repository for various python tests

Code and data to accompany the camera-ready version of "Cross-Attention is All You Need: Adapting Pretrained Transformers for Machine Translation" in EMNLP 2021

RLHive: a framework designed to facilitate research in reinforcement learning.

Code for "Localization with Sampling-Argmax", NeurIPS 2021

the code of the paper: Recurrent Multi-view Alignment Network for Unsupervised Surface Registration (CVPR 2021)

Aircraft design optimization made fast through modern automatic differentiation

Pytorch implementation for "Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets" (ECCV 2020 Spotlight)

Decoding the Protein-ligand Interactions Using Parallel Graph Neural Networks