wenet-kws

Production First and Production Ready End-to-End Keyword Spotting Toolkit.

The goal of this toolkit it to...

Small footprint keyword spotting (KWS), or specifically wake-up word (WuW) detection is a typical and important module in internet of things (IoT) devices. It provides a way for users to control IoT devices with a hands-free experience. A WuW detection system usually runs locally and persistently on IoT devices, which requires low consumptional power, less model parameters, low computational comlexity and to detect predefined keyword in a streaming way, i.e., requires low latency.

Typical Scenario

We are going to support the following typical applications of wakeup word:

Single wake-up word
Multiple wake-up words
Customizable wake-up word
Personalized wake-up word, i.e. combination of wake-up word detection and voiceprint

Dataset

We plan to support a variaty of open source wake-up word datasets, include but not limited to:

All the well-trained models on these dataset will be made public avaliable.

Runtime

We plan to support a variaty of hardwares and platforms, including:

Web browser
x86
Android
Raspberry Pi

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Related tags

Overview

wenet-kws

Typical Scenario

Dataset

Runtime

Owner

Source code for AAAI20 "Generating Persona Consistent Dialogues by Exploiting Natural Language Inference".

Tools to download and cleanup Common Crawl data

Japanese Long-Unit-Word Tokenizer with RemBertTokenizerFast of Transformers

Python wrapper for Stanford CoreNLP tools v3.4.1

Non-Autoregressive Predictive Coding

Code and checkpoints for training the transformer-based Table QA models introduced in the paper TAPAS: Weakly Supervised Table Parsing via Pre-training.

Syntax-aware Multi-spans Generation for Reading Comprehension (TASLP 2022)

Code Generation using a large neural network called GPT-J

Dense Passage Retriever - is a set of tools and models for open domain Q&A task.

Wind Speed Prediction using LSTMs in PyTorch

Library for fast text representation and classification.

LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language

DLO8012: Natural Language Processing & CSL804: Computational Lab - II

An open collection of annotated voices in Japanese language

华为商城抢购手机的Python脚本 Python script of Huawei Store snapping up mobile phones

A program that uses real statistics to choose the best times to bet on BloxFlip's crash gamemode

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Pretrained Japanese BERT models

English loanwords in the world's languages

Image2pcl - Enter the metaverse with 2D image to 3D projections