Python SDK for working with Voicegain Speech-to-Text

Last update: Dec 14, 2022

Overview

Voicegain Speech-to-Text Python SDK

Python SDK for the Voicegain Speech-to-Text API.

This API allows for large vocabulary speech-to-text transcription as well as grammar-based speech recognition. Both real-time and offline use cases are supported.

You can see the core Voicegain API documentation here.

The complete documentation for the API covered by this SDK is available here - this link requires an account on the Voicegain portal - see below for how to sign up.

Requirements

In order to use this API you need account with Voicegain. You can create an account by signing up on Voicegain Portal. No credit card required to sign up.

You can see pricing here - basically, it is 1 cent a minute for off-line and 1.25 cents a minute for real-time. There is a Free Tier of 600 minutes that renews each month.

Installation

From PyPI directly:

pip install voicegain-speech

Examples

sync_transcribe example:

configuration:

" configuration = Configuration() configuration.access_token = JWT api_client = ApiClient(configuration=configuration) ">

from voicegain_speech import ApiClient
from voicegain_speech import Configuration
from voicegain_speech import TranscribeApi
import base64


# configure your JWT token
JWT = "Your 
   
    "
   

configuration = Configuration()
configuration.access_token = JWT

api_client = ApiClient(configuration=configuration)

transcribe local file:

transcribe_api = TranscribeApi(api_client)
file_path = "Your local file path"

with open(file_path, "rb") as f:
    audio_base64 = base64.b64encode(f.read()).decode()

response = transcribe_api.asr_transcribe_post(
    sync_transcription_request={
        "audio": {
            "source": {
                "inline": {
                    "data": audio_base64
                }
            }
        }
    }
)

alternatives = response.result.alternatives
if alternatives:
    local_result = alternatives[0].utterance
    print("result from file: ", local_result)

else:
    local_result = None
    print("no transcription")

More examples can be found in examples folder on our GitHub

Learn more about Voicegain Platform at www.voicegain.ai

In this repository, I have developed an end to end Automatic speech recognition project. I have developed the neural network model for automatic speech recognition with PyTorch and used MLflow to manage the ML lifecycle, including experimentation, reproducibility, deployment, and a central model registry.

End to End Automatic Speech Recognition In this repository, I have developed an end to end Automatic speech recognition project. I have developed the

22 Nov 13, 2022

Speech Recognition for Uyghur using Speech transformer

Speech Recognition for Uyghur using Speech transformer Training: this model using CTC loss and Cross Entropy loss for training. Download pretrained mo

11 Nov 17, 2022

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

Text-Summarization-using-NLP Text Summarization using NLP to fetch BBC News Arti

21 Aug 6, 2022

easySpeech is an open-source Python wrapper for google speech to text API that doesn't require PyAudio(So you especially windows user don't have to deal with the errors while installing PyAudio) and also works with hugging face transformers

easySpeech easySpeech is an open source python wrapper for google speech to text api that doesn't require PyAaudio(So you specially windows user don't

14 May 24, 2022

Text to speech converter with GUI made in Python.

Python SDK for working with Voicegain Speech-to-Text

Related tags

Overview

Voicegain Speech-to-Text Python SDK

Requirements

Installation

Examples

You might also like...

Speech Recognition for Uyghur using Speech transformer

Text-Summarization-using-NLP - Text Summarization using NLP to fetch BBC News Article and summarize its text and also it includes custom article Summarization

easySpeech is an open-source Python wrapper for google speech to text API that doesn't require PyAudio(So you especially windows user don't have to deal with the errors while installing PyAudio) and also works with hugging face transformers

Text to speech converter with GUI made in Python.

A relatively simple python program to generate one of those reddit text to speech videos dominating youtube.

This is a really simple text-to-speech app made with python and tkinter.

ttslearn: Library for Pythonで学ぶ音声合成 (Text-to-speech with Python)

A Python wrapper for simple offline real-time dictation (speech-to-text) and speaker-recognition using Vosk.

Releases(1.73.0)

1.73.0(Jan 6, 2023)

1.72.0(Dec 15, 2022)

1.71.1(Dec 9, 2022)

1.71.0(Dec 8, 2022)

1.70.2(Nov 23, 2022)

1.70.1(Nov 22, 2022)

1.70.0(Nov 22, 2022)

1.69.0(Nov 17, 2022)

1.68.1(Nov 11, 2022)

1.68.0(Oct 28, 2022)

1.67.0(Oct 25, 2022)

1.66.1(Oct 21, 2022)

1.66.0(Oct 18, 2022)

1.65.0(Sep 27, 2022)

1.64.1(Sep 19, 2022)

1.64.0(Sep 15, 2022)

1.63.0(Sep 7, 2022)

1.62.1(Aug 30, 2022)

1.62.0(Aug 26, 2022)

1.61.0(Aug 18, 2022)

1.60.4(Aug 11, 2022)

1.60.3(Jul 6, 2022)

1.60.2(Jun 30, 2022)

1.60.1(Jun 22, 2022)

1.60.0(Jun 17, 2022)

1.59.2(Jun 15, 2022)

1.59.1(Jun 9, 2022)

1.59.0(Jun 1, 2022)

1.58.1(May 24, 2022)

1.58.0(May 24, 2022)

Owner

Voicegain

:id: A python library for accurate and scalable fuzzy matching, record deduplication and entity-resolution.

PyTorch implementation of "data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language" from Meta AI

Python library to make development of portfolio analysis faster and easier

ASCEND Chinese-English code-switching dataset

CJK computer science terms comparison / 中日韓電腦科學術語對照 / 日中韓のコンピュータ科学の用語対照 / 한·중·일 전산학 용어 대조

translate using your voice

A Domain Specific Language (DSL) for building language patterns. These can be later compiled into spaCy patterns, pure regex, or any other format

Associated Repository for "Translation between Molecules and Natural Language"

Active learning for text classification in Python

Train 🤗transformers with DeepSpeed: ZeRO-2, ZeRO-3

A natural language modeling framework based on PyTorch

Free and Open Source Machine Translation API. 100% self-hosted, offline capable and easy to setup.

Code Implementation of "Learning Span-Level Interactions for Aspect Sentiment Triplet Extraction".

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021

A fast and lightweight python-based CTC beam search decoder for speech recognition.

Flaxformer: transformer architectures in JAX/Flax

Wake: Context-Sensitive Automatic Keyword Extraction Using Word2vec

NLP codes implemented with Pytorch (w/o library such as huggingface)

Constituency Tree Labeling Tool

Simple bots or Simbots is a library designed to create simple bots using the power of python. This library utilises Intent, Entity, Relation and Context model to create bots .