Terminal-based audio-to-text converter

Overview

att

Terminal-based audio-to-text converter


GitHub language count GitHub top language GitHub search hit counter GitHub code size in bytes GitHub repo size GitHub last commit


Project description

A terminal-based audio-to-text converter written in python, enabling you to convert .wav files or microphone input into text and save it to a file.

Requirements

To run the main python modules att_wav.py and mtt.py, you need to install the following packages:

  • speech_recognition
  • pydub
  • time
  • pyaudio

The installation method depends on the environment/ package manager you are using. The following examples show the installation of pydub for a standard python environment with pip and for an Anaconda environment via conda.

pip install pydub
conda install -c conda-forge pydub

License

This code is licensed under GPL-3.0 License.

Usage

To convert an audio file to text, start a terminal session, navigate to the location of the required module (e.g. att_wav.py) and start a python shell running the code by typing python att_wav.py.

Note that the att_wav.py can only handle .wav files due to the implementation of the underlying speech recognition API.

Hardware & Software Requirements

These programs can be run without much computing power. They can be executed on any modern device fullfilling minimal RAM/ CPU standards.

Owner
Sven Eschlbeck
"The more I C, the less I see."
Sven Eschlbeck
Okaeri-Music is a telegram music bot project, allow you to play music on voice chat group telegram.

πŸ—„οΈ PROJECT MUSIC,THIS IS MAINTAINED Okaeri-Music is a telegram bot project that's allow you to play music on telegram voice chat group Features πŸ”₯ Th

Okaeri-Project 2 Dec 23, 2021
α΄€ ʙᴏᴛ α΄›Κœα΄€α΄› α΄„α΄€Ι΄ α΄˜ΚŸα΄€Κ ᴍᴜꜱΙͺα΄„ ΙͺΙ΄ α΄›α΄‡ΚŸα΄‡Ι’Κ€α΄€α΄ Ι’Κ€α΄α΄œα΄˜ ᴏɴ ᴠᴏΙͺᴄᴇ α΄„α΄€ΚŸΚŸ

GJ516 LOVER'S Δ±Δ±llΔ±llΔ± β™₯️ βž€βƒGᴊ516_ᴍᴜꜱΙͺα΄„_ʙᴏᴛ β™₯️ Δ±llΔ±llΔ± α΄€ ʙᴏᴛ α΄›Κœα΄€α΄› α΄„α΄€Ι΄ α΄˜ΚŸα΄€Κ ᴍᴜꜱΙͺα΄„ ΙͺΙ΄ α΄›α΄‡ΚŸα΄‡Ι’Κ€α΄€α΄ Ι’Κ€α΄α΄œα΄˜ ᴏɴ ᴠᴏΙͺᴄᴇ α΄„α΄€ΚŸΚŸ Requirements πŸ“ FFmpeg NodeJS nodesou

1 Nov 22, 2021
Algorithmic and AI MIDI Drums Generator Implementation

Algorithmic and AI MIDI Drums Generator Implementation

Tegridy Code 8 Dec 30, 2022
𝙰 π™Όπšžπšœπš’πšŒ π™±πš˜πš π™²πš›πšŽπšŠπšπšŽπš π™±πš’ πšƒπšŽπšŠπš–π™³πš•πš πŸ’–

TeamDltmusic 𝙰 π™Όπšžπšœπš’πšŒ π™±πš˜πš π™²πš›πšŽπšŠπšπšŽπš π™±πš’ πšƒπšŽπšŠπš–π™³πš•πš πŸ’– Deploy String Session String Click hear you can find string session OR join He

TeamDlt 5 Jan 18, 2022
Spotify Song Recommendation Program

Spotify-Song-Recommendation-Program Made by Esra Nur Γ–zΓΌm Written in Python The aim of this project was to build a recommendation system that recommen

esra nur ΓΆzΓΌm 1 Jun 30, 2022
Dataset and baseline code for the VocalSound dataset (ICASSP2022).

VocalSound: A Dataset for Improving Human Vocal Sounds Recognition Introduction Citing Download VocalSound Dataset Details Baseline Experiment Contact

Yuan Gong 58 Jan 03, 2023
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

⚠️ Checkout develop branch to see what is coming in pyannote.audio 2.0: a much smaller and cleaner codebase Python-first API (the good old pyannote-au

pyannote 2.1k Dec 31, 2022
Audio Retrieval with Natural Language Queries: A Benchmark Study

Audio Retrieval with Natural Language Queries: A Benchmark Study Paper | Project page | Text-to-audio search demo This repository is the implementatio

21 Oct 31, 2022
Full LAKH MIDI dataset converted to MuseNet MIDI output format (9 instruments + drums)

LAKH MuseNet MIDI Dataset Full LAKH MIDI dataset converted to MuseNet MIDI output format (9 instruments + drums) Bonus: Choir on Channel 10 Please CC

Alex 6 Nov 20, 2022
The venturimeter works on the principle of Bernoulli's equation, i.e., the pressure decreases as the velocity increases.

The venturimeter works on the principle of Bernoulli's equation, i.e., the pressure decreases as the velocity increases. The cross-section of the throat is less than the cross-section of the inlet pi

Shankar Mahadevan L 1 Dec 03, 2021
commonfate πŸ“¦commonfate πŸ“¦ - Common Fate Model and Transform.

Common Fate Transform and Model for Python This package is a python implementation of the Common Fate Transform and Model to be used for audio source

Fabian-Robert StΓΆter 18 Jan 08, 2022
Python game programming in Jupyter notebooks.

Jupylet Jupylet is a Python library for programming 2D and 3D games, graphics, music and sound synthesizers, interactively in a Jupyter notebook. It i

Nir Aides 178 Dec 09, 2022
A python program to cut longer MP3 files (i.e. recordings of several songs) into the individual tracks.

I'm writing a python script to cut longer MP3 files (i.e. recordings of several songs) into the individual tracks called ReCut. So far there are two

DΓΆnerspiess 1 Oct 27, 2021
Basically Play Pauses the song when it is safe to do so. when you die in a round

Basically Play Pauses the song when it is safe to do so. when you die in a round

AG_1436 1 Feb 13, 2022
Small Python application that links a Digico console and Reaper, handling automatic marker insertion and tracking.

Digico-Reaper-Link This is a small GUI based helper application designed to help with using Digico's Copy Audio function with a Reaper DAW used for re

Justin Stasiw 10 Oct 24, 2022
Noinoi music is smoothly playing music on voice chat of telegram.

NOINOI MUSIC BOT ✨ Features Music & Video stream support MultiChat support Playlist & Queue support Skip, Pause, Resume, Stop feature Music & Video do

2 Feb 13, 2022
An Amazon Music client for Linux (unpretentious)

Amusiz An Amazon Music client for Linux (unpretentious) ↗️ Install You can install Amusiz in multiple ways, choose your favorite. πŸš€ AppImage Here you

Mirko Brombin 25 Nov 08, 2022
Praat in Python, the Pythonic way

Parselmouth - Praat in Python, the Pythonic way Parselmouth is a Python library for the Praat software. Though other attempts have been made at portin

Yannick Jadoul 786 Jan 09, 2023
Speech recognition module for Python, supporting several engines and APIs, online and offline.

SpeechRecognition Library for performing speech recognition, with support for several engines and APIs, online and offline. Speech recognition engine/

Anthony Zhang 6.7k Jan 08, 2023
nicfit 425 Jan 01, 2023