A collection of python scripts for extracting and analyzing acoustics from audio files.

Last update: Dec 26, 2022

Related tags

Audio pyAcoustics

Overview

pyAcoustics

https://img.shields.io/badge/license-MIT-blue.svg?

A collection of python scripts for extracting and analyzing acoustics from audio files.

Contents

1 Common Use Cases
2 Major revisions
3 Features as they are added
4 Requirements
5 Installation
6 Example usage
7 Citing LMEDS
8 Acknowledgements

2 Major revisions

Ver 1.0 (June 7, 2015)

first public release.

3 Features as they are added

Mask speech with speech shaped noise (March 21, 2016)

Find syllable nuclei/estimate speech rate using Uwe Reichel's matlab code (July 29, 2015)

Find the valley bottom between peaks (July 7th, 2015)

4 Requirements

Many of the individual features require different packages. If you aren't using those packages then you don't need to install the dependencies.

pyacoustics.intensity_and_pitch.praat_pi requires praat

pyacoustics.intensity_and_pitch.get_f0 requires the ESPS getF0 function as implemented by Snack although I recall having difficulty installing it.

pyacoustics.speech_rate/dictionary_estimate.py requires my library psyle

pyacoustics.signals.data_fitting.py requires SciPy, NumPy, and scikit-learn

My praatIO library is used extensively and can be downloaded here

5 Installation

If you on Windows, you can use the installer found here (check that it is up to date though) Windows installer

PyAcoustics is on pypi and can be installed or upgraded from the command-line shell with pip like so:

python -m pip install pyacoustics --upgrade

Otherwise, to manually install, after downloading the source from github, from a command-line shell, navigate to the directory containing setup.py and type:

python setup.py install

If python is not in your path, you'll need to enter the full path e.g.:

C:\Python36\python.exe setup.py install

6 Example usage

See the example folders for a few real-world examples using this library.

examples/split_audio_on_silence.py

Detects the presence of speech in a recording based on acoustic intensity. Everything louder than some threshold specified by the user is considered speech.
examples/split_audio_on_tone.py

Detects the presence of pure tones in a recording. One can use this to automatically segment stimuli. Beeps can be played while the speech is being recorded and then later this tool can automatically segment the speech, based on the presence of those tones.

Also detects speech using a pitch analysis. Most syllables contain some voicing, so a stream of modulating pitch values suggests that someone is speaking. This aspect is not extensively tested but it works well for the example files.
examples/estimate_speech_rate.py

Calculates the speech rate through a matlab script written by Uwe Reichel that estimates the location of syllable boundaries.

7 Citing LMEDS

PyAcoustics is general purpose coding and doesn't need to be cited but if you would like to, it can be cited like so:

Tim Mahrt. PyAcoustics. https://github.com/timmahrt/pyAcoustics, 2016.

PyAcoustics is an ongoing collection of code with contributions from a number of projects worked on over several years. Development of various aspects of PyAcoustics was possible thanks to NSF grant IIS 07-03624 to Jennifer Cole and Mark Hasegawa-Johnson, NSF grant BCS 12-51343 to Jennifer Cole, José Hualde, and Caroline Smith, and NSF grant IBSS SMA 14-16791 to Jennifer Cole, Nancy McElwain, and Daniel Berry.

A collection of python scripts for extracting and analyzing acoustics from audio files.

Related tags

Overview

pyAcoustics

1 Common Use Cases

2 Major revisions

3 Features as they are added

4 Requirements

5 Installation

6 Example usage

7 Citing LMEDS

8 Acknowledgements

Owner

Tim

A Python library and tools AUCTUS A6 based radios.

AudioDVP:Photorealistic Audio-driven Video Portraits

BART aids transcribe tasks by taking a source audio file and creating automatic repeated loops, allowing transcribers to listen to fragments multiple times

Python library for handling audio datasets.

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

A useful tool to generate chord progressions according to melody MIDIs

Klangbecken: The RaBe Endless Music Player

Automatically move or copy files based on metadata associated with the files. For example, file your photos based on EXIF metadata or use MP3 tags to file your music files.

Guide & Examples to create deeplearning gstreamer plugins and use them in your pipeline

GNOME powered sound conversion

convert-to-opus-cli is a Python CLI program for converting audio files to opus audio format.

Real-Time Spherical Microphone Renderer for binaural reproduction in Python

Tradutor de um arquivo MIDI para ser usado em um simulador RISC-V(RARS)

Terminal-based music player written in Python for the best music in the world 🎵 🎧 💻

Synchronize a local directory of songs' (MP3, MP4) metadata (genre, ratings) and playlists with a Plex server.

This is an AI that runs in the terminal. It is a voice assistant that can do common activities and can also help in your coding doubts like

ᴀ ʙᴏᴛ ᴛʜᴀᴛ ᴄᴀɴ ᴘʟᴀʏ ᴍᴜꜱɪᴄ ɪɴ ᴛᴇʟᴇɢʀᴀᴍ ɢʀᴏᴜᴘ ᴏɴ ᴠᴏɪᴄᴇ ᴄᴀʟʟ

A python program to cut longer MP3 files (i.e. recordings of several songs) into the individual tracks.

Analyze, visualize and process sound field data recorded by spherical microphone arrays.

:notes: Cross-platform music player