User-friendly Voice Cloning Application

Last update: Dec 30, 2022

Overview

Multi-Language-RTVC stands for Multi-Language Real-Time Voice Cloning. It is a voice cloning tool that extracts speaker-specific audio features from just a few seconds of previously unseen audio and uses them to synthesize speech in that speaker's voice.

License

This code is licensed under the MIT License. For more information regarding the license model or the associated duties and rights, see the license text included with the project.

Project History

This project was started in 2021 with the goal of continuing Corentin Jemine's Real-Time-Voice-Cloning. It originated from the wish for multi-language support in voice cloning models and is now maintained and enhanced by contributing volunteers.

Contributing

We welcome everyone interested in the project, from beginners to experts. The MLRTVC community aims for a friendly, open-minded and efficient working climate, and we encourage anyone with ideas to take part by sharing their thoughts.
There are multiple meaningful ways of contributing:

Developing code (new features, fixes, enhancements)
Writing documentation
Raising issues (bugs, feature requests, enhancement proposals, code refactoring, etc.)
Providing pre-trained models
Participating in community tasks (code reviews, discussions, maintenance, etc.)

For transparency reasons, we ask you to engage with this project through the official channels (issues, pull requests) so that knowledge and questions are shared publicly. Other communication channels (email, chat, etc.) are accepted only in cases where privacy or confidentiality is of great importance.

Further information can be found in the Contributing Guidelines.

Owner

Sven Eschlbeck
