The speech synthesis engine of VOICEVOX, a free, medium-quality text-to-speech software

Last update: Jul 05, 2022

Overview

VOICEVOX ENGINE

The speech synthesis engine of VOICEVOX. It is actually an HTTP server, so you can synthesize speech from text by sending requests to it.

API documentation

With the VOICEVOX software running, open http://localhost:50021/docs in a browser to view the documentation.
The page on integrating with the VOICEVOX speech synthesis engine may also be a useful reference.
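
If the /docs page is backed by an OpenAPI schema, the available endpoints can also be listed from a script. The sketch below is only an illustration: it assumes the schema is served at `/openapi.json` (the default location for FastAPI applications, not something this README states), and `list_endpoints` and `fetch_schema` are hypothetical helper names.

```python
import json
import urllib.request
from typing import List, Tuple


def list_endpoints(schema: dict) -> List[Tuple[str, str]]:
    """Return (METHOD, path) pairs from a parsed OpenAPI schema, sorted by path."""
    http_methods = {"get", "post", "put", "patch", "delete", "head", "options"}
    pairs = []
    for path, operations in schema.get("paths", {}).items():
        for method in operations:
            # Skip non-operation keys such as "parameters" or "summary".
            if method.lower() in http_methods:
                pairs.append((method.upper(), path))
    return sorted(pairs, key=lambda pair: (pair[1], pair[0]))


def fetch_schema(base_url: str = "http://localhost:50021") -> dict:
    """Download the schema from a running engine.

    ASSUMPTION: the schema lives at /openapi.json; adjust if your server differs.
    """
    with urllib.request.urlopen(f"{base_url}/openapi.json") as res:
        return json.load(res)
```

With the engine running, `list_endpoints(fetch_schema())` would return the method and path of each documented operation, provided the assumption about the schema location holds.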

Sample code for synthesizing speech with HTTP requests

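text="ABCDEFG"

# Step 1: create a synthesis query for the text and save it as JSON.
# Note: $text is inserted into the URL as-is, so text containing spaces
# or non-ASCII characters must be URL-encoded first.
curl -s \
    -X POST \
    "localhost:50021/audio_query?text=$text&speaker=1" \
    > query.json

# Step 2: send the query back to synthesize the audio and save it as WAV.
# The URL is quoted so the shell does not treat "?" as a glob character.
curl -s \
    -H "Content-Type: application/json" \
    -X POST \
    -d @query.json \
    "localhost:50021/synthesis?speaker=1" \
    > audio.wav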
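
The same two-step flow (POST to `/audio_query`, then POST the returned JSON to `/synthesis`) can be sketched in Python with only the standard library. This is a minimal illustration rather than an official client: the function names are hypothetical, and the base URL and `speaker=1` are taken from the curl sample above. Unlike the shell version, the text is URL-encoded, so non-ASCII input should also work.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "http://localhost:50021"  # address used in the curl sample


def build_audio_query_request(text: str, speaker: int,
                              base_url: str = BASE_URL) -> urllib.request.Request:
    """Build the POST request for /audio_query with URL-encoded parameters."""
    params = urllib.parse.urlencode({"text": text, "speaker": speaker})
    return urllib.request.Request(f"{base_url}/audio_query?{params}", method="POST")


def build_synthesis_request(query: dict, speaker: int,
                            base_url: str = BASE_URL) -> urllib.request.Request:
    """Build the POST request for /synthesis with the query as a JSON body."""
    params = urllib.parse.urlencode({"speaker": speaker})
    return urllib.request.Request(
        f"{base_url}/synthesis?{params}",
        data=json.dumps(query).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text: str, speaker: int = 1, base_url: str = BASE_URL) -> bytes:
    """Run both steps against a running engine and return the WAV bytes."""
    with urllib.request.urlopen(build_audio_query_request(text, speaker, base_url)) as res:
        query = json.load(res)
    with urllib.request.urlopen(build_synthesis_request(query, speaker, base_url)) as res:
        return res.read()
```

With the engine running, writing the bytes returned by `synthesize("ABCDEFG")` to `audio.wav` in binary mode would mirror the curl sample.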

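For contributors

When creating a pull request to resolve an Issue, please either announce on the Issue that you have started working on it or open a Draft pull request first, so that you do not end up working on the same Issue as someone else.

Environment setup

# Install the libraries needed for development
pip install -r requirements-test.txt

# If you just want to run it, use this instead
pip install -r requirements.txt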

Running

# Start the server with the release version of VOICEVOX
VOICEVOX_DIR="C:/path/to/voicevox" # Path to the release-version VOICEVOX directory
python run.py --voicevox_dir=$VOICEVOX_DIR

# Start the server with the mock
python run.py
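
When scripting against the engine, it can help to wait until the server accepts connections before sending requests. The sketch below polls a TCP port until it opens. It assumes the server started by `run.py` listens on localhost:50021, the address used elsewhere in this README; `wait_for_port` is a hypothetical helper name.

```python
import socket
import time


def wait_for_port(host: str = "localhost", port: int = 50021,
                  timeout: float = 30.0, interval: float = 0.5) -> bool:
    """Return True once a TCP connection to host:port succeeds, False on timeout."""
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=interval):
                return True
        except OSError:
            if time.monotonic() >= deadline:
                return False
            time.sleep(interval)
```

Note that an open port only shows that a process is listening, not that the engine has finished loading, so the first request may still need a retry.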

Code formatting

Formats the code. Please run this before submitting a pull request.

pysen run format lint

Build

Build Tools for Visual Studio 2019 is required.

pip install -r requirements-dev.txt

python -m nuitka \
    --standalone \
    --plugin-enable=numpy \
    --follow-import-to=numpy \
    --follow-import-to=aiofiles \
    --include-package=uvicorn \
    --include-package-data=pyopenjtalk \
    --include-data-file=VERSION.txt=./ \
    --include-data-file=speakers.json=./ \
    --include-data-file=C:/path/to/voice_library/Release/*.dll=./ \
    --include-data-file=C:/path/to/voice_library/*.bin=./ \
    --include-data-dir=.venv/Lib/site-packages/_soundfile_data=./_soundfile_data \
    --msvc=14.2 \
    --follow-imports \
    --no-prefer-source-code \
    run.py

License

Dual-licensed under LGPL v3 and a separate license that does not require publishing source code. If you would like to obtain the separate license, please ask Hiho (twitter: @hiho_karuta).

Releases(check-code-sign-8)

check-code-sign-8(Jul 10, 2022)

Owner

Hiroshiba
