無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのコア

Last update: Aug 29, 2022

Related tags

Audio voicevox_core

Overview

VOICEVOX CORE

VOICEVOX の音声合成コア。

Releases にビルド済みのコアライブラリ（.so/.dll）があります。

依存関係

CUDA 11.1 と CUDNN のインストールと LibTorch のダウンロードが必要です。

API

core.h をご参照ください。

サンプルの実行

まず Releases からコアライブラリが入った zip をダウンロードしておきます。

Python 3

ソースコードから実行

cd example/python

# example/python のディレクトリにコアライブラリが入った zip ファイルを展開

# Windowsの場合、DLLからLIBファイルの作成
./makelib.bat core

# 環境構築
pip install -r requirements.txt
python setup.py install  # Linuxの場合は先頭に `LIBRARY_PATH="$LIBRARY_PATH:."` が必要

# # うまく行かないときは毎回以下を実行すると良いかも
# python setup.py clean
# rm -r build *.cpp

# 実行（Windowsの場合）
PATH="$PATH:$HOME/libtorch/lib/" python run.py \
    --text "これは本当に実行できているんですか" \
    --speaker_id 1

# 実行（Windows以外の場合）
LD_LIBRARY_PATH="$LD_LIBRARY_PATH:/libtorch/lib/" python run.py \
    --text "これは本当に実行できているんですか" \
    --speaker_id 1

# 引数の紹介
# --text 読み上げるテキスト
# --speaker_id 話者ID
# --use_gpu GPUを使う
# --f0_speaker_id 音高の話者ID（デフォルト値はspeaker_id）
# --f0_correct 音高の補正値（デフォルト値は0。+-0.3くらいで結果が大きく変わります）

「ImportError: DLL load failed: 指定されたモジュールが見つかりません。」というエラーが出た場合は libtorch のパスが間違っているかもしれません。

Docker から

# イメージのビルド
docker build -t voicevox_core example/python

# コンテナの起動(音声を保存しておくボリュームを作成)
docker run -it -v ~/voicevox:/root/voice voicevox_core bash

# テスト音声 `おはようございます-1.wav` を生成
python run.py --text おはようございます --speaker_id 1
mv *.wav ~/voice
exit

# 音声の再生
aplay ~/voice/おはようございます-1.wav

その他の言語

サンプルコードを実装された際はぜひお知らせください。こちらに追記させて頂きます。

事例紹介

VOICEVOX ENGINE SHARP @yamachu ･･･ VOICEVOX ENGINE の C# 実装
Node VOICEVOX Engine @y-chan ･･･ VOICEVOX ENGINE の Node.js/C++ 実装

ライセンス

サンプルコードおよび core.h は MIT LICENSE です。

Releases にあるビルド済みのコアライブラリは別ライセンスなのでご注意ください。

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのコア

Related tags

Overview

VOICEVOX CORE

依存関係

API

サンプルの実行

Python 3

ソースコードから実行

Docker から

その他の言語

事例紹介

ライセンス

You might also like...

Releases(0.13.0-check-directml-rust.0)

0.13.0-check-directml-rust.0(Aug 18, 2022)

check-2022-04-11(Apr 10, 2022)

Owner

Hiroshiba

Open-Source Tools & Data for Music Source Separation: A Pragmatic Guide for the MIR Practitioner

Music generation using ml / dl

Expressive Digital Signal Processing (DSP) package for Python

Audio library for modelling loudness

Basically Play Pauses the song when it is safe to do so. when you die in a round

A collection of python scripts for extracting and analyzing acoustics from audio files.

Bot Music Pintar. Created by Rio

Port Hitsuboku Kumi Chinese CVVC voicebank to deepvocal. / 筆墨クミDeepvocal中文音源

A2DP agent for promiscuous/permissive audio sinc.

A voice assistant which can handle your everyday task and allows you to book items from your favourite store!

Pythonic bindings for FFmpeg's libraries.

Powerful, simple, audio tag editor for GNU/Linux

Algorithmic and AI MIDI Drums Generator Implementation

This is a short program that takes the input from your microphone and uses OpenGL to draw a live colourful pattern

An AI for Music Generation

Musillow is a music recommender app that finds songs similar to your favourites.

Python audio and music signal processing library

Bot duniya Music Player

F.R.I.D.A.Y. ----- Female Replacement Intelligent Digital Assistant Youth

Gateware for the Terasic/Arrow DECA board, to become a USB2 high speed audio interface

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのコア

Related tags

Overview

VOICEVOX CORE

依存関係

API

サンプルの実行

Python 3

ソースコードから実行

Docker から

その他の言語

事例紹介

ライセンス

You might also like...

Releases(0.13.0-check-directml-rust.0)

0.13.0-check-directml-rust.0(Aug 18, 2022)

check-2022-04-11(Apr 10, 2022)

Owner

Hiroshiba

Open-Source Tools & Data for Music Source Separation: A Pragmatic Guide for the MIR Practitioner

Music generation using ml / dl

Expressive Digital Signal Processing (DSP) package for Python

Audio library for modelling loudness

Basically Play Pauses the song when it is safe to do so. when you die in a round

A collection of python scripts for extracting and analyzing acoustics from audio files.

Bot Music Pintar. Created by Rio

Port Hitsuboku Kumi Chinese CVVC voicebank to deepvocal. / 筆墨クミDeepvocal中文音源

A2DP agent for promiscuous/permissive audio sinc.

A voice assistant which can handle your everyday task and allows you to book items from your favourite store!

﻿﻿Pythonic bindings for FFmpeg's libraries.

Powerful, simple, audio tag editor for GNU/Linux

Algorithmic and AI MIDI Drums Generator Implementation

This is a short program that takes the input from your microphone and uses OpenGL to draw a live colourful pattern

An AI for Music Generation

Musillow is a music recommender app that finds songs similar to your favourites.

Python audio and music signal processing library

Bot duniya Music Player

F.R.I.D.A.Y. ----- Female Replacement Intelligent Digital Assistant Youth

Gateware for the Terasic/Arrow DECA board, to become a USB2 high speed audio interface

Pythonic bindings for FFmpeg's libraries.