vits chinese, tts chinese, tts mandarin

Last update: Dec 14, 2022

Related tags

Text Data & NLP tts

Overview

Chinese TTS implemented with VITS

This is a copy of https://github.com/jaywalnut310/vits

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

ESPnet link: github.com/espnet/espnet/tree/master/espnet2/gan_tts/vits

coqui-ai/TTS link: github.com/coqui-ai/TTS/tree/main/recipes/ljspeech/vits_tts

If this project infringes on any rights, please contact me and I will delete it.

Notes on building a 16 kHz Baker TTS with VITS

apt-get install espeak

pip install -r requirements.txt

cd monotonic_align

python setup.py build_ext --inplace

Copy the 16 kHz Baker (标贝) audio into ./baker_waves/ and start training:

python train.py -c configs/baker_base.json -m baker_base
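Because the recipe expects 16 kHz audio, it may be worth checking the sample rate of the copied files before starting a multi-day run. The following is a minimal sketch using only the Python standard library. The helper name `find_wrong_rate` is hypothetical and not part of this repository, and the sketch assumes the recordings are PCM WAV files placed directly under ./baker_waves/.

```python
import wave
from pathlib import Path

EXPECTED_RATE = 16000  # the recipe above assumes 16 kHz audio


def find_wrong_rate(wav_dir, expected_rate=EXPECTED_RATE):
    """Return (filename, rate) for every .wav in wav_dir whose sample rate differs.

    Hypothetical helper for illustration; assumes PCM WAV files that the
    standard-library wave module can open.
    """
    mismatches = []
    for path in sorted(Path(wav_dir).glob("*.wav")):
        with wave.open(str(path), "rb") as wf:
            rate = wf.getframerate()
        if rate != expected_rate:
            mismatches.append((path.name, rate))
    return mismatches
```

Calling `find_wrong_rate("./baker_waves")` returns an empty list when every file is already at 16 kHz; any entries it does return would need resampling before training.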

With two 1080 GPUs, the model is basically usable after two days of training.

Test

python vits_strings.py

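The model trained with the steps above has a problem: noticeable pauses in the synthesized speech.

Causes:

1. Boundary markers were already forcibly inserted after the phonemes, and VITS then inserts boundaries again. The relevant config parameter is "add_blank": true

2. Stochastic duration prediction may also contribute. The relevant config parameter is use_sdp=True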
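One way to test both suspected causes is to train from a variant config with the two switches turned off. The sketch below is only an illustration under assumptions: it presumes configs/baker_base.json follows the upstream VITS layout, with `add_blank` under the "data" section and `use_sdp` under the "model" section. The helper names and the output filename are hypothetical. The source does not confirm that this change removes the pauses.

```python
import copy
import json


def without_pause_sources(config):
    """Return a copy of a VITS-style config with add_blank and use_sdp disabled.

    Hypothetical helper; the key locations are assumptions based on the
    upstream VITS config layout, not verified against baker_base.json.
    """
    patched = copy.deepcopy(config)
    # Assumed location: data.add_blank controls blank insertion between symbols
    patched.setdefault("data", {})["add_blank"] = False
    # Assumed location: model.use_sdp selects the stochastic duration predictor
    patched.setdefault("model", {})["use_sdp"] = False
    return patched


def write_variant(src_path, dst_path):
    """Read a JSON config, disable both switches, and write the result."""
    with open(src_path, encoding="utf-8") as f:
        config = json.load(f)
    with open(dst_path, "w", encoding="utf-8") as f:
        json.dump(without_pause_sources(config), f, indent=2, ensure_ascii=False)
```

For example, `write_variant("configs/baker_base.json", "configs/baker_nopause.json")` would produce a second config (the name baker_nopause.json is made up here) that can be passed to train.py with -c. The same config should then be used at test time, so that text preprocessing matches what the model saw during training.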

Owner

AmorTX