A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Last update: Jul 14, 2022

Overview

WaveGlow

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Quick Start:

Install requirements:

pip install -r requirements.txt

Download dataset:

wget http://festvox.org/cmu_arctic/cmu_arctic/packed/cmu_us_slt_arctic-0.95-release.tar.bz2
tar xf cmu_us_slt_arctic-0.95-release.tar.bz2

Extract features: feature extracting pipeline is the same as tacotron
Training with default hyperparams:

python train.py

Synthesize from model:

python generate.py --checkpoint=/path/to/model --local_condition_file=/path/to/local_conditon

Notes:

This is not official implementation, some details are not necessarily correct.
Work in progress.

Owner

Yuchao Zhang

speech synthesis/machine learning

GitHub Repository

Video-Music Transformer

VMT Video-Music Transformer (VMT) is an attention-based multi-modal model, which generates piano music for a given video. Paper https://arxiv.org/abs/

5 Jul 13, 2022

Keras-1D-NN-Classifier

Keras-1D-NN-Classifier This code is based on the reference codes linked below. reference 1, reference 2 This code is for 1-D array data classification

6 May 18, 2021

Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks

Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks This is a Pytorch-Lightning implementation of the paper "Self-s

111 Dec 06, 2022

Pytorch implementation of Nueral Style transfer

Nueral Style Transfer Pytorch implementation of Nueral style transfer algorithm , it is used to apply artistic styles to content images . Content is t

9 Oct 15, 2022

kullanışlı ve işinizi kolaylaştıracak bir araç

Hey merhaba! işte çok sorulan sorularının cevabı ve sorunlarının çözümü; Soru= İçinde var denilen birçok şeyi göremiyorum bunun sebebi nedir? Cevap= B

16 Dec 17, 2022

Powerful and efficient Computer Vision Annotation Tool (CVAT)

Computer Vision Annotation Tool (CVAT) CVAT is free, online, interactive video and image annotation tool for computer vision. It is being used by our

8.6k Jan 01, 2023

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

Colorizer The point of this project is to write a program capable of taking a black and white / grayscale image, and generating a realistic or plausib

1 Jan 06, 2022

A Simple LSTM-Based Solution for "Heartbeat Signal Classification and Prediction" in Tianchi

LSTM-Time-Series-Prediction A Simple LSTM-Based Solution for "Heartbeat Signal Classification and Prediction" in Tianchi Contest. The Link of the Cont

1 Jun 13, 2022

training script for space time memory network

Trainig Script for Space Time Memory Network This codebase implemented training code for Space Time Memory Network with some cyclic features. Requirem

100 Dec 20, 2022

Image augmentation library in Python for machine learning.

Augmentor is an image augmentation library in Python for machine learning. It aims to be a standalone library that is platform and framework independe

4.8k Jan 07, 2023

This is a Pytorch implementation of the paper: Self-Supervised Graph Transformer on Large-Scale Molecular Data.

212 Dec 25, 2022

Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised manner.

WSDEC This is the official repo for our NeurIPS paper Weakly Supervised Dense Event Captioning in Videos. Description Repo directories ./: global conf

96 Nov 01, 2022

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

Related tags

Overview

WaveGlow

Quick Start:

Notes:

Owner

Yuchao Zhang

Video-Music Transformer

Keras-1D-NN-Classifier

Self-supervised Point Cloud Prediction Using 3D Spatio-temporal Convolutional Networks

Pytorch implementation of Nueral Style transfer

kullanışlı ve işinizi kolaylaştıracak bir araç

Powerful and efficient Computer Vision Annotation Tool (CVAT)

This Artificial Intelligence program can take a black and white/grayscale image and generate a realistic or plausible colorized version of the same picture.

A Simple LSTM-Based Solution for "Heartbeat Signal Classification and Prediction" in Tianchi

training script for space time memory network

Image augmentation library in Python for machine learning.

This is a Pytorch implementation of the paper: Self-Supervised Graph Transformer on Large-Scale Molecular Data.

Weakly Supervised Dense Event Captioning in Videos, i.e. generating multiple sentence descriptions for a video in a weakly-supervised manner.

Res2Net for Instance segmentation and Object detection using MaskRCNN

Large Scale Fine-Grained Categorization and Domain-Specific Transfer Learning. CVPR 2018

A GUI to automatically create a TOPAS-readable MLC simulation file

Person Re-identification

Code for "Learning the Best Pooling Strategy for Visual Semantic Embedding", CVPR 2021

The pytorch implementation of SOKD (BMVC2021).

使用yolov5训练自己数据集(详细过程)并通过flask部署

Python package to add text to images, textures and different backgrounds