An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".

Last update: Jun 16, 2022

Related tags

Computer Vision AutoVC

Overview

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

This is an unofficial implementation of AutoVC based on the official one.

The repository is still under construction, so some details may be missing or incomplete.

Preprocessing

python preprocess.py <data_path> <save_path> <encoder_path> [--seg_len seg] [--n_workers workers]

Training

python train.py <config> <data_path> <save_path> [--n_steps steps] [--save_steps save] [--log_steps log] [--batch_size batch] [--seg_len seg]

Reference

Please cite the paper if you find it useful.

@InProceedings{pmlr-v97-qian19c,
  title = {{A}uto{VC}: Zero-Shot Voice Style Transfer with Only Autoencoder Loss},
  author = {Qian, Kaizhi and Zhang, Yang and Chang, Shiyu and Yang, Xuesong and Hasegawa-Johnson, Mark},
  pages = {5210--5219},
  year = {2019},
  editor = {Kamalika Chaudhuri and Ruslan Salakhutdinov},
  volume = {97},
  series = {Proceedings of Machine Learning Research},
  address = {Long Beach, California, USA},
  month = {09--15 Jun},
  publisher = {PMLR},
  pdf = {http://proceedings.mlr.press/v97/qian19c/qian19c.pdf},
  url = {http://proceedings.mlr.press/v97/qian19c.html}
}

An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".

Related tags

Overview

AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss

Preprocessing

Training

Reference

Owner

Chien-yu Huang

This is a pytorch re-implementation of EAST: An Efficient and Accurate Scene Text Detector.

Awesome multilingual OCR toolkits based on PaddlePaddle （practical ultra lightweight OCR system, provide data annotation and synthesis tools, support training and deployment among server, mobile, embedded and IoT devices）

Text recognition (optical character recognition) with deep learning methods.

Distilling Knowledge via Knowledge Review, CVPR 2021

chineseocr/table_line 表格线检测模型pytorch版

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector

A real-time dolly zoom camera effect

MXNet OCR implementation. Including text recognition and detection.

(CVPR 2021) Back-tracing Representative Points for Voting-based 3D Object Detection in Point Clouds

📷 This repository is focused on having various feature implementation of OpenCV in Python.

Go package for OCR (Optical Character Recognition), by using Tesseract C++ library

A simple python program to record security cam footage by detecting a face and body of a person in the frame.

A curated list of papers, code and resources pertaining to image composition

Slice a single image into multiple pieces and create a dataset from them

Course material for the Multi-agents and computer graphics course

RRD: Rotation-Sensitive Regression for Oriented Scene Text Detection

Corner-based Region Proposal Network

scene-linear test images

A PyTorch implementation of ECCV2018 Paper: TextSnake: A Flexible Representation for Detecting Text of Arbitrary Shapes

轻量级公式 OCR 小工具：一键识别各类公式图片，并转换为 LaTeX 格式