Skip to content

lbqin/SpeechSynthesis

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 

Repository files navigation

Text-to-Speech Synthesis

Voice synthesis related materials using deep learning

Lectures & Seminars

Dataset

Korean Corpus

WaveNet

Paper

Articles

Source Code

Multi-GPU

WaveNet takes too long to learn, so I do not seem to get the answer unless I use a multi-GPU. The related code links are summarized.

Fast WaveNet

Paper

Articles

Source Code

Parallel WaveNet

Paper

Articles

Source Code

WaveRNN

Paper

Deep Voice

Paper

Deep Voice 2

Paper

Deep Voice 3

Paper

Source Code

Tacotron

Paper

Source Code

Tacotron 2

Paper

Articles

Source Code

HybridNet

ClariNet

Articles

Demo

Voice Cloning

Paper

Speed ​​Up Strategy

  • Fast Generation for Convolutional Autoregressive Models (2017.04) - Prajit Ramachandran et al.
    • This technique was applied to Wavenet and PixelCNN ++ models, and it was said that there was a speed increase of up to 21 times and 183 times, respectively. It is important to note that the speed improvement may not be greater than expected in a real environment because it is the maximum performance improvement for a specific situation.

About

语音合成综述

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published