Skip to content

foamliu/GST-Tacotron

Repository files navigation

Tacotron 2

A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis.

image

Dataset

THCHS-30 Dataset.

Dependency

  • Python 3.5.2
  • PyTorch 1.0.0

Usage

Data Pre-processing

Extract data_thchs30.tgz and generate features:

$ python extract.py
$ python pre_process.py

Train

$ python train.py

If you want to visualize during training, run in your terminal:

$ tensorboard --logdir runs

Demo

Generate mel-spectrogram for text "相对论直接和间接的催生了量子力学的诞生 也为研究微观世界的高速运动确立了全新的数学模型"

$ python demo.py

image

About

PyTorch implementation of Style Tokens

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages