PyTacotron

PyTorch implementation of Tacotron: Towards End-to-End Speech Synthesis, and
PyTorch implementation of Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions (Tacotron 2).

Features

Branches

New configurations can be created by merging features from the following branches; see the example after the list.

  • master: Basic Tacotron and Tacotron2 implementation

  • dynamic_r: Dynamic reduction factor (r) changing along with training schedule

  • gst: Global style token (GST) support

  • multispeaker: Multi-speaker support with speaker embeddings
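For example, to combine GST and multi-speaker support in a new working branch (a hypothetical combination; the merge may need manual conflict resolution):

    git checkout -b gst_multispeaker gst
    git merge multispeaker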

Setup

  1. Prepare the DATASET directory
    • Prepare the train.csv.txt and val.csv.txt files
    • Point training_files and validation_files in hparams.py to these two files respectively
    • Modify files_to_list in utils/dataset.py as needed so that it yields (mel_file_path, text) pairs; see the sketch after this list
  2. Install PyTorch
  3. Install the Python requirements or build the Docker image
    • Install Python requirements: pip install -r requirements.txt
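A minimal sketch of what files_to_list might look like, assuming each line of train.csv.txt is pipe-delimited as mel_file_path|text (the delimiter and column order are assumptions; adapt them to your dataset):

    # utils/dataset.py (sketch)
    def files_to_list(filename):
        """Parse a metadata file into (mel_file_path, text) pairs.

        Assumes one 'mel_file_path|text' entry per line.
        """
        with open(filename, encoding='utf-8') as f:
            return [line.strip().split('|', 1) for line in f if line.strip()]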

Training

Training from scratch

  1. python train.py -o outdir -l logdir
  2. (OPTIONAL) tensorboard --logdir=logdir

Training using a pre-trained model

Training using a pre-trained model can lead to faster convergence.
By default, the dataset-dependent text embedding layers are ignored (see the sketch after the steps below).

  1. Download the published Tacotron model
  2. python train.py -o outdir -l logdir -c tacotron_statedict.pt --warm_start
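A sketch of what --warm_start typically does, assuming an ignore_layers hparam that lists the dataset-dependent layers (e.g. the text embedding table). The names below follow common Tacotron codebases and are assumptions, not this repo's exact API:

    import torch

    def warm_start_model(checkpoint_path, model, ignore_layers):
        # Load the pre-trained weights on CPU to avoid device mismatches
        state_dict = torch.load(checkpoint_path, map_location='cpu')['state_dict']
        if ignore_layers:
            # Drop dataset-dependent layers such as the text embedding table
            state_dict = {k: v for k, v in state_dict.items()
                          if k not in ignore_layers}
            # Keep the freshly initialised weights for the dropped layers
            merged = model.state_dict()
            merged.update(state_dict)
            state_dict = merged
        model.load_state_dict(state_dict)
        return model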

Multi-GPU (distributed) Training

  1. python train.py -o outdir -l logdir --hparams=distributed_run=True
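Under the hood, distributed_run=True typically launches one process per GPU and initialises torch.distributed; a minimal sketch of that initialisation (the backend, address, and names are assumptions, not necessarily this repo's values):

    import torch
    import torch.distributed as dist

    def init_distributed(rank, n_gpus):
        # One process per GPU; NCCL is the usual backend for multi-GPU training
        torch.cuda.set_device(rank)
        dist.init_process_group(backend='nccl',
                                init_method='tcp://localhost:54321',
                                world_size=n_gpus, rank=rank)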

Inference demo

  1. Download the published Tacotron model
  2. Download the published WaveGAN model
  3. jupyter notebook --ip=127.0.0.1 --port=31337
  4. Load inference.ipynb
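A rough sketch of the text-to-mel half of inference, assuming module and function names common to Tacotron codebases (the imports, checkpoint key, and cleaner name are all assumptions; inference.ipynb is the authoritative reference):

    import torch
    from hparams import create_hparams          # assumed module layout
    from model import Tacotron                  # assumed class name
    from text import text_to_sequence           # assumed text frontend

    hparams = create_hparams()
    model = Tacotron(hparams)
    model.load_state_dict(torch.load('tacotron_statedict.pt',
                                     map_location='cpu')['state_dict'])
    model.eval()

    # Convert text to a batch of symbol IDs and predict a mel spectrogram
    sequence = torch.LongTensor(
        text_to_sequence('Hello world.', ['english_cleaners']))[None, :]
    with torch.no_grad():
        outputs = model.inference(sequence)  # mel spectrogram(s) + alignments
    # The predicted mel spectrogram is then passed to the vocoder for audio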

Note: When performing mel-spectrogram-to-audio synthesis, make sure Tacotron and the mel decoder were trained on the same mel-spectrogram representation.
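In practice, "the same representation" means the STFT and mel-filterbank parameters in hparams.py must match between the two models. The names and values below are illustrative assumptions, not this repo's defaults:

    # hparams.py (illustrative; must match between Tacotron and the vocoder)
    sampling_rate = 22050   # Hz
    filter_length = 1024    # STFT FFT size
    hop_length = 256        # STFT hop size (samples)
    win_length = 1024       # STFT window size (samples)
    n_mel_channels = 80     # number of mel filterbank channels
    mel_fmin = 0.0          # lowest mel band edge (Hz)
    mel_fmax = 8000.0       # highest mel band edge (Hz)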

Acknowledgements

This implementation uses code from the following repos as described in our code.