A PyTorch implementation of Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis.
- Python 3.5.2
- PyTorch 1.0.0
Extract data_thchs30.tgz and generate features:
$ python extract.py
$ python pre_process.py
$ python train.py
If you want to visualize during training, run in your terminal:
$ tensorboard --logdir runs
Generate mel-spectrogram for text "相对论直接和间接的催生了量子力学的诞生 也为研究微观世界的高速运动确立了全新的数学模型"
$ python demo.py