[question] Training a Chinese Tacotron2 model with my own wavs #347

Hi,
I am trying to train a Chinese Tacotron2 model on the Baker dataset.
I replaced the wav files with my own recordings and rewrote 000001-010000.txt in the original format.
Preprocessing and training run fine, but something goes wrong when I copy the model into the sample Baker Colab notebook for testing.
The input is a sentence of about 20 Chinese words.
It should take only about 6 seconds to speak, but Tacotron2 + MB-MelGAN generates a 1:39 wav (see the attached image).
The first 6 seconds speak the sentence correctly, but the rest of the audio is strange sound.
Any suggestions about this problem?
Thanks a lot!
![ScreenHunter 51](https://user-images.githubusercontent.com/45844949/98065922-904cac80-1e90-11eb-926f-b94a9e069b35.png)
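
To make the symptom easier to check, here is a minimal sketch in plain NumPy. It is not the library's API. It assumes the Tacotron2 inference call returns a mel spectrogram of shape `(frames, mel_bins)` plus one stop probability per mel frame, and both helper names (`trim_at_stop`, `mel_duration_seconds`) are hypothetical. If no frame ever crosses the threshold, that may suggest the decoder never predicted a stop and simply ran on past the end of the sentence.

```python
import numpy as np


def trim_at_stop(mel, stop_probs, threshold=0.5):
    """Keep mel frames up to and including the first predicted stop frame.

    Assumption: stop_probs holds one probability per mel frame.
    """
    mel = np.asarray(mel)
    stop_probs = np.asarray(stop_probs).reshape(-1)
    hits = np.flatnonzero(stop_probs > threshold)
    if hits.size == 0:
        # No stop was ever predicted: return the mel unchanged.
        return mel
    return mel[: hits[0] + 1]


def mel_duration_seconds(n_frames, hop_size, sampling_rate):
    """Audio length implied by a number of mel frames."""
    return n_frames * hop_size / sampling_rate


if __name__ == "__main__":
    # Dummy data only; real values would come from the model output.
    mel = np.zeros((1000, 80))
    stop_probs = np.zeros(1000)
    stop_probs[480] = 0.9
    trimmed = trim_at_stop(mel, stop_probs)
    # hop_size and sampling_rate here are assumed values; use the ones
    # from your own preprocessing config.
    print(trimmed.shape[0], mel_duration_seconds(trimmed.shape[0], 300, 24000))
```

Comparing the trimmed length with the full output length shows how much of the 1:39 wav comes after the point where the sentence should have ended.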

