Skip to content

[question] training chinese tacotron2 model with my own wavs #347

@hudsonx1123

Description

@hudsonx1123

Hi,
I try to train chinese tacotron2 model with baker.
I replaced the wav files with my own wav files and re-define 000001-010000.txt following original format.
It's ok in preprocess and traing progress, but something is wrong when I copy the model to sample baker colab for test.
Input is about 20 chinese-words sentence.
It is just about 6 sec sentnece but Tacotron2 + MB-MelGAN generate 1:39 wav(as attachated image).
It can correctly speak the sentence in the first 6 sec but strange sound in the other duration.
any suggestion about this problem??
Thanks a lot!!
ScreenHunter 51

Metadata

Metadata

Assignees

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions