the synthesis result is bad when using pretrain model #6

mnfutao · 2021-09-24T09:53:01Z

hello sir, thanks for your sharing.

i meet a problem when i using pretrain model to synthsize demo file. the effect of synthesized wav is so bad.

do you konw what problem happened?

pretrain_model: output/ckpt/LibriTTS_meta_learner/200000.pth.tar
ref_audio: ref_audio.zip
demo_txt: {Promises are often like the butterfly, which disappear after beautiful hover. No matter the ending is perfect or not, you cannot disappear from my world.}
demo_wav:demo.zip

keonlee9420 · 2021-09-27T02:31:21Z

Hi @mnfutao , thanks for sharing results. It is mainly because of the sampling rate where 22050Hz is used in this repo but the paper used 16kHz. This makes the model capacity relatively smaller, and hence the output quality is degraded. In my point of view, there are two options: 1. increase model size, 2. training model with more steps than 200k. It might work at certain level and increase the output quality.

mnfutao · 2021-10-08T03:19:02Z

thanks， i will try it with increasing model size.
i have already trained the model more than 200k， but the quality is still bad :(

keonlee9420 · 2021-10-14T10:40:26Z

That's bad news. Please let me know if increasing the model size helps.

keonlee9420 · 2021-11-18T01:37:38Z

Close due to inactivity.

keonlee9420 closed this as completed Nov 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

the synthesis result is bad when using pretrain model #6

the synthesis result is bad when using pretrain model #6

mnfutao commented Sep 24, 2021

keonlee9420 commented Sep 27, 2021

mnfutao commented Oct 8, 2021

keonlee9420 commented Oct 14, 2021

keonlee9420 commented Nov 18, 2021

the synthesis result is bad when using pretrain model #6

the synthesis result is bad when using pretrain model #6

Comments

mnfutao commented Sep 24, 2021

keonlee9420 commented Sep 27, 2021

mnfutao commented Oct 8, 2021

keonlee9420 commented Oct 14, 2021

keonlee9420 commented Nov 18, 2021