synthesized speaker quality changed #46

Open
kannadaraj opened this issue Mar 12, 2020 · 5 comments

Comments

@kannadaraj

kannadaraj commented Mar 12, 2020

Thanks for sharing the repo. I have trained the model using this repo on LJ Speech, and I am performing inference using only GST. During inference, I use an out-of-dataset file as the style file. The speaker identity of the synthesized audio changes considerably: the quality is decent, but it doesn't sound like the original LJ Speech speaker. How can I fix that? Can anyone please help? Thanks.

@rafaelvalle
Contributor

Are you selecting one style token or using a sound file to sample the style tokens?
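
For readers following along, the two options in this question can be sketched roughly as below. This is only an illustration of the general GST idea (a style embedding taken either from one token or from a weighted mix of tokens), not this repo's code. The function names, the toy token values, and the single-head dot-product attention are all assumptions made for the example; in a real model the tokens are learned and the query comes from a reference encoder run on the style audio.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def style_from_single_token(tokens, index, scale=1.0):
    """Option 1: condition on one hand-picked style token (optionally scaled)."""
    return [scale * v for v in tokens[index]]


def style_from_reference(tokens, query):
    """Option 2: mix all tokens using attention weights.

    `query` stands in for the output of a reference encoder applied to a
    style audio file (hypothetical here; simplified to single-head
    dot-product attention).
    """
    scores = [sum(q * t for q, t in zip(query, tok)) for tok in tokens]
    weights = softmax(scores)
    dim = len(tokens[0])
    style = [sum(w * tok[d] for w, tok in zip(weights, tokens)) for d in range(dim)]
    return style, weights


# Toy 2-dimensional "style tokens" (made-up values for illustration only).
tokens = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]

single = style_from_single_token(tokens, index=1)
mixed, weights = style_from_reference(tokens, query=[0.5, 0.5])
print(single, mixed, weights)
```

With the second option, the resulting style embedding depends on whatever the reference audio encodes, which is why the choice of style file matters for this kind of question.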

@kannadaraj
Author

@rafaelvalle: Sorry for the late reply. I am training with a single-speaker dataset, and I am using a sample file from the same dataset.

@rafaelvalle
Contributor

Do the attention maps look correct?

@kannadaraj
Author

Yes, the attention maps look good, with a clean diagonal line.

@rafaelvalle
Contributor

Can you share mel-spectrograms, audio files and attention plots?


2 participants