Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

loss explode question #3

Closed
junkeon opened this issue Sep 22, 2020 · 7 comments
Closed

loss explode question #3

junkeon opened this issue Sep 22, 2020 · 7 comments

Comments

@junkeon
Copy link

junkeon commented Sep 22, 2020

Hello, Thanks for your work. it is very helpful to me.
I have a question about loss exploding
I tried to train ljspeech data which you used with default setting (lr = 1e-3) and i had NaN loss issue
So i reduced lr to 5e-4 then there is no NaN loss isuue but loss exploding (normal loss : < 0.07, exploded loss : > 732M)
I know there are codes for prevent loss exploding like lr schedule, clipping however it is not working
Can you help me?

@ivanvovk
Copy link
Owner

Hello, sure. Try decrease LR scheduler gamma a bit in the first place. From 0.95 to 0.9 with default initial LR (1e-3). I used 0.95 and it was kinda borderline between normal training and training with loss explosion. And please report if it helps or not.

@junkeon
Copy link
Author

junkeon commented Sep 23, 2020

Thank you for your advice, i will try and report it!

@ivanvovk
Copy link
Owner

Any updates, @junkeon ?

@junkeon
Copy link
Author

junkeon commented Sep 28, 2020

As your advice, i decreased gamma varies from 0.9 to 0.5 and there are all loss explode
But it seems okay with lr = 5e-4 and gamma = 0.9 on LJ speech data
In that setting, i got l1 loss slight above 0.1 and the synthesized quality is fine

@ivanvovk
Copy link
Owner

Thank you for your feedback. And what was your batch size?

@junkeon
Copy link
Author

junkeon commented Sep 28, 2020

sorry for late the previous feedback...
I can't remember the exact the number, it might be 48, the default setting
I can give you the number tomorrow (i am not allowed to access the server from outside), can't i?

@ivanvovk
Copy link
Owner

No-no, thank you very much, its enough information. I'm just trying to figure out why I had no loss explode problems with default setting on my server. However, I am glad that you managed to finish training, finally. Since the issue is solved, I'm closing it.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants