New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KeyError: 'validation/main/loss' in pytorch when use multiple gpu in training #1723
Comments
The problem seems to be in this code snippet:
when I comment out these code, it can be trained normally. But can't save the best model and can't show the "main/loss_ctc", "main/loss_att", "main/acc", and "main/loss" in the log file. |
What kind of multiple GPU environments are you using? |
thanks for your reply!
|
OK. I'm concerning the following warning.
Could you try it with an older version of pytorch? |
I have try many older version of pytorch include:
but all have the same problem. |
Many thanks. |
I just tested it and I did not get any errors.
|
Thank you very mush! |
Cool! |
hi,
I use aishell recipe and running asr_train.py. When I use single GPU, it work well. However, when I use 2GPU, it finish training at the end of first epoch and throw the error: KeyError: 'validation/main/loss' error.
How to fix this?
The text was updated successfully, but these errors were encountered: