ValueError: Out of range float values are not JSON compliant #1589
Comments
@martsalz Hi, can you log the value of logs["val_loss"]? I suspect that the value of logs["val_loss"] is out of range for the failed trial. If the learning rate is too large, the loss value can become 'nan'.
Because the experiments take a few hours or days and the error seems to occur only sporadically, it is not easy to reproduce.
@chicm-ms Yes, the trial log shows a nan loss right before the error:
[11/22/2019, 12:05:09 PM] PRINT - ETA: 0s - loss: nan
[11/22/2019, 12:05:13 PM] ERROR (mnist_keras/MainThread) Out of range float values are not JSON compliant
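For readers hitting the same message, the sketch below reproduces it with Python's standard `json` module only. It assumes the metric ends up in a strict JSON encoder (one called with `allow_nan=False`); whether nni serializes metrics exactly this way is an assumption, and `try_serialize` is a hypothetical helper, not part of nni.

```python
import json


def try_serialize(metric):
    """Return the JSON string, or the error text if strict encoding fails."""
    try:
        # allow_nan=False makes the encoder reject nan, inf and -inf,
        # because none of them can be represented in standard JSON.
        return json.dumps({"value": metric}, allow_nan=False)
    except ValueError as err:
        return f"ValueError: {err}"


print(try_serialize(0.25))          # a normal loss value serializes fine
print(try_serialize(float("nan")))  # a nan loss triggers the ValueError
print(try_serialize(float("inf")))  # so does an overflowed (infinite) loss
```

A finite loss serializes normally, while `nan` and `inf` both raise the `ValueError` quoted in the issue title (newer Python versions may append the offending value to the message).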
How can this bug be fixed quickly? In my experiment with
We are trying to fix this with PR #1958. A learning rate that is too large can lead to a nan loss value, so a quick fix is to check your trial code / search space and set the learning rate to a smaller value. Since the loss value of the failed jobs are
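Besides lowering the learning rate, a trial can guard the value before reporting it. The following is only a sketch of that idea: `finite_or` and `WORST_LOSS` are hypothetical names introduced here, and the choice of fallback value is an assumption that depends on whether the tuner minimizes or maximizes the metric.

```python
import math


def finite_or(value, fallback):
    """Return value as a float if it is finite, otherwise the fallback."""
    try:
        number = float(value)
    except (TypeError, ValueError):
        # Missing or non-numeric metric (e.g. None): use the fallback.
        return fallback
    return number if math.isfinite(number) else fallback


# Assumption: a large finite number marks a diverged trial as "very bad"
# for a tuner that minimizes the loss.
WORST_LOSS = 1e9

logs = {"val_loss": float("nan")}  # what a diverged epoch might produce
reported = finite_or(logs.get("val_loss"), WORST_LOSS)
print(reported)

# In a trial, the sanitized value would then be handed to the reporting call,
# for example nni.report_final_result(reported).
```

Reporting a large finite loss keeps the trial result serializable and still tells the tuner that this hyperparameter combination performed poorly.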
Closing this issue since the problem is fixed in nni v1.4. @martsalz, you can check our latest nni version.
Short summary about the issue/question:
When executing an experiment, the following error message appears for some trials:
ValueError: Out of range float values are not JSON compliant
What's the reason for this?
nni Environment:
Anything else we need to know:
stderr: