Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

learning_rate becomes nan in rtho.py #7

Open
stevenyu530 opened this issue May 5, 2021 · 1 comment
Open

learning_rate becomes nan in rtho.py #7

stevenyu530 opened this issue May 5, 2021 · 1 comment

Comments

@stevenyu530
Copy link

Team,

I am trying to run your sample script ./bin/rtho.py, using the sample command in README.
python bin/rtho.py --network vgg --dataset cifar_10 --optimizer sgd --momentum 0.9 --hyper-lr 1e-8

After seeing val_cacurace increased in the first few epochs, learning_rate became nan. and training accuracy stuck at 10.0.
Sample output is attached at the end.

Running baseline.py using the provided command runs fine. Best val_accuracy will reach good value.

Can you help to advise what is causing the learning rate nan issue in rtho.py ?
Is is behaviour expected for the methods?

Screenshot 2021-05-05 at 12 25 27

@orchidmajumder
Copy link
Contributor

It will most likely be due to the other hyperparameters. I recommend using a lower value for those.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants