-
Notifications
You must be signed in to change notification settings - Fork 168
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
train_shallow_layer.py doesn't train correctly #18
Comments
I've confirmed that there is an issue with Any ideas on what might be causing |
Hi @hanrelan That may be caused by using improper learning rate. In Thanks! Wonseok |
Ah, makes sense, thanks. I'll try it again tonight and close this issue if it works |
That fixed it, getting 78% lx accuracy now. Thanks! |
I'm trying to train the shallow-layer model and after 4-5 epochs I'm still seeing
acc_lx
close to zero. Is that normal? If you have an example training run log and the associated losses, that would be great. I want to make sure that something isn't broken before letting it train for a couple days.The loss actually doesn't seem to change at all between epochs so I think training isn't happening, but I haven't modified the source code other than the paths.
Including my training configuration and output log below. I have an 11GB GPU so I had to change the batch size and gradient accumulation to prevent out of memory errors.
The text was updated successfully, but these errors were encountered: