Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about default weight decay in model_training_imagenet.py #7

Closed
DongHwanJang opened this issue Nov 2, 2021 · 1 comment
Closed

Comments

@DongHwanJang
Copy link

DongHwanJang commented Nov 2, 2021

Thank you for the nice work and code :)
I have a quick question.
According to the paper, it seems like weight decay is set to 5e-4 for VGG & ResNet50 regardless of the dataset, while the default value in model_training_imagenet.py is 1e-4.

I guess the value reported in the paper is correct, but just wanted to make really sure about this.
Which number is correct one?

Thanks a lot

@lorenmt
Copy link
Owner

lorenmt commented Nov 2, 2021

Hello.

Sorry for the confusion. Yes, I believe the hyper-parameters listed in the original paper should be the correct ones. Otherwise, you can just run both experiments, and the one closest to the reported performance is the correct one.

@lorenmt lorenmt closed this as completed Nov 6, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants