You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for the nice work and code :)
I have a quick question.
According to the paper, it seems like weight decay is set to 5e-4 for VGG & ResNet50 regardless of the dataset, while the default value in model_training_imagenet.py is 1e-4.
I guess the value reported in the paper is correct, but just wanted to make really sure about this.
Which number is correct one?
Thanks a lot
The text was updated successfully, but these errors were encountered:
Sorry for the confusion. Yes, I believe the hyper-parameters listed in the original paper should be the correct ones. Otherwise, you can just run both experiments, and the one closest to the reported performance is the correct one.
Thank you for the nice work and code :)
I have a quick question.
According to the paper, it seems like weight decay is set to 5e-4 for VGG & ResNet50 regardless of the dataset, while the default value in model_training_imagenet.py is 1e-4.
I guess the value reported in the paper is correct, but just wanted to make really sure about this.
Which number is correct one?
Thanks a lot
The text was updated successfully, but these errors were encountered: