Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

About hyper-parameters #8

Closed
jelleopard opened this issue Dec 10, 2021 · 2 comments
Closed

About hyper-parameters #8

jelleopard opened this issue Dec 10, 2021 · 2 comments

Comments

@jelleopard
Copy link

It seems that some hyper-parameter settings in the code (train_script.sh) are inconsistent with those in the paper, for example, learning rate (0.027 vs. 0.08), loss weight \lambda_{2} (0.05 vs. 0.03), batch size (900 vs. 1024), milestones (48 & 64 vs. 30 & 40), epoch number (50 vs. 80) and lr decay (0.2 vs. 0.1).
Of course these numbers are adjustable but important in the experiments. I want to know how to set these hyper-parameters with backbone mobilenetv2 to get a good performance as yours.

@choyingw
Copy link
Owner

choyingw commented Dec 10, 2021

The checkpoint we released is trained by the setting described in the paper. The default hyper-parameter in the code is something we tried recently for exploring more parameter tuning. I'll change the default back to those in the paper.

@jelleopard
Copy link
Author

Thank you for your response. I have no problems, so I will close this issue.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants