About hyper-parameters #8

jelleopard · 2021-12-10T02:12:25Z

Some hyper-parameter settings in the code (train_script.sh) appear to be inconsistent with those in the paper: learning rate (0.027 vs. 0.08), loss weight \lambda_{2} (0.05 vs. 0.03), batch size (900 vs. 1024), milestones (48 & 64 vs. 30 & 40), number of epochs (50 vs. 80), and lr decay (0.2 vs. 0.1).
These values are adjustable, of course, but they matter for the experiments. How should I set them with the mobilenetv2 backbone to reproduce the performance you report?
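
To make the interaction between learning rate, milestones, and lr decay concrete, here is a minimal sketch of a multi-step decay schedule in plain Python. It assumes the common convention that the learning rate is multiplied by the decay factor once an epoch index reaches each milestone; whether this repository's training code follows exactly that convention is an assumption, and `step_lr` is a hypothetical helper, not a function from the code base.

```python
from bisect import bisect_right


def step_lr(epoch, base_lr, milestones, gamma):
    """Learning rate at a 0-indexed epoch under a multi-step decay schedule.

    Hypothetical helper for illustration only; it is not taken from the
    repository discussed in this issue.
    """
    # Count the milestones already reached and decay once per milestone.
    reached = bisect_right(sorted(milestones), epoch)
    return base_lr * gamma ** reached


if __name__ == "__main__":
    # The two value sets quoted in the comment above, grouped by their
    # position in each "a vs. b" pair.
    settings = [
        {"base_lr": 0.027, "milestones": [48, 64], "gamma": 0.2, "epochs": 50},
        {"base_lr": 0.08, "milestones": [30, 40], "gamma": 0.1, "epochs": 80},
    ]
    for cfg in settings:
        final_epoch = cfg["epochs"] - 1
        final_lr = step_lr(final_epoch, cfg["base_lr"], cfg["milestones"], cfg["gamma"])
        print(cfg, "-> lr at final epoch:", final_lr)
```

Under this convention, a milestone at or beyond the total number of epochs never triggers, so the two groups of values would produce quite different schedules.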

choyingw · 2021-12-10T06:34:46Z

The checkpoint we released was trained with the settings described in the paper. The default hyper-parameters in the code are values we tried recently while exploring further parameter tuning. I'll change the defaults back to those in the paper.

jelleopard · 2021-12-29T04:44:03Z

Thank you for your response. I have no further questions, so I will close this issue.

jelleopard closed this as completed Dec 29, 2021
