About Learning Rate Scheduler #19

1049451037 · 2022-01-17T09:18:43Z

❔Question

Why is the learning rate scheduler stepped after each epoch instead of after each batch in main.py?

Won't the lr change too slowly (and behave inconsistently across different dataset sizes)?
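To make the concern concrete, here is a minimal sketch in plain Python. It is not taken from main.py; the helper names (`cosine_lr`, `schedule`, `largest_jump`) and the sizes are hypothetical, and a plain cosine decay without warmup is assumed. It compares how coarse the lr updates are under the two stepping policies for a small and a large dataset.

```python
import math

def cosine_lr(progress, base_lr=1e-3, min_lr=0.0):
    """Cosine-annealed lr for a training progress value in [0, 1]."""
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))

def schedule(epochs, steps_per_epoch, per_iteration):
    """Return the lr used at every iteration of training."""
    total = epochs * steps_per_epoch
    lrs = []
    for step in range(total):
        if per_iteration:
            # lr is updated after every batch
            progress = step / total
        else:
            # lr is held constant within an epoch
            progress = (step // steps_per_epoch) / epochs
        lrs.append(cosine_lr(progress))
    return lrs

def largest_jump(lrs):
    """Largest lr change between two consecutive iterations."""
    return max(abs(a - b) for a, b in zip(lrs, lrs[1:]))

epochs = 10  # hypothetical epoch budget
for steps_per_epoch in (100, 1000):  # small vs. large dataset
    by_epoch = schedule(epochs, steps_per_epoch, per_iteration=False)
    by_iter = schedule(epochs, steps_per_epoch, per_iteration=True)
    print(steps_per_epoch, len(set(by_epoch)),
          largest_jump(by_epoch), largest_jump(by_iter))
```

Under these assumptions, per-epoch stepping yields only as many distinct lr values as there are epochs, and its largest jump does not shrink as the dataset grows, whereas the per-iteration jumps get smaller with more batches per epoch.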

Yuxin-CV · 2022-01-17T10:35:24Z

Hi, Qingsong. Thanks for this issue.

To my knowledge, in image recognition the lr is usually stepped once per epoch when a cosine lr scheduler is used, e.g., in the widely used timm library.

It seems that in NLP the lr scheduler is stepped after each iteration (batch), e.g., in the BEiT repo. This is also true for semantic segmentation in vision.

I agree with you that stepping per iteration is more reasonable than stepping per epoch, and it should yield results no worse than stepping per epoch.
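For readers who want to try per-iteration stepping, a minimal PyTorch sketch follows. It is not the code in main.py: the model, the batch data, and the sizes are hypothetical stand-ins, and `torch.optim.lr_scheduler.CosineAnnealingLR` is used here only as one readily available cosine scheduler. The key point is that the schedule length is counted in iterations and `scheduler.step()` is called inside the batch loop.

```python
import torch

# Hypothetical sizes, chosen only for illustration.
epochs = 3
steps_per_epoch = 4
base_lr = 0.1

model = torch.nn.Linear(8, 2)  # stand-in model
optimizer = torch.optim.SGD(model.parameters(), lr=base_lr)

# The schedule length is counted in iterations, not epochs.
total_steps = epochs * steps_per_epoch
scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=total_steps)

lrs = []
for epoch in range(epochs):
    for _ in range(steps_per_epoch):
        x = torch.randn(16, 8)          # random stand-in batch
        loss = model(x).pow(2).mean()   # dummy loss
        optimizer.zero_grad()
        loss.backward()
        optimizer.step()
        scheduler.step()                # once per batch, after optimizer.step()
        lrs.append(scheduler.get_last_lr()[0])
```

With this setup the lr decays smoothly over all `total_steps` iterations, so the shape of the schedule no longer depends on how many batches make up an epoch.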

1049451037 · 2022-01-17T10:59:34Z

Got it. Thank you for your reply!

1049451037 added the question label Jan 17, 2022

1049451037 closed this as completed Jan 17, 2022
