
About learning rate setting #22

Closed
bityangke opened this issue Jun 16, 2018 · 5 comments

Comments

@bityangke

Hi Yi,
How did you decide the lr step schedule?
Did you follow another work, or did you tune it yourself through experiments?
Thanks in advance!

@bryanyzhu
Owner

Hi, I decided the lr step according to this paper.
But usually in my experiments, I just watch for when the loss/accuracy saturates, and then decay the lr. I find this more effective.
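For reference, this saturation-based strategy maps naturally onto PyTorch's ReduceLROnPlateau scheduler. A minimal sketch (the toy model, data, and hyperparameters below are placeholders, not this repo's actual training code):

```python
import torch
import torch.nn as nn
import torch.optim as optim
from torch.optim.lr_scheduler import ReduceLROnPlateau

# Toy model and data for illustration only.
model = nn.Linear(10, 2)
criterion = nn.CrossEntropyLoss()
optimizer = optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

# Cut the lr by 10x once the monitored loss stops improving.
scheduler = ReduceLROnPlateau(optimizer, mode='min', factor=0.1, patience=5)

inputs = torch.randn(64, 10)
targets = torch.randint(0, 2, (64,))

for epoch in range(30):
    optimizer.zero_grad()
    loss = criterion(model(inputs), targets)
    loss.backward()
    optimizer.step()
    # In real training this would be a held-out validation loss.
    scheduler.step(loss.item())
```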

@bityangke
Author

Thanks very much.

@bityangke bityangke reopened this Jun 17, 2018
@bityangke
Author

bityangke commented Jun 17, 2018

Hi Yi,
I noticed that Yuanjun used a batch size of 16x4 (16 samples on each card) with an iter_size of 4, so the effective "batch size" is 256. They used step sizes of 4000, 8000, and 10000 (iterations), so the corresponding batch steps should be 1000, 2000, and 2500, and the epoch counts should be about 27, 54, and 67.
Am I right?
Thanks!
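The arithmetic behind those numbers, as a quick sketch (this assumes UCF101 split 1 with roughly 9,537 training clips, which the thread does not state explicitly):

```python
# Iteration-to-epoch conversion for the schedule discussed above.
# Assumption (not stated in the thread): UCF101 split 1, ~9537 training clips.
train_clips = 9537
per_gpu, gpus, iter_size = 16, 4, 4
effective_batch = per_gpu * gpus * iter_size  # 16 * 4 * 4 = 256

for iters in (4000, 8000, 10000):
    batch_steps = iters // iter_size                      # 1000, 2000, 2500
    epochs = batch_steps * effective_batch / train_clips
    print(iters, batch_steps, round(epochs))              # ~27, ~54, ~67
```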

@bryanyzhu
Owner

Hi, I am so sorry for the slow response; I am at CVPR this week.

I am also using a similar iter_size strategy here. So my corresponding batch steps are still 4000, 8000, and 10000, which is about 100, 200, and 250 epochs. Hope this is clear.
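For anyone unfamiliar with iter_size: it accumulates gradients over several small mini-batches and applies one optimizer update per group, emulating a larger batch. A minimal PyTorch sketch (the model, data, and sizes are illustrative stand-ins, not this repo's code):

```python
import torch
import torch.nn as nn
import torch.optim as optim

# Illustrative stand-ins, not this repo's actual model or data loader.
model = nn.Linear(10, 2)
criterion = nn.CrossEntropyLoss()
optimizer = optim.SGD(model.parameters(), lr=0.01, momentum=0.9)

iter_size = 4  # accumulate 4 mini-batches per optimizer step

optimizer.zero_grad()
for i in range(100):  # each i is one mini-batch (one "iteration")
    inputs = torch.randn(16, 10)
    targets = torch.randint(0, 2, (16,))
    loss = criterion(model(inputs), targets)
    # Scale so the accumulated gradient averages over iter_size batches.
    (loss / iter_size).backward()
    if (i + 1) % iter_size == 0:
        optimizer.step()       # one "batch step" every iter_size iterations
        optimizer.zero_grad()
```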

@bityangke
Author

Thank you!
Hope you have a great time at CVPR!
