Only 1 GPU is used for training #11
**Original issue:**

I noticed that only 1 GPU is used to train TransPose-R-A4, with lr=0.0001.
Should I change the lr if I want to use 4 or 8 GPUs, or keep it the same?
Thanks for your reply.

**Comments:**

> I only got 73.7 AP.

> From my experience, the performance of TransPose-R models is very sensitive to the initial learning rate. I did not train TransPose-R-A4 on 4 or 8 GPUs. I suggest you increase the initial learning rate a little under such conditions (with a larger batch size). Please let me know the results if you have tried such experiments.

> @yangsenius @douyh

> Thanks for sharing the results! Happy to see that this brings a performance improvement. @EckoTan0804 Larger batch sizes with more GPUs empirically bring performance improvement. The learning rate setting of DeiT --
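The advice above (increase the initial learning rate when training with more GPUs and a larger effective batch size) is often implemented as the linear scaling rule. The sketch below illustrates that rule only; the function name and batch-size numbers are illustrative assumptions, not values from the TransPose repo, and the maintainer only suggests increasing the lr "a little bit", so treat linear scaling as an upper-bound starting point to tune from.

```python
# Hypothetical sketch of the linear scaling rule: scale the base learning
# rate by the ratio of the new effective batch size to the baseline batch
# size. All names and numbers here are assumptions for illustration.

def scaled_lr(base_lr: float, base_batch_size: int,
              num_gpus: int, per_gpu_batch_size: int) -> float:
    """Scale base_lr linearly with the effective (global) batch size."""
    effective_batch = num_gpus * per_gpu_batch_size
    return base_lr * effective_batch / base_batch_size

# Example: baseline 1 GPU with batch size 32 and lr=1e-4; moving to
# 4 GPUs with the same per-GPU batch size quadruples the effective batch,
# so the linear rule quadruples the lr as well.
print(scaled_lr(1e-4, 32, 4, 32))  # -> 0.0004
```

In practice one would start below this linearly scaled value (e.g. somewhere between the original lr and the scaled one) and tune, since the thread reports that these models are sensitive to the initial learning rate.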