This repository has been archived by the owner on Oct 31, 2023. It is now read-only.

pretrain a model with the MLM objective #7

Closed
liujiqiang999 opened this issue Feb 20, 2019 · 3 comments

Comments

@liujiqiang999

Hi, how many GPUs are used when training a model with the MLM objective?

@glample
Contributor

glample commented Feb 20, 2019

Hi,

In practice, we observed that bigger batches seem to help, and they significantly accelerate overall training. I would suggest using at least 8 GPUs, especially if you train a big model.
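For reference, the batch size that actually feeds one optimizer update under data-parallel training scales with the number of GPUs (and with gradient accumulation, if used). A minimal sketch of that arithmetic, with hypothetical names not taken from XLM's code:

```python
def effective_batch_size(per_gpu_batch, n_gpus, accum_steps=1):
    """Total examples contributing to one optimizer update under
    data-parallel training, optionally with gradient accumulation.
    All names here are illustrative, not XLM's actual parameters."""
    return per_gpu_batch * n_gpus * accum_steps

# e.g. 32 sequences per GPU on 8 GPUs -> 256 sequences per update
print(effective_batch_size(32, 8))
```

This is why 8 GPUs help: the same per-GPU batch yields an 8x larger effective batch per update.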

@glample glample closed this as completed Feb 21, 2019
@jiahuigeng

Have you successfully implemented gradient accumulation or multi-GPU settings?

@glample
Contributor

glample commented Apr 9, 2019

Not yet. It will be easy in the next version of PyTorch distributed, so we will wait for that.
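For readers unfamiliar with the technique being discussed: gradient accumulation averages gradients over several micro-batches before applying a single optimizer step, simulating a larger batch on limited hardware. A minimal framework-free sketch of the idea (the function and its arguments are hypothetical, not part of XLM or PyTorch):

```python
def sgd_with_accumulation(params, micro_batches, grad_fn, lr=0.1, accum_steps=4):
    """Average gradients over groups of `accum_steps` micro-batches,
    then apply one SGD update per group. `grad_fn(params, batch)`
    returns one gradient per parameter; all names are illustrative."""
    accum = [0.0] * len(params)
    for step, batch in enumerate(micro_batches, start=1):
        grads = grad_fn(params, batch)                       # per-micro-batch gradients
        accum = [a + g / accum_steps for a, g in zip(accum, grads)]
        if step % accum_steps == 0:                          # one optimizer step per group
            params = [p - lr * a for p, a in zip(params, accum)]
            accum = [0.0] * len(params)
    return params
```

In a real PyTorch loop the same pattern is usually written by scaling each loss by `1 / accum_steps`, calling `backward()` on every micro-batch, and calling `optimizer.step()` plus `optimizer.zero_grad()` only every `accum_steps` iterations.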
