
How many GPUs did you use? #48

Closed
Tord-Zhang opened this issue Sep 15, 2020 · 4 comments

Comments

@Tord-Zhang

Hi, since MidasNet is a very large model, how many GPUs did you use and how long did it take to train? Since the batch size is not large (8 per dataset), would multi-GPU training hurt performance, given that there are a lot of batch normalization layers in the encoder?

Thanks.

@ranftlr
Collaborator

ranftlr commented Sep 15, 2020

The model was trained on 2 Quadro RTX 6000s for about 6 days: 1 day for pre-training on RedWeb + about 5 days for training on MIX5. Multi-GPU with low batch size was not an issue, because we froze all batch norm layers.

@Tord-Zhang
Author

@ranftlr thanks for your quick response. I am training MidasNet on 2 Tesla V100 GPUs, but it's much slower. So the mean and variance of the BN layers are not updated during training?

@ranftlr
Collaborator

ranftlr commented Sep 16, 2020

Correct, mean and variance are fixed and are not updated. I can't comment on the speed on a V100 as I don't have any available.
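
For context, a minimal PyTorch sketch of what freezing the batch-norm layers typically looks like is below. This is an illustration only, not the actual MiDaS training code; the `freeze_batchnorm` helper and the stand-in network are hypothetical.

```python
import torch
import torch.nn as nn

def freeze_batchnorm(model: nn.Module) -> None:
    """Put every batch-norm layer into eval mode (so the running mean and
    variance stay fixed) and stop gradient updates to its affine parameters."""
    for module in model.modules():
        if isinstance(module, (nn.BatchNorm1d, nn.BatchNorm2d, nn.BatchNorm3d)):
            module.eval()
            for param in module.parameters():
                param.requires_grad = False

# Stand-in network; in practice this would be the MidasNet encoder.
net = nn.Sequential(
    nn.Conv2d(3, 16, 3, padding=1),
    nn.BatchNorm2d(16),
    nn.ReLU(),
)

net.train()            # train() flips BN layers back to training mode,
freeze_batchnorm(net)  # so the freeze must be re-applied after calling train()

x = torch.randn(2, 3, 64, 64)
out = net(x)           # BN now uses its stored running statistics
```

Because the BN statistics and affine parameters are never updated, each GPU's small per-device batch no longer affects normalization, which is why the low batch size is not a problem in a multi-GPU setup.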

@ranftlr ranftlr closed this as completed Oct 20, 2020
@mathmax12

@Tord-Zhang
I am reproducing the training process on one V100. I am pretty new to this. I wonder how much data was used and how long the training takes?
Thanks
