
Training Time Required #81

Closed
kevaldoshi17 opened this issue Oct 14, 2021 · 4 comments

Comments

@kevaldoshi17

Hi,

I was trying to train the TimeSformer model from scratch on Kinetics-600, and the estimated training time was shown as ~9 days. The paper mentions a training time of roughly 440 V100 GPU-hours. My setup is 8x Titan V GPUs, so I expected the training time to be closer to 50 hours. What am I missing here?

@gberta
Contributor

gberta commented Oct 14, 2021

The numbers in the paper are reported on Kinetics-400, which is smaller than Kinetics-600. I haven't tested the code with Titan V GPUs so I can't really comment on that.

@kevaldoshi17
Author

Thanks for the quick reply. Just to confirm: Kinetics-400 on 8 V100 GPUs for 15 epochs should take around 50 hours, right?

@gberta
Contributor

gberta commented Oct 15, 2021

It should take around 55 hours, yes. Note that training will be significantly slower if you don't assign enough CPU processes for data loading. To the best of my knowledge, this shouldn't be a problem unless you are using SLURM.
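As a sanity check, the arithmetic behind that estimate can be sketched as a simple GPU-hours-to-wall-clock conversion (the 440 V100 GPU-hours figure comes from the paper; perfect scaling across GPUs is an idealizing assumption):

```python
def wall_clock_hours(gpu_hours: float, num_gpus: int) -> float:
    """Ideal wall-clock time, assuming perfect scaling across GPUs."""
    return gpu_hours / num_gpus

# 440 V100 GPU-hours spread over the 8-GPU setup discussed in this thread.
print(wall_clock_hours(440, 8))  # → 55.0
```

In practice, data-loading bottlenecks and imperfect scaling push this upward, which is consistent with the ~55-hour figure quoted above.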

@kevaldoshi17
Author

Yes, I was using SLURM and didn't set enough CPU processes. Thanks for the help!
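For context on the SLURM issue: a SLURM job only sees the CPUs it requested (e.g. via `--cpus-per-task`), so spawning more data-loading workers than that oversubscribes the allocation. A minimal stdlib sketch for checking the visible CPU count (Linux only; the per-GPU heuristic and `num_gpus` value are illustrative assumptions, not from this thread):

```python
import os

# Under SLURM, os.sched_getaffinity reports only the CPUs granted
# to this job, not the total CPUs on the node (Linux only).
available_cpus = len(os.sched_getaffinity(0))

# Hypothetical heuristic: cap data-loading workers per GPU by the
# CPUs actually visible to the job.
num_gpus = 8  # the setup discussed in this thread
workers_per_gpu = max(1, available_cpus // num_gpus)
print(f"{available_cpus} CPUs visible -> {workers_per_gpu} workers per GPU")
```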
