Quick question about training time and compute #11

Closed
greeneggsandyaml opened this issue Mar 10, 2023 · 2 comments

Comments

@greeneggsandyaml

Hello, thanks for your great work. I have a quick question about training.

I'm trying to run training and getting an OutOfMemoryError on a single 32 GB GPU (V100). What hardware do you use for training? Also, with your compute setup, approximately how long does training take?

Thanks so much!

@shariqfarooq123
Collaborator

shariqfarooq123 commented Mar 10, 2023

Hi, thanks for appreciating our work!

For metric fine-tuning, we use 4 NVIDIA A100 GPUs to train our largest model (BEiT-L). Training time on NYU (~25k samples, 5 epochs) on 4 A100s (40 GB) is less than 2 hours.

Relative pre-training on 12 datasets (M12 from the paper) takes around 3-5 days on 8 RTX A6000-like GPUs. This gives us the MiDaS v3.1 models. Please refer to the MiDaS repo for more details.
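
Note for readers hitting the same single-GPU memory limit: the timings above assume the work is spread across several GPUs with data-parallel training. Below is a minimal, hypothetical sketch (not the actual ZoeDepth/MiDaS training script) of how a PyTorch model can be wrapped in DistributedDataParallel and launched across 4 GPUs with `torchrun --nproc_per_node=4 train_sketch.py`; the model, batch, and script name are placeholders.

```python
# train_sketch.py -- illustrative only; launch with:
#   torchrun --nproc_per_node=4 train_sketch.py
import os

import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP


def main():
    # torchrun starts one process per GPU and sets LOCAL_RANK for each.
    dist.init_process_group(backend="nccl")
    local_rank = int(os.environ["LOCAL_RANK"])
    torch.cuda.set_device(local_rank)

    # Stand-in model; the real depth network would be built here instead.
    model = torch.nn.Linear(10, 1).cuda(local_rank)
    model = DDP(model, device_ids=[local_rank])  # syncs gradients across the 4 GPUs

    optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)

    # Dummy batch; real training would iterate a DataLoader with a
    # DistributedSampler so each rank sees its own shard of the dataset.
    x = torch.randn(8, 10).cuda(local_rank)
    loss = model(x).mean()
    loss.backward()   # DDP all-reduces gradients here
    optimizer.step()

    dist.destroy_process_group()


if __name__ == "__main__":
    main()
```

Because DDP splits each global batch across processes, the per-GPU memory footprint drops roughly in proportion to the number of GPUs, which is one way around the 32 GB limit mentioned in the question.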

@greeneggsandyaml
Author

Brilliant, thanks for the quick response. It's great to hear that training is so fast.

I'll create another issue if I have any more questions, but this is resolved, so I'm closing the issue.
