You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thanks for the wonderful work. May I ask a question, when I pretrain LSTM model with the default settings, the memory is overflow. My server has 180G RAM, so may I ask how much RAM needed for pretraining?
Thanks and best regards.
The text was updated successfully, but these errors were encountered:
HI @shangqing-liu -- apologies about the delay. I am a coauthor on this work.
Can I ask what GPU you are using to train this model? Are you encountering a CUDA malloc error (out of memory on GPU) or host-side out of memory error?
At the moment, the dataloader reads the whole dataset in RAM before training. The pre-training dataset is quite large (almost 20GB) so this can be expensive. We pre-trained our models on machines with around 256GB of RAM. However, we didn't encounter OOMs.
If you could provide some more details on your setup, I can help debug!
Hi,
Thanks for the wonderful work. May I ask a question, when I pretrain LSTM model with the default settings, the memory is overflow. My server has 180G RAM, so may I ask how much RAM needed for pretraining?
Thanks and best regards.
The text was updated successfully, but these errors were encountered: