System Info
transformers version: 4.41.2
Who can help?
No response
Information
Tasks
examples folder (such as GLUE/SQuAD, ...)
Reproduction
The above code runs without error on my L4 GPUs. However, when I download the weights from the Llama3 repository, convert the weights using
and run this code -
I get a CUDA OOM error. I also tried the same code on an 8x V100 32GB machine, with the same result. Changing the ZeRO optimization stage makes no difference either.
Expected behavior
Loading the downloaded and converted weights should use the same amount of CUDA memory as loading the weights from the Hugging Face repository.