OOM with A100 8*80G #125
Comments
When I change float32 to int8, I run into a different problem.
Silly me, thinking that I could run Grok on my two 3090 Tis :)
Clearly, the memory of this graphics card is still far from sufficient; the model is too large!
It takes 65 GB of GPU memory on each A100 80G.
H100 SXM5 NVLink GPU × 8; AMD 100-000000802 EPYC 9124 Genoa 9004 Series 16-core 3 GHz Server Processor × 2; 24 × 64GB DDR5 4800 ECC Reg Server Compatible Memory Kit (1.5TB total); Micron MTFDKCB960TFR-1BC1ZABYYR 7450 PRO 960 GB Solid State Drive, 2.5" Internal, U.3 (PCI Express NVMe 4.0 x4), Read Intensive, TAA Compliant. Total: $297,019.00 (without station/power units).
I can confirm that 512 GB of RAM and 4 × A100 40 GB are not enough for it.
You're so funny!
How can I run the demo case with random data?
I am using 8 × A100 80G GPUs and still get an OOM error.
I think it is because I start the case with fp16 or fp32. How do I use QW8Bit with random data?
Thanks~
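
For a rough sense of why fp32 runs out of memory on this setup, here is a back-of-envelope sketch of the weight footprint alone. It assumes the publicly stated 314B parameter count for Grok-1 and treats 1 GB as 10^9 bytes; the helper names `weight_gb` and `fits` are made up for illustration and are not part of the repo. Activations, the KV cache, and framework overhead are not counted, so real usage will be higher than these numbers.

```python
# Weights-only memory estimate; activations, KV cache and runtime
# overhead are NOT included, so treat "fits" as a necessary condition only.

N_PARAMS = 314e9  # assumption: Grok-1's published parameter count

# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"float32": 4, "float16": 2, "int8": 1}


def weight_gb(n_params: float, bytes_per_param: int) -> float:
    """Total weight size in GB (1 GB = 1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


def fits(n_params: float, bytes_per_param: int, n_gpus: int, gb_per_gpu: float) -> bool:
    """True if the weights alone fit in the combined GPU memory."""
    return weight_gb(n_params, bytes_per_param) <= n_gpus * gb_per_gpu


if __name__ == "__main__":
    for dtype, nbytes in BYTES_PER_PARAM.items():
        total = weight_gb(N_PARAMS, nbytes)
        print(
            f"{dtype}: {total:.0f} GB total, "
            f"{total / 8:.1f} GB per GPU across 8 GPUs, "
            f"fits 8 x 80 GB: {fits(N_PARAMS, nbytes, 8, 80)}, "
            f"fits 4 x 40 GB: {fits(N_PARAMS, nbytes, 4, 40)}"
        )
```

Under these assumptions, fp32 weights need far more than 8 × 80 GB, fp16 weights fit only on paper with almost nothing left for anything else, and int8 is the only one of the three that leaves real headroom on 8 × A100 80G. Even int8 weights do not fit on 4 × A100 40 GB, which matches the report above.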