Clarify exactly which models of GPUs were used in benchmarks #87
Comments
The A100s are on Perlmutter. They're 40 GB, 1410 MHz versions.
Maybe we should capture the output of
The datasheet says there's a bunch of flavors of A100:
The only differences between them are the amount of memory (40 or 80 GB) and the form factor (PCIe or SXM). Neither of those should make any difference in speed. Here's what nvidia-smi reports on the login node with the GPU idle.
Comparing to what you posted in #86 (comment), the max clock rates for graphics, SM, and video are the same, but the memory clock is slightly lower. Other factors that can affect performance are the type of bus (PCIe or NVLink, and the particular version of either one), the cooling system (which influences whether it can actually sustain the maximum clock rate), the bus topology (mainly for multi-GPU benchmarks), and the CPU type (it's not a huge effect for GPU benchmarks, but it does make a difference).
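If we want to record these details alongside each benchmark run, something like the following might work. It's only a rough sketch: the helper names (`parse_gpu_csv`, `capture_gpu_info`) and the choice of fields are my own assumptions, not anything the benchmark scripts currently do. It relies on `nvidia-smi --query-gpu` with `--format=csv,noheader`, and it has to run on the node that actually executes the benchmark.

```python
import csv
import io
import subprocess

# Fields passed to `nvidia-smi --query-gpu`. Which ones are worth recording
# is an assumption; adjust the list to whatever the benchmarks should report.
FIELDS = [
    "name",
    "memory.total",
    "clocks.max.graphics",
    "clocks.max.sm",
    "clocks.max.memory",
    "pcie.link.gen.max",
    "pcie.link.width.max",
    "driver_version",
]


def parse_gpu_csv(text, fields=FIELDS):
    """Turn `--format=csv,noheader` output into one dict per GPU."""
    rows = csv.reader(io.StringIO(text), skipinitialspace=True)
    # Blank lines come back as empty rows and are skipped.
    return [dict(zip(fields, (v.strip() for v in row))) for row in rows if row]


def capture_gpu_info(fields=FIELDS):
    """Run nvidia-smi on the current node and return the parsed records."""
    cmd = [
        "nvidia-smi",
        "--query-gpu=" + ",".join(fields),
        "--format=csv,noheader",
    ]
    out = subprocess.run(cmd, capture_output=True, text=True, check=True).stdout
    return parse_gpu_csv(out, fields)
```

Calling `capture_gpu_info()` at the start of a run and saving the result next to the timings would cover the model, memory size, and max clocks. It doesn't capture the bus topology or the CPU type; `nvidia-smi topo -m` prints the topology matrix and could be saved as plain text separately.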
There seems to be significant variation in the performance of different models/variants of the same GPU (e.g. the multiple variants of A100 available), so we should provide more details in our benchmarks about exactly which model(s) were used.