Problem about bandwidth test #6
Comments
At 1024 * 1024 elements, the total data volume per array is 1024 * 1024 * sizeof(double) = 8 MB. The 4060 Ti has 32 MB of L2 cache, so even the triad test, which uses 3 arrays (24 MB in total), can fit its entire data into the L2 cache. Same for the A800, with its 2x40MB of L2 cache. You have essentially changed the test into an L2 cache bandwidth benchmark! :-)
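The arithmetic above can be checked with a small host-side sketch. This is not part of cuda-stream; the helper names `working_set_bytes`, `fits_in_l2` and `print_report` are hypothetical, and the L2 size is passed in by hand (for example 32 MB for the 4060 Ti, or whatever the device reports) rather than queried from the GPU.

```cpp
#include <cassert>
#include <cstdint>
#include <cstdio>

// Hypothetical helper: total bytes touched by a kernel that streams over
// `num_arrays` arrays of `elements` doubles each.
std::uint64_t working_set_bytes(std::uint64_t elements, std::uint64_t num_arrays) {
    assert(num_arrays > 0);
    return elements * num_arrays * sizeof(double);
}

// Hypothetical helper: true when the whole working set is no larger than an
// L2 cache of `l2_bytes`, in which case the benchmark may end up measuring
// cache bandwidth instead of DRAM bandwidth.
bool fits_in_l2(std::uint64_t elements, std::uint64_t num_arrays, std::uint64_t l2_bytes) {
    return working_set_bytes(elements, num_arrays) <= l2_bytes;
}

// Prints the triad (3 arrays) working set for a given buffer size.
// `l2_bytes` is supplied by the caller; it is an assumption, not a query.
void print_report(const char* label, std::uint64_t elements, std::uint64_t l2_bytes) {
    const std::uint64_t triad = working_set_bytes(elements, 3);
    std::printf("%s: triad working set = %llu bytes, fits in L2: %s\n", label,
                static_cast<unsigned long long>(triad),
                fits_in_l2(elements, 3, l2_bytes) ? "yes" : "no");
}
```

Calling `print_report` with 1024 * 1024 elements and a 32 MB L2 should report that the triad working set fits, while the default buffer size of 128 * 1024 * 1024 + 2 elements is about 1 GB per array and does not fit in any of the caches mentioned in this thread.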
@te42kyfo Thanks for your answer. When I flush the L2 cache, the results return to normal. But I still have a question: my device is an A800 80GB PCIe. How do you know the L2 cache is 2x40MB? Where did you see that?
As far as I could google, the A800 is made from two GA100 chips, each of which has 40MB of cache. You should also be able to query that as the device property "l2CacheSize" in the cudaDeviceProp structure.
Hey @te42kyfo, I tested
The results show that the L2 cache size is 40MB, but as far as I could google, the L2 cache size of the A800 80GB is 80MB. Where did you get the following information?
You are right, my info was faulty. The A800 is just one model based on the GA100 chip, which has 40MB L2. I just googled really quickly because I haven't encountered that SKU yet, and drew the wrong conclusions.
I tried to test memory bandwidth with the cuda-stream benchmark. My device is a 4060 Ti, whose memory bandwidth is 288 GB/s.
I changed the parameter max_buffer_size from 128l * 1024 * 1024 + 2 to 1024 * 1024, as follows:
The result is much larger than 288 GB/s, as follows:
When I don't change the parameters, the test results are normal. The same phenomenon occurred on my other A800 machine.
So have you ever had that happen to you?