huge performance drop after some FLOPS/byte point #46
Comments
This can happen. One possibility is that register spilling is occurring. If you have time, you could experiment by manually controlling the unroll factor of the loop. For example, you could add an unroll hint to the loop at mixbench/mixbench-opencl/mix_kernels.cl, line 36 in e51e196.
I have tried this. An unroll factor of 16 is not enough here; 128 may be the best value, but there is still a huge drop after 128 compute iters.
So, this is device dependent. Maybe, for the OpenCL implementation, the unroll factor could be exposed as a benchmark parameter to give the user more flexibility. I'm not sure if it's worth it, though.
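As a rough illustration of what exposing the unroll factor as a parameter might look like on the host side, here is a minimal sketch in C. It is not mixbench's actual code: the helper names `parse_unroll_factor` and `format_build_options`, the macro name `UNROLL_FACTOR`, the default of 16, and the 1..1024 range are all assumptions made for this example. The idea is only that the host formats a `-D` define and passes it in the `options` argument of `clBuildProgram`, so the kernel can be rebuilt per device without editing the source.

```c
#include <stdio.h>
#include <stdlib.h>

/* Assumed default and upper bound; neither value comes from mixbench. */
#define DEFAULT_UNROLL 16
#define MAX_UNROLL 1024

/* Parse an unroll factor from user-supplied text (for example a
 * command-line argument). Falls back to DEFAULT_UNROLL when the text is
 * missing, not a plain decimal integer, or outside 1..MAX_UNROLL. */
int parse_unroll_factor(const char *text) {
    if (text == NULL || *text == '\0') {
        return DEFAULT_UNROLL;
    }
    char *end = NULL;
    long value = strtol(text, &end, 10);
    if (*end != '\0' || value < 1 || value > MAX_UNROLL) {
        return DEFAULT_UNROLL;
    }
    return (int)value;
}

/* Format the compiler option string that would be handed to
 * clBuildProgram() as its `options` argument. Returns the number of
 * characters written, or -1 if the buffer is too small. */
int format_build_options(char *buf, size_t size, int unroll) {
    int written = snprintf(buf, size, "-DUNROLL_FACTOR=%d", unroll);
    if (written < 0 || (size_t)written >= size) {
        return -1;
    }
    return written;
}

// Hypothetical kernel-side use of the macro (OpenCL C 2.0 or later,
// assuming the device compiler honors the hint):
//
//   __attribute__((opencl_unroll_hint(UNROLL_FACTOR)))
//   for (int i = 0; i < compute_iterations; i++) { ... }
```

Since the best factor differed between devices in this thread (16 versus 128), a parameter like this would at least make it possible to sweep values on a given GPU rather than hard-coding one.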
I have tried building and running mixbench-ocl on a Snapdragon 8 Gen 2, whose GPU is an Adreno.
What causes this "problem"?