New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
performance on amd 7950x ... #6
Comments
I even tried to use aocc compiler since support for zen4 is limited in gcc-12 but I ended with similar results.
|
Zen 4 based CPUs perform poorly because AMD's implementation of I've also run the benchmark on my 7700X CPU and also get extremely poor results for Zen 4. Thankfully, replacing calls to |
Thanks for the explanation. It is good that the instruction can be emulated and results can be similar to intel speedup. |
Hello,
I tried benchmark on 7950x cpu and performance is in some tests up to 2.3x faster but in other tests much slower (like 0.3x) compared to classical sorting. Is amd implementation of avx512 not so powerful (and your code is not suitable for zen4) or is it something else ?
Thanks,
Jan
The text was updated successfully, but these errors were encountered: