Use SIMD intrinsics wherever possible #1928
- Correct mistake in x86_64/adx.rs where it was not testing `_addcarryx` at all
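As an illustration of what a check on `_addcarryx` has to cover, here is a minimal sketch that compares `_addcarryx_u64` against a portable model. It is not the test from `x86_64/adx.rs`; the helper name `add_with_carry` is hypothetical, and the runtime feature detection is an assumption made so the snippet also runs on machines without ADX.

```rust
/// Hypothetical helper (not part of stdarch): computes `a + b + carry_in`
/// and returns `(carry_out, sum)`.
pub fn add_with_carry(carry_in: u8, a: u64, b: u64) -> (u8, u64) {
    #[cfg(target_arch = "x86_64")]
    {
        if std::arch::is_x86_feature_detected!("adx") {
            let mut sum = 0u64;
            // SAFETY: ADX support was verified at runtime just above.
            let carry_out =
                unsafe { std::arch::x86_64::_addcarryx_u64(carry_in, a, b, &mut sum) };
            return (carry_out, sum);
        }
    }
    // Portable model: any non-zero carry_in counts as a carry of 1,
    // matching the behaviour of the intrinsic.
    let (partial, c1) = a.overflowing_add(b);
    let (sum, c2) = partial.overflowing_add((carry_in != 0) as u64);
    ((c1 | c2) as u8, sum)
}
```

The cases worth exercising are a carry out with no carry in, a carry that is produced only by the incoming carry, and an addition where both operands and the carry in contribute.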
1681388 to 27b1620
I restarted the runners and it seems to be working fine now. Do you want this merged now or try to find a solution for
Let's merge this for now. That can be another PR later (the
Just discovered that the Edit: weirdly, the rotate instructions do wrap on overflow
Fix mistake in #1928
Also, remove uses of some deprecated LLVM intrinsics
FIXME: `_mm256_test{z,c}_{ps,pd}` doesn't optimize well in the testsuite for some reason; it works on godbolt 😕
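For reference, a small sketch of what `_mm256_testz_ps` computes, checked against a scalar model. This says nothing about the codegen problem in the FIXME; it only pins down the expected result. The helper names (`testz_ps_scalar`, `testz_ps_avx`, `sign_bits_disjoint`) are hypothetical, and the runtime AVX check is an assumption made so the snippet runs anywhere.

```rust
/// Scalar model: true when no lane has the sign bit set in both `a` and `b`.
pub fn testz_ps_scalar(a: &[f32; 8], b: &[f32; 8]) -> bool {
    a.iter()
        .zip(b.iter())
        .all(|(x, y)| (x.to_bits() & y.to_bits()) >> 31 == 0)
}

#[cfg(target_arch = "x86_64")]
#[target_feature(enable = "avx")]
unsafe fn testz_ps_avx(a: &[f32; 8], b: &[f32; 8]) -> bool {
    use std::arch::x86_64::{_mm256_loadu_ps, _mm256_testz_ps};
    // SAFETY: both pointers reference 8 valid f32 values, and the caller
    // guarantees that AVX is available.
    unsafe {
        let va = _mm256_loadu_ps(a.as_ptr());
        let vb = _mm256_loadu_ps(b.as_ptr());
        // Returns 1 when the sign bits of (a AND b) are all zero.
        _mm256_testz_ps(va, vb) == 1
    }
}

/// Uses the intrinsic when AVX is detected, otherwise the scalar model.
pub fn sign_bits_disjoint(a: &[f32; 8], b: &[f32; 8]) -> bool {
    #[cfg(target_arch = "x86_64")]
    {
        if std::arch::is_x86_feature_detected!("avx") {
            // SAFETY: AVX support was verified at runtime just above.
            return unsafe { testz_ps_avx(a, b) };
        }
    }
    testz_ps_scalar(a, b)
}
```

Only the sign bit of each lane matters, so `-1.0` against `1.0` counts as disjoint while `-1.0` against `-2.0` does not.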