You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Unify the implementation of _mm_hsub[s]_* with unzip vector intrinsic.
The old implementation:
https://godbolt.org/z/7Ybof1Y1K
The better implementation with less assembly code for ARM32 and ARM64:
https://godbolt.org/z/rhdcP7Khehttps://godbolt.org/z/ehvn51To3
Extract variable declaration for readability.
Replace transpose vector intrinsic with unzip vector instrinsic for
unification.
Close#432.
The implementation for _mm_hsub_* functions varies. Maybe we should unify to the faster one
The text was updated successfully, but these errors were encountered: