Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix/improve intrinsics usage for ARM64EC #2306

Merged
merged 1 commit into from
Oct 29, 2021

Conversation

StephanTLavavej
Copy link
Member

This mirrors Ben Niu's MSVC-PR-361004 "Disable x64 code paths that use tzcnt for ARM64EC, and enable NEON paths for ARM64EC".

The _HAS_NEON_INTRINSICS change is a performance improvement - ARM64EC has access to these ARM64 intrinsics.

The tzcnt change is a bugfix (fixing broken builds) - ARM64EC does not have access to this x86/x64 intrinsic.

@StephanTLavavej StephanTLavavej added bug Something isn't working performance Must go faster labels Oct 28, 2021
@StephanTLavavej StephanTLavavej requested a review from a team as a code owner October 28, 2021 19:38
@CaseyCarter
Copy link
Member

This mirrors Ben Niu's MSVC-PR-361004 "Disable x64 code paths that use tzcnt for ARM64EC, and enable NEON paths for ARM64EC".

... which Casey approved without a GitHub port 🤦

@StephanTLavavej StephanTLavavej merged commit ee5a216 into microsoft:main Oct 29, 2021
@StephanTLavavej StephanTLavavej deleted the arm64ec_intrinsics branch October 29, 2021 10:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working performance Must go faster
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants