Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add SSE2 version of vfpu_dot #18289

Merged
merged 2 commits into from
Oct 3, 2023
Merged

Add SSE2 version of vfpu_dot #18289

merged 2 commits into from
Oct 3, 2023

Conversation

fp64
Copy link
Contributor

@fp64 fp64 commented Oct 2, 2023

See #18249. Speedup for this function ranges 10%..100%, depending on system. Updated verification and speed measurements: https://godbolt.org/z/W1z3sj6hz

Please feel free to double-check that it matches the current version (previous code - and verification - were wrong around NaNs, this is hopefully correct).

See hrydgard#18249. Speedup for this function ranges 10%..100%,
depending on system. Updated verification and speed measurements:
https://godbolt.org/z/W1z3sj6hz
Comment on lines 882 to 884
#endif
return vfpu_dot_cpp(a, b);
}
Copy link
Owner

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

please do an #else, if nothing else for clarity.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Done.

@fp64
Copy link
Contributor Author

fp64 commented Oct 2, 2023

On a completely unrelated note, "JIT from IR" currently crashes in both local build and win32 build from buildbot (ppsspp-v1.16.5-84-g008055d24-windows-x86) in, apparently, any game (yes, I'm too lazy to file an issue properly).

@hrydgard
Copy link
Owner

hrydgard commented Oct 2, 2023

And in another unrelated note, Jit using IR is no longer crashing on Mac M1, for whatever reason.

@hrydgard hrydgard merged commit 7c184a7 into hrydgard:master Oct 3, 2023
18 checks passed
@fp64 fp64 deleted the sse2-vfpu-dot branch October 3, 2023 12:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants