Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

simplify SSE implementation of row_lazy match finder #2929

Merged
merged 1 commit into from Dec 15, 2021
Merged

Conversation

Cyan4973
Copy link
Contributor

@Cyan4973 Cyan4973 commented Dec 14, 2021

The refactor is proposed mostly for maintenance convenience.

Performance wise, there is very little change,
I measure it slightly faster for slog 3 & 4 (~+2%)
but neutral or very slightly negative for slot 5 & 6 (< 1 %)
on both gcc and clang.
It feels like an okay side effect
(we want to improve speed more for faster modes).

But the real goal was to reduce the amount of intrinsic code.

This approach could not be easily extended to NEON because the current implementation is significantly different between the 16, 32, and 64 entries.

mostly for maintenance convenience.

Performance wise, there is very little change,
slightly faster for slog 3 & 4,
neutral or very slightly negative for slot 5 & 6.
@Cyan4973 Cyan4973 changed the title simply SSE implementation of row_lazy match finder simplify SSE implementation of row_lazy match finder Dec 14, 2021
Copy link
Contributor

@felixhandte felixhandte left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks correct to me.

@Cyan4973 Cyan4973 merged commit eaf7862 into dev Dec 15, 2021
@Cyan4973 Cyan4973 deleted the sse_row_lazy branch January 13, 2023 04:26
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants