
Feature/excess v3 #47

Merged
Malkovsky merged 5 commits into main from feature/excess_v3 on May 16, 2026

Conversation

@Malkovsky
Owner

Optimized LUT and expand versions of excess algorithm

@codecov

codecov Bot commented May 15, 2026

Codecov Report

❌ Patch coverage is 96.02888% with 11 lines in your changes missing coverage. Please review.
✅ Project coverage is 87.36%. Comparing base (a553366) to head (32355a2).

| Files with missing lines | Patch % | Lines |
|---|---|---|
| include/pixie/experimental/excess.h | 94.17% | 7 Missing and 4 partials ⚠️ |
Additional details and impacted files
@@            Coverage Diff             @@
##             main      #47      +/-   ##
==========================================
+ Coverage   87.29%   87.36%   +0.07%     
==========================================
  Files          12       13       +1     
  Lines        3117     3214      +97     
  Branches      612      602      -10     
==========================================
+ Hits         2721     2808      +87     
- Misses        260      267       +7     
- Partials      136      139       +3     
| Flag | Coverage Δ |
|---|---|
| gcov | 87.36% <96.02%> (+0.07%) ⬆️ |


@Malkovsky
Owner Author

@codex please review

@chatgpt-codex-connector

Codex Review: Didn't find any major issues. Chef's kiss.

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

  • Open a pull request for review
  • Mark a draft as ready
  • Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

@Malkovsky force-pushed the feature/excess_v3 branch from 08f5300 to 32355a2 on May 16, 2026 10:23

@chatgpt-codex-connector chatgpt-codex-connector Bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 32355a2b0c

total_match = _mm512_maskz_mov_epi8(active, total_match);

const __m512i res = _mm512_maddubs_epi16(total_match, vmult);
const __m256i packed = _mm512_cvtepi16_epi8(res);

P1: Pack LUT bytes with unsigned narrowing

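res encodes two 4-bit position masks per byte via maddubs, so valid byte values frequently exceed 127 (e.g. when the high nibble contains bit 0x8, yielding values >= 128). The narrowing step must therefore preserve the full 0..255 range. A signed-saturating narrow (vpmovswb, _mm512_cvtsepi16_epi8) would clamp those bytes to 127 and clear high-nibble match bits, producing incorrect out masks for reachable targets on AVX-512 builds. Note that _mm512_cvtepi16_epi8 maps to vpmovwb, which truncates rather than saturates, so it keeps the low byte of each 16-bit lane intact. An unsigned narrow (e.g. _mm512_cvtusepi16_epi8, vpmovuswb) gives the same result as long as every lane is at most 255, as the two-nibble encoding implies.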

Useful? React with 👍 / 👎.
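
To make the narrowing behaviours discussed in this comment concrete, here is a minimal scalar sketch of a single 16-bit lane, following the semantics given in the Intel Intrinsics Guide for `_mm512_maddubs_epi16` and the three 16-to-8-bit conversions. It is not the PR's code: the helper names are hypothetical, and the `(1, 16)` multiplier pair is only an assumption about what `vmult` holds.

```cpp
#include <cassert>
#include <cstdint>

// Scalar model of one 16-bit lane of _mm512_maddubs_epi16(a, b):
// unsigned bytes from a times signed bytes from b, adjacent pair summed
// with signed saturation to 16 bits.
inline int16_t maddubs_lane(uint8_t a0, uint8_t a1, int8_t b0, int8_t b1) {
    int32_t sum = int32_t(a0) * b0 + int32_t(a1) * b1;
    if (sum > INT16_MAX) sum = INT16_MAX;
    if (sum < INT16_MIN) sum = INT16_MIN;
    return static_cast<int16_t>(sum);
}

// vpmovwb (_mm512_cvtepi16_epi8): keep the low 8 bits, no saturation.
inline uint8_t narrow_truncate(int16_t v) {
    return static_cast<uint8_t>(static_cast<uint16_t>(v) & 0xFF);
}

// vpmovswb (_mm512_cvtsepi16_epi8): clamp to the signed range [-128, 127].
inline uint8_t narrow_signed_sat(int16_t v) {
    int16_t c = v > 127 ? 127 : (v < -128 ? -128 : v);
    return static_cast<uint8_t>(static_cast<int8_t>(c));
}

// vpmovuswb (_mm512_cvtusepi16_epi8): treat the lane as unsigned,
// clamp to [0, 255].
inline uint8_t narrow_unsigned_sat(int16_t v) {
    uint16_t u = static_cast<uint16_t>(v);
    return static_cast<uint8_t>(u > 255 ? 255 : u);
}
```

With a low nibble of 0x3 and a high nibble of 0x9 under the assumed multipliers, `maddubs_lane(0x3, 0x9, 1, 16)` yields 147 (0x93). `narrow_truncate` and `narrow_unsigned_sat` both return 0x93, while `narrow_signed_sat` returns 0x7F and loses the 0x8 bit of the high nibble.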

@Malkovsky Malkovsky merged commit d60a5cb into main May 16, 2026
6 checks passed