kaizen: core_matcher: pool nfaBuffers and flattener in the test helper path by sayrer · Pull Request #520 · timbray/quamina

sayrer · 2026-04-16T23:32:00Z

This one isn't a real win, but it's getting annoying to have the benchmarks with different behavior than the production path. When some flavor of more aggressive DFA optimization happens, this will be good.

matchesForJSONWithFlattener previously allocated a fresh *nfaBuffers per call, and matchesForJSONEvent additionally allocated a fresh *flattenJSON per call. These helpers are the path used by most tests and benchmarks (the production *Quamina.MatchesForEvent already reuses both).

Add two package-level sync.Pools and Get/Put on each call. On Put, re-seat bufs.resultBuf with a fresh slice so the []X returned to the caller can't be written through by the next pool user — the old backing array stays exclusive to the caller.

Bench (Apple M1 Ultra, n=6):

NumberMatching-20 890.2n -> 498.3n -44.03%
2288 B -> 280 B -87.76%
10 allocs -> 3 allocs
geomean (10 benchmarks): -4.71% time, -18.39% B/op

Other benchmarks that route through *Quamina.MatchesForEvent rather than the helpers are unchanged in behavior; a few show +1-3% noise likely from code layout (they don't touch the pooled path).

Tested #519 and #520 together under the race detector and it was clean, but it took 117 seconds.

matchesForJSONWithFlattener previously allocated a fresh *nfaBuffers per call, and matchesForJSONEvent additionally allocated a fresh *flattenJSON per call. These helpers are the path used by most tests and benchmarks (the production *Quamina.MatchesForEvent already reuses both). Add two package-level sync.Pools and Get/Put on each call. On Put, re-seat bufs.resultBuf with a fresh slice so the []X returned to the caller can't be written through by the next pool user — the old backing array stays exclusive to the caller. Bench (Apple M1 Ultra, n=6): NumberMatching-20 890.2n -> 498.3n -44.03% 2288 B -> 280 B -87.76% 10 allocs -> 3 allocs geomean (10 benchmarks): -4.71% time, -18.39% B/op Other benchmarks that route through *Quamina.MatchesForEvent rather than the helpers are unchanged in behavior; a few show +1-3% noise likely from code layout (they don't touch the pooled path). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

timbray

Currently failing memory_cost_test, will sit on this till we get that stuff sorted out.

sayrer changed the title ~~core_matcher: pool nfaBuffers and flattener in the test helper path~~ kaizen: core_matcher: pool nfaBuffers and flattener in the test helper path Apr 16, 2026

sayrer mentioned this pull request Apr 16, 2026

kaizen: nfa: simplify smallTable.step, eliminate per-traverse stepOut alloc #519

Merged

timbray approved these changes Apr 17, 2026

View reviewed changes

sayrer added 2 commits April 17, 2026 13:33

Merge branch 'main' into pool-nfa-buffers

6908cb3

Merge branch 'main' into pool-nfa-buffers

6c41e75

timbray reviewed Apr 18, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

kaizen: core_matcher: pool nfaBuffers and flattener in the test helper path#520

kaizen: core_matcher: pool nfaBuffers and flattener in the test helper path#520
sayrer wants to merge 3 commits into
timbray:mainfrom
sayrer:pool-nfa-buffers

sayrer commented Apr 16, 2026 •

edited

Loading

Uh oh!

timbray left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

sayrer commented Apr 16, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

timbray left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

sayrer commented Apr 16, 2026 •

edited

Loading