NoSQL: Use actual key serialization to split NoSQL index stripes#4567

Draft
snazy wants to merge 2 commits into apache:main from snazy:nosql-index-stripe-sizing

snazy (Member) commented May 28, 2026

Base stripe splitting on the actual serialized key shape, including prefix compression, instead of a loose full-key estimate.

The estimate was safe but too pessimistic and could over-split perfectly reasonable stripes.
Reuse the real serialization walk so the split logic and persisted bytes stay aligned.
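
To illustrate why a full-key estimate over-counts, here is a minimal sketch that compares the two sizing approaches on a sorted key list. It is not the code from this PR: the class `KeySizeSketch`, the two-byte `ENTRY_OVERHEAD`, and the "shared prefix with the previous key" encoding are all assumptions chosen for illustration, and the real serialization format may differ.

```java
import java.nio.charset.StandardCharsets;
import java.util.List;

// Hypothetical illustration only; names and encoding are assumptions, not the PR's code.
final class KeySizeSketch {
  // Assumed per-entry overhead: one byte for the shared-prefix length, one for the suffix length.
  static final int ENTRY_OVERHEAD = 2;

  // Pessimistic estimate: every key is counted as if it were stored in full.
  static int fullKeyEstimate(List<String> sortedKeys) {
    int total = 0;
    for (String key : sortedKeys) {
      total += ENTRY_OVERHEAD + key.getBytes(StandardCharsets.UTF_8).length;
    }
    return total;
  }

  // Size under prefix compression: only the suffix that differs from the previous key is counted.
  static int prefixCompressedSize(List<String> sortedKeys) {
    int total = 0;
    byte[] previous = new byte[0];
    for (String key : sortedKeys) {
      byte[] current = key.getBytes(StandardCharsets.UTF_8);
      int shared = commonPrefixLength(previous, current);
      total += ENTRY_OVERHEAD + (current.length - shared);
      previous = current;
    }
    return total;
  }

  // Number of leading bytes that two byte arrays have in common.
  static int commonPrefixLength(byte[] a, byte[] b) {
    int max = Math.min(a.length, b.length);
    int i = 0;
    while (i < max && a[i] == b[i]) {
      i++;
    }
    return i;
  }
}
```

For the keys `ns1.table_a`, `ns1.table_b`, `ns1.view_c`, this sketch counts 38 bytes with the full-key estimate and 24 bytes with prefix compression, which is the kind of gap that can trigger an unnecessary split.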

Depends on, and is a follow-up to, #4566.

snazy added 2 commits May 30, 2026 09:29
Replace the old count-oriented stripe split behavior with a target-size driven
path at the UpdatableIndexImpl level.

That keeps oversized stripes from dragging too much neighboring data along
when we have to rewrite them, and it gives us better control over large entry
distributions.

Base stripe splitting on the actual serialized key shape, including prefix
compression, instead of a loose full-key estimate.

The estimate was safe but too pessimistic and could over-split perfectly
reasonable stripes. Reuse the real serialization walk so the split logic and
persisted bytes stay aligned.
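
The two commit messages above can be read together as "split by target size, measured with the real serialized size". A minimal sketch of that idea follows. It assumes a simple greedy walk and the same hypothetical encoding (two bytes of overhead plus the suffix that differs from the previous key); `StripeSplitSketch` and `split` are illustrative names, not the `UpdatableIndexImpl` API.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; this is not the PR's implementation.
final class StripeSplitSketch {
  // Assumed per-entry overhead: shared-prefix length byte plus suffix length byte.
  static final int ENTRY_OVERHEAD = 2;

  // Greedily packs sorted keys into stripes whose prefix-compressed size stays within targetBytes.
  // A single key larger than targetBytes still gets a stripe of its own.
  static List<List<String>> split(List<String> sortedKeys, int targetBytes) {
    List<List<String>> stripes = new ArrayList<>();
    List<String> current = new ArrayList<>();
    int currentSize = 0;
    byte[] previous = new byte[0];
    for (String key : sortedKeys) {
      byte[] bytes = key.getBytes(StandardCharsets.UTF_8);
      int entrySize = ENTRY_OVERHEAD + bytes.length - commonPrefixLength(previous, bytes);
      if (!current.isEmpty() && currentSize + entrySize > targetBytes) {
        // Close the current stripe and start a new one.
        stripes.add(current);
        current = new ArrayList<>();
        currentSize = 0;
        // The first key of a stripe has no predecessor to share a prefix with.
        entrySize = ENTRY_OVERHEAD + bytes.length;
      }
      current.add(key);
      currentSize += entrySize;
      previous = bytes;
    }
    if (!current.isEmpty()) {
      stripes.add(current);
    }
    return stripes;
  }

  // Number of leading bytes that two byte arrays have in common.
  static int commonPrefixLength(byte[] a, byte[] b) {
    int max = Math.min(a.length, b.length);
    int i = 0;
    while (i < max && a[i] == b[i]) {
      i++;
    }
    return i;
  }
}
```

With a 24-byte target, the keys `ns1.table_a`, `ns1.table_b`, `ns1.view_c`, `ns2.table_a` land in two stripes of three and one keys in this sketch. Counting each key in full would have closed the first stripe after a single key.
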
snazy force-pushed the nosql-index-stripe-sizing branch from 3cc7380 to 11b0cfe on May 30, 2026 07:29