perf: fix O(N·K) slow row-id lookup on stable-row-id datasets #6716
Conversation
`Dataset._take_rows` could effectively hang on datasets with stable row IDs enabled and any fragment containing deletions. `RowIdIndex::get` ends in `U64Segment::get(pos)`, and for `RangeWithHoles` / `RangeWithBitmap` this was implemented as `self.iter().nth(i)` — an O(i) walk through the filtered iterator. Called once per input id in `TakeBuilder::get_row_addrs`, the total cost was O(N · K). At 1000 ids against a 4M-row dataset with 8 interior deletions, the take ran in 67s; the same workload now completes in ~7ms.

`RangeWithHoles::get` now binary-searches the hole-count function (which is monotone in `holes[k] - k`) for O(log H) lookups. `RangeWithBitmap::get` walks bytes with `count_ones` until it finds the byte holding the i-th set bit, then `trailing_zeros` within that byte — O(K/64). `RangeWithHoles::position` swaps `take_while().count()` for `binary_search`, also O(log H).

Adds `RowIdIndex::get_many`, which sorts a working copy of the input and sweeps the chunk iterator once, amortizing the per-id `RangeInclusiveMap` walk from O(N · log F) to O(F + N). `TakeBuilder::get_row_addrs` now routes through it.

Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>
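To make the first of those concrete, here is a minimal standalone sketch of the binary-search idea (not the PR's actual code; `start`, `end`, and `holes` are hypothetical stand-ins for the segment's fields):

```rust
/// Return the i-th surviving value of `start..=end` after removing `holes`
/// (sorted, unique, all inside the range), or None if i is out of bounds.
fn get(start: u64, end: u64, holes: &[u64], i: u64) -> Option<u64> {
    // f(k) = holes[k] - k is monotone non-decreasing because the holes are
    // sorted and unique, so binary search can find how many holes land at
    // or before the i-th survivor.
    let (mut lo, mut hi) = (0usize, holes.len());
    while lo < hi {
        let mid = lo + (hi - lo) / 2;
        if holes[mid] - mid as u64 > start + i {
            hi = mid; // the i-th survivor lies below holes[mid]
        } else {
            lo = mid + 1; // holes[mid] sits at or before the answer: skip it
        }
    }
    // Exactly `lo` holes precede the answer, so shift the index past them.
    let value = start + i + lo as u64;
    (value <= end).then_some(value)
}
```

With `start = 0` and `holes = [2, 5]` the survivors run 0, 1, 3, 4, 6, …, and `get(0, 9, &[2, 5], 2)` returns `Some(3)` after skipping one hole.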
@claude review once
No bugs found — the algorithm is correct and the description's math holds up (the binary search uses the monotone `holes[k] - k`, and the bitmap select1 relies on `new_full` / `From<&[bool]>` both zero-padding bits past `len`, which I verified).
Extended reasoning
Overview
Three files modified, ~140 net lines:
- `rust/lance-table/src/rowids/segment.rs`: rewrites `RangeWithHoles::get` (O(N) → O(log H) via binary search on the monotone function `holes[k] - k`), `RangeWithBitmap::get` (O(N) → O(K/64) via byte popcount + `trailing_zeros`), and `RangeWithHoles::position` (O(N) → O(log H)).
- `rust/lance-table/src/rowids/index.rs`: adds `RowIdIndex::get_many` — sorts a working copy of input ids and sweeps the `RangeInclusiveMap` chunk iterator once.
- `rust/lance/src/dataset/take.rs`: routes `TakeBuilder::get_row_addrs` through `get_many`.
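The bitmap path can be sketched the same way. Below is a hypothetical standalone `select1` over raw bytes, assuming LSB-first bit order and zero bits past the logical length (the invariant verified below); the PR's version operates on the segment's own bitmap representation:

```rust
/// Return the position of the i-th set bit (0-indexed) in a byte-packed
/// bitmap, or None if fewer than i + 1 bits are set. Assumes LSB-first
/// bit order and zero bits past the logical length.
fn select1(bytes: &[u8], mut i: u32) -> Option<usize> {
    for (byte_idx, &byte) in bytes.iter().enumerate() {
        let ones = byte.count_ones();
        if i < ones {
            // The target bit is inside this byte: clear its i lowest set
            // bits, then trailing_zeros yields the remaining lowest one.
            let mut b = byte;
            for _ in 0..i {
                b &= b - 1; // drop the lowest set bit
            }
            return Some(byte_idx * 8 + b.trailing_zeros() as usize);
        }
        i -= ones; // skip the whole byte with one popcount
    }
    None
}
```

Skipping whole bytes with `count_ones` is the source of the speedup; this byte-at-a-time sketch is O(K/8), and a word-at-a-time variant over `u64` gives the O(K/64) the PR cites.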
The motivating bug: `Dataset._take_rows` on stable-row-id datasets with any deletions degraded from milliseconds to ~67s for 1000 ids on 4M rows.
Security risks
None. This is purely an algorithmic performance fix in internal row-id resolution. No new I/O, no new external input parsing, no auth/crypto/permissions. The new code does not introduce panics — the `holes.get(mid).unwrap()` is bounded by `hi <= holes.len()` and `mid < hi`.
Level of scrutiny
Higher-than-minimum. This is not a mechanical change — it is a non-trivial algorithmic rewrite of code on the hot path for `take_rows`, with subtle correctness invariants:

- `RangeWithHoles::get` correctness depends on `holes[k] - k` being monotone non-decreasing (true because holes are sorted and unique).
- `RangeWithBitmap::get` correctness depends on bytes past `bitmap.len()` being zero-padded — verified true for `new_full` (explicitly zeros tail bits) and `From<&[bool]>` (starts from `new_empty`, which is all-zero).
- `get_many` correctness depends on the input being sorted before the sweep, with the `else { break }` path safe because `out` is pre-initialized to `None`.
I worked through these by hand and they hold, but a human maintainer familiar with the rowid layer should still confirm — especially because this code path silently affects all stable-row-id reads.
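The sweep itself is easiest to see in a standalone model. Everything below is hypothetical: `chunks` stands in for the sorted, disjoint `RangeInclusiveMap` entries (each mapping a row-id range to its addresses), and this sketch sorts a permutation of positions rather than a working copy of the ids, which preserves output order the same way:

```rust
use std::ops::RangeInclusive;

/// One forward pass over both the sorted ids and the chunk list.
/// Ids that match no chunk keep their pre-initialized None.
fn get_many(chunks: &[(RangeInclusive<u64>, Vec<u64>)], ids: &[u64]) -> Vec<Option<u64>> {
    // Sort positions by id so both cursors only ever move forward.
    let mut order: Vec<usize> = (0..ids.len()).collect();
    order.sort_unstable_by_key(|&p| ids[p]);

    let mut out = vec![None; ids.len()];
    let mut chunk_iter = chunks.iter();
    let mut current = chunk_iter.next();
    for p in order {
        let id = ids[p];
        // Advance past chunks that end before this id; never rewind.
        while let Some((range, _)) = current {
            if *range.end() < id {
                current = chunk_iter.next();
            } else {
                break;
            }
        }
        match current {
            Some((range, addrs)) if range.contains(&id) => {
                out[p] = addrs.get((id - range.start()) as usize).copied();
            }
            _ => {} // no chunk covers this id: out[p] stays None
        }
    }
    out
}
```

Because both cursors only move forward, the chunk list is traversed at most once no matter how many ids are looked up, which is the O(F + N) amortization the description claims.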
Other factors
Test coverage is strong: existing proptest in `rowids/index.rs`, comprehensive `test_segments` covering all variants of get/position/iteration round-trip, and a new perf-regression test (`test_large_range_segments_no_deletions`) that would fail if the optimization is undone. Codecov flags 6 uncovered lines (mostly in error/edge paths). The reviewer (westonpace) explicitly asked for a review pass, which also suggests this should not skip a human.
westonpace
left a comment
Awesome, thanks for the fix. I think my only question is whether we should also close #6486 now?