feat: expose FTS exec internals to enable distributed planning by vivek-bharathan · Pull Request #6648 · lance-format/lance

vivek-bharathan · 2026-04-30T04:57:06Z

The FTS execution plan types (MatchQueryExec, PhraseQueryExec, BoostQueryExec,
BooleanQueryExec, FlatMatchQueryExec, FlatMatchFilterExec) and their supporting
helpers (load_segments, load_segment_details, build_global_bm25_scorer) are
currently private or pub(crate), with fields hidden behind constructors that always
assume that all committed segments exist on one node and are scored with statistics computed
locally.

This doesn't work for systems that distribute FTS queries across hosts.
A coordinator that wants to (for example) route segments 1–5 to host A and
segments 6–10 to host B, and still produce globally correct BM25 scores, can't do so
today: each per-host exec computes IDFs against its local segment subset, producing
locally correct but globally wrong scores.
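
To make the scoring problem concrete, here is a minimal sketch of how a local IDF diverges from the global one. It assumes the common Lucene-style BM25 IDF formula, which may differ from the exact arithmetic in Lance. `TermStats`, `local_idf`, and `global_idf` are hypothetical names used only for illustration and are not part of this PR.

```rust
/// Per-term statistics as seen by one host.
/// Hypothetical type for illustration; not a Lance API.
#[derive(Clone, Copy, Debug)]
pub struct TermStats {
    /// Number of documents in the segments held by this host.
    pub num_docs: u64,
    /// Number of those documents that contain the query term.
    pub doc_freq: u64,
}

/// Lucene-style BM25 inverse document frequency (assumed formula).
pub fn bm25_idf(num_docs: u64, doc_freq: u64) -> f64 {
    let n = num_docs as f64;
    let df = doc_freq as f64;
    ((n - df + 0.5) / (df + 0.5) + 1.0).ln()
}

/// IDF computed from a single host's segment subset only.
pub fn local_idf(stats: TermStats) -> f64 {
    bm25_idf(stats.num_docs, stats.doc_freq)
}

/// IDF computed from statistics summed over every host.
/// Summing is valid here because segments hold disjoint sets of documents.
pub fn global_idf(hosts: &[TermStats]) -> f64 {
    let num_docs: u64 = hosts.iter().map(|h| h.num_docs).sum();
    let doc_freq: u64 = hosts.iter().map(|h| h.doc_freq).sum();
    bm25_idf(num_docs, doc_freq)
}
```

With host A at 1,000 documents and 10 matches, and host B at 1,000 documents and 500 matches, the local IDFs come out near 4.56 and 0.69, while the global IDF is near 1.37. Scores from the two hosts are therefore not comparable unless both use the global value.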

This PR exposes the surface needed for that pattern, additively, without changing any
existing behavior.

claude

Claude Code Review

This pull request is from a fork — automated review is disabled. A repository maintainer can comment @claude review to run a one-time review.

codecov · 2026-04-30T05:26:30Z

Codecov Report

❌ Patch coverage is 67.76612% with 215 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
rust/lance/src/io/exec/fts.rs	65.35%	187 Missing and 15 partials ⚠️
rust/lance-index/src/scalar/inverted.rs	86.00%	2 Missing and 5 partials ⚠️
rust/lance/src/dataset/scanner.rs	81.25%	0 Missing and 6 partials ⚠️

westonpace · 2026-04-30T17:12:36Z

@claude review

claude

⚠️ Code review skipped — your organization has reached its monthly code review spending cap.

An organization admin can view or raise the cap at claude.ai/admin-settings/claude-code. The cap resets at the start of the next billing period.

Once the cap resets or is raised, comment @claude review on this pull request to trigger a review.

westonpace

Just a few doc questions. It might also be nice (in this PR or in a separate issue) to document why we are making this change.

My blocking request is that we fill out the PR description with some justification for why this change is desired. It looks like we are trying to make the FTS exec nodes a more complete part of the public API? That is a fine rationale, but we should at least describe it.

westonpace · 2026-04-30T17:09:01Z

+/// single corpus-wide scorer, so that BM25 IDF scoring uses *global*
+/// statistics rather than per-segment statistics. Computes the union of
+/// fuzzy-expanded terms when `params.fuzziness` is set.
+pub fn build_global_bm25_scorer(


Why does this method need to be made public? Is it to supply a MemBM25Scorer to some of the exec nodes?

vivek-bharathan

Actually, this is mostly for API completeness, since it is paired with with_base_scorer. It keeps a single source of truth for BM25 IDF arithmetic across segments. I could move it back if you prefer.
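
As a rough illustration of what "a single corpus-wide scorer" means in the doc comment above, the sketch below merges per-segment statistics into one scorer that every exec could share. All types and methods here (`SegmentStats`, `GlobalScorer`, `from_segments`) are hypothetical stand-ins. The real signatures of build_global_bm25_scorer and MemBM25Scorer are not shown in this thread, and the IDF formula is the common Lucene-style one, assumed rather than confirmed.

```rust
use std::collections::HashMap;

/// Hypothetical per-segment statistics; not Lance's actual types.
#[derive(Debug, Default, Clone)]
struct SegmentStats {
    num_docs: u64,
    total_tokens: u64,
    /// Number of documents in this segment that contain each term.
    doc_freq: HashMap<String, u64>,
}

/// Hypothetical corpus-wide scorer, built once and handed to every exec.
#[derive(Debug, Default)]
struct GlobalScorer {
    num_docs: u64,
    total_tokens: u64,
    doc_freq: HashMap<String, u64>,
}

impl GlobalScorer {
    /// Sum document counts, token counts, and per-term document
    /// frequencies over all segments (segments hold disjoint documents).
    fn from_segments<'a>(segments: impl IntoIterator<Item = &'a SegmentStats>) -> Self {
        let mut scorer = GlobalScorer::default();
        for seg in segments {
            scorer.num_docs += seg.num_docs;
            scorer.total_tokens += seg.total_tokens;
            for (term, df) in &seg.doc_freq {
                *scorer.doc_freq.entry(term.clone()).or_insert(0) += *df;
            }
        }
        scorer
    }

    /// Average document length over the whole corpus.
    fn avg_doc_len(&self) -> f64 {
        if self.num_docs == 0 {
            0.0
        } else {
            self.total_tokens as f64 / self.num_docs as f64
        }
    }

    /// Lucene-style BM25 IDF (assumed formula) from global statistics.
    fn idf(&self, term: &str) -> f64 {
        let n = self.num_docs as f64;
        let df = self.doc_freq.get(term).copied().unwrap_or(0) as f64;
        ((n - df + 0.5) / (df + 0.5) + 1.0).ln()
    }
}
```

Keeping the merge in one place is the "single source of truth" argument: a coordinator computes these totals once, and each host scores its own segments against the same numbers.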

- Adds public getters on every FTS exec type
- Promote segment loaders and aggregation arithmetic to pub
- Add serde for FtsSearchParams
- Add Segment-bound construction for FTS execs
- Add scorer injection for FTS execs

westonpace

Switching to approve as there is now a PR description. Thanks 😄

claude Bot reviewed Apr 30, 2026

github-actions Bot added the enhancement label Apr 30, 2026

claude Bot reviewed Apr 30, 2026

wkalt reviewed Apr 30, 2026

Comment thread rust/lance-index/src/scalar/inverted.rs

westonpace requested changes Apr 30, 2026

vivek-bharathan force-pushed the feat/exposeftsinternals branch from 0794f90 to 2510209 on April 30, 2026 17:59

vivek-bharathan added 2 commits April 30, 2026 11:35

feat: expose FTS exec internals to enable distributed planning

2345153

- Adds public getters on every FTS exec type
- Promote segment loaders and aggregation arithmetic to pub
- Add serde for FtsSearchParams
- Add Segment-bound construction for FTS execs
- Add scorer injection for FTS execs

feat: refactor and publish per-slot assembly logic for fts execs

9e06cc1

vivek-bharathan force-pushed the feat/exposeftsinternals branch from 2510209 to 9e06cc1 on April 30, 2026 18:36

westonpace approved these changes May 1, 2026

wkalt merged commit 0b5b95c into lance-format:main May 1, 2026
28 checks passed

vivek-bharathan deleted the feat/exposeftsinternals branch May 1, 2026 17:11
