feat(table): add Scan.ReadTasks for reading pre-planned file scan tasks #781

Merged
zeroshade merged 2 commits into apache:main from twuebi:tp/scan-read-tasks on Mar 17, 2026
twuebi (Contributor) commented on Mar 13, 2026

This extracts the read logic from ToArrowRecords into a shared readTasks method and exposes it as ReadTasks, so callers that already have FileScanTasks can read them without re-planning.

Currently, ToArrowRecords always goes through PlanFiles first. If you've already selected or filtered your tasks (e.g. for compaction, selective re-reads, or custom merge strategies), there's no way to use them directly.
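
A minimal sketch of the shape of this refactor, for illustration only. The types below are hypothetical stand-ins, not the real iceberg-go API: the signatures are simplified (no context, no Arrow schema or record iterator), rows are plain strings, and planCalls exists only to make planning passes visible. The method names PlanFiles, ToArrowRecords, readTasks, and ReadTasks are the ones mentioned in the description above.

```go
package main

import "errors"

// FileScanTask is a hypothetical stand-in for one planned unit of work:
// a single data file to read. Rows stands in for the file's contents.
type FileScanTask struct {
	Path string
	Rows []string
}

// Scan is a hypothetical stand-in for a table scan.
type Scan struct {
	files     []FileScanTask
	planCalls int // counts planning passes; illustration only
}

// PlanFiles simulates the planning step that turns a scan into tasks.
func (s *Scan) PlanFiles() ([]FileScanTask, error) {
	s.planCalls++
	if s.files == nil {
		return nil, errors.New("scan has no files to plan")
	}
	return s.files, nil
}

// readTasks is the shared read path: it reads exactly the tasks it is
// given and never plans on its own.
func (s *Scan) readTasks(tasks []FileScanTask) []string {
	var out []string
	for _, t := range tasks {
		out = append(out, t.Rows...)
	}
	return out
}

// ToArrowRecords plans first, then delegates to the shared read path.
func (s *Scan) ToArrowRecords() ([]string, error) {
	tasks, err := s.PlanFiles()
	if err != nil {
		return nil, err
	}
	return s.readTasks(tasks), nil
}

// ReadTasks exposes the shared read path for callers that already hold
// a (possibly filtered) set of tasks, skipping the planning step.
func (s *Scan) ReadTasks(tasks []FileScanTask) []string {
	return s.readTasks(tasks)
}
```

In this sketch, a caller could plan once with PlanFiles, keep only the tasks it cares about (say, the files selected for compaction), and pass that subset to ReadTasks without triggering a second planning pass.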

zeroshade (Member) left a comment

just need to add a test for ReadTasks, and then we're good 😄

zeroshade merged commit 3c01a04 into apache:main on Mar 17, 2026
13 checks passed