@BlakeOrth
Contributor

Which issue does this PR close?

This is a follow-on PR spurred by this comment chain:

This work is associated with:

Rationale for this change

Before #18146 was merged, the implementation could list only the files under a specific prefix when the filters could be matched to known partition prefixes. This PR reintroduces that capability, so a query whose filters match a partition prefix no longer has to list and filter every file in the table.

What changes are included in this PR?

  • Adds the ability to list files backing a table URL optionally filtered by a path prefix
  • Reintroduces the ability for partitioned listing tables to only list prefixes that match an input filter
  • Adds tests for new functionality
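
As a rough illustration of the second point, the sketch below derives a listing prefix from equality filters on the leading partition columns. It is not the code in this PR: `partition_prefix`, its signature, and the representation of filters as a column-to-value map are all assumptions made for the example, and it assumes a Hive-style `col=value` directory layout.

```rust
use std::collections::HashMap;

/// Hypothetical helper (not DataFusion's API): builds the longest path
/// prefix that is fully determined by equality filters on the leading
/// partition columns, assuming a Hive-style `col=value` layout.
fn partition_prefix(
    partition_cols: &[&str],
    eq_filters: &HashMap<&str, &str>,
) -> Option<String> {
    let mut parts = Vec::new();
    for col in partition_cols {
        match eq_filters.get(col) {
            Some(value) => parts.push(format!("{col}={value}")),
            // Stop at the first column without an equality filter:
            // a filter on a deeper column cannot extend the prefix.
            None => break,
        }
    }
    if parts.is_empty() {
        None
    } else {
        Some(parts.join("/"))
    }
}
```

For a table partitioned on `a` and `b` with a filter `a = 2`, this returns `Some("a=2")`, so a table rooted at `data` could issue a single narrowed `LIST` under `data/a=2` rather than walking every partition. When no leading partition column has an equality filter, it returns `None` and the caller would fall back to listing the whole table.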

Are these changes tested?

Yes. There is existing coverage on many of the changes, new tests have been added, and existing integration tests have been updated to show the change in behavior.

Are there any user-facing changes?

No.

cc @alamb

@github-actions github-actions bot added core Core DataFusion crate catalog Related to the catalog crate datasource Changes to the datasource crate labels Nov 17, 2025
Contributor

@alamb alamb left a comment

Thank you @BlakeOrth -- this looks great to me

RequestCountingObjectStore()
Total Requests: 2
- LIST prefix=data
- LIST prefix=data/a=2

nice

@alamb
Contributor

alamb commented Nov 18, 2025

Thanks again @BlakeOrth

Merged via the queue into apache:main with commit 6601959 Nov 18, 2025
28 checks passed