Feature Request / Improvement
Hi pyiceberg community,
Is there a more efficient way than the API table.inspect.partitions() to query specific partitions?
The table.inspect.partitions() seems expensive as it reads all manifest files, especially when the table has many partitions, while we only need to check a few recent partitions.
I have a few questions about this use case:
- Is there any more efficient way to do this that I am missing?
- If not, can I propose something like
partitions_filtered(partition_filter) to get the specific partition by reusing the work
|
def plan_files(self) -> Iterable[FileScanTask]: |
Thanks!