Iceberg partitions #839

Merged
merged 5 commits into from
Apr 25, 2024

thorfour
Contributor

Adds support to the Iceberg storage for filtering at the manifest level.

This also includes a change unrelated to Iceberg, in commit 319be90, which lets the filter Eval function ignore missing columns while filtering.

@thorfour thorfour requested a review from asubiotto April 25, 2024 16:46
Using the latest iceberg-go version, we can now filter
at the manifest layer using partitions.
Perform a query that contains a column that doesn't exist in the
manifest partition
Missing columns are typically treated as a null column. However, when
filtering on an Iceberg manifest partition, missing columns cannot be
treated as null because the partition only contains the columns that
are configured in the partition spec.

So when filtering manifests we treat missing columns as a positive hit,
since we can't make a negative decision based on a column not being included.
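
To illustrate the behaviour described above, here is a minimal sketch in Go. It is not the code from this PR: `Partition`, `EqFilter`, `KeepManifests` and the `ignoreMissingCols` parameter are hypothetical names chosen for the example, and the real filter Eval function may have a different signature. The sketch only assumes that a manifest partition exposes values for the columns in the partition spec and nothing else.

```go
package main

import "fmt"

// Partition is a hypothetical stand-in for a manifest's partition values.
// It only has entries for columns named in the partition spec.
type Partition map[string]string

// EqFilter is a hypothetical predicate of the form column == value.
type EqFilter struct {
	Column string
	Value  string
}

// Eval reports whether data under the partition may match the filter.
func (f EqFilter) Eval(p Partition, ignoreMissingCols bool) bool {
	v, ok := p[f.Column]
	if !ok {
		// With null semantics a missing column never equals a value,
		// so the result would be false. When ignoring missing columns
		// we cannot rule the partition out, so we keep it.
		return ignoreMissingCols
	}
	return v == f.Value
}

// KeepManifests returns the indexes of the partitions that every filter
// accepts, treating missing columns as a positive hit.
func KeepManifests(parts []Partition, filters []EqFilter) []int {
	kept := []int{}
	for i, p := range parts {
		match := true
		for _, f := range filters {
			if !f.Eval(p, true) {
				match = false
				break
			}
		}
		if match {
			kept = append(kept, i)
		}
	}
	return kept
}

func main() {
	// Both manifests are partitioned by "tenant" only.
	parts := []Partition{{"tenant": "a"}, {"tenant": "b"}}
	// The query also filters on "region", which is not in the partition spec.
	filters := []EqFilter{
		{Column: "tenant", Value: "a"},
		{Column: "region", Value: "eu"},
	}
	fmt.Println(KeepManifests(parts, filters))
}
```

In this sketch the `region` predicate cannot exclude any manifest because `region` is not a partition column, so pruning is decided by `tenant` alone. The `region` predicate would still have to be applied later, when the data files themselves are read.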
@thorfour thorfour merged commit d0d91b6 into main Apr 25, 2024
8 checks passed
@thorfour thorfour deleted the iceberg-partitions branch April 25, 2024 17:04

2 participants