Allow file_row_number with parquet schema option #9290

Merged
merged 5 commits into from Oct 19, 2023

Conversation

samansmink
Contributor

@samansmink samansmink commented Oct 10, 2023

Building on @lnkuiper's earlier work in #9123, this PR should complete the changes required in DuckDB to support scanning with the Iceberg schema from the iceberg extension.

I've tested this feature with a dev branch of the iceberg extension, and it seems to work well: we can now properly read the Iceberg schema, which may differ from the Parquet schema because of schema evolution. This enables us to support schema evolution in Iceberg.
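To illustrate the idea for readers, here is a minimal conceptual sketch in plain Python. It is not DuckDB or iceberg extension code. It assumes that columns are matched between the table schema and the file by field id, that a column missing from the file is filled with a default, and that the row number is the 0-based position in the file. The names `parquet_columns`, `table_schema`, and `scan_with_schema` are hypothetical.

```python
# Hypothetical stand-in for one Parquet file: field id -> (stored name, values).
parquet_columns = {
    1: ("i", [10, 20, 30]),
    2: ("j", ["a", "b", "c"]),
}

# Hypothetical table schema after evolution: field 1 was renamed,
# field 2 was dropped, and field 3 was added with a default value.
table_schema = {
    1: {"name": "renamed_i", "default": None},
    3: {"name": "new_col", "default": 0},
}


def scan_with_schema(columns, schema, file_row_number=False):
    """Project file columns onto the table schema by field id (sketch only)."""
    num_rows = len(next(iter(columns.values()))[1]) if columns else 0
    rows = []
    for row_idx in range(num_rows):
        row = {}
        for field_id, spec in schema.items():
            if field_id in columns:
                # Field exists in the file: read it under the table's name.
                row[spec["name"]] = columns[field_id][1][row_idx]
            else:
                # Field was added after the file was written: use the default.
                row[spec["name"]] = spec["default"]
        if file_row_number:
            # Assumption: the row number is the 0-based position in the file.
            row["file_row_number"] = row_idx
        rows.append(row)
    return rows


rows = scan_with_schema(parquet_columns, table_schema, file_row_number=True)
for row in rows:
    print(row)
```

In this sketch the row number is appended after the schema projection, so it does not need a field id of its own; whether DuckDB implements it this way is not stated in this thread.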

TODOs

  • support nested types

@github-actions github-actions bot marked this pull request as draft October 10, 2023 11:26
@samansmink samansmink marked this pull request as ready for review October 10, 2023 11:26
Contributor

@lnkuiper lnkuiper left a comment

Looks good! I just have one remark:

extension/parquet/parquet_extension.cpp (resolved)
@github-actions github-actions bot marked this pull request as draft October 18, 2023 12:33
@samansmink samansmink marked this pull request as ready for review October 18, 2023 12:55
@Mytherin Mytherin merged commit 2f4425c into duckdb:feature Oct 19, 2023
45 checks passed
@Mytherin
Collaborator

Thanks! LGTM

3 participants