fix: fix read parquert file when schema change #1750

chenzl25 · 2025-10-16T12:49:13Z

Which issue does this PR close?

Resolve bug: failed to read iceberg table after adding new columns #1751

What changes are included in this PR?

Updated the logic in ArrowReader::get_arrow_projection_mask to allow missing columns in the Parquet file, skipping them instead of returning an error. Missing columns are now gracefully skipped during projection, and the RecordBatchTransformer adds them later with NULL/default values

Are these changes tested?

Testing schema evolution:

- Added an async test test_schema_evolution_add_column to verify that reading an old Parquet file (with only column 'a') using a newer schema (with columns 'a' and 'b') works as expected. The test checks that missing columns are filled with NULLs and the original data is preserved.

liurenjie1024

Thanks @chenzl25 for this fix!

chenzl25 and others added 3 commits October 16, 2025 20:45

fix

3f87ebc

fmt

58cba6b

Merge branch 'main' into dylan/fix_read_parquet_file_when_schema_change

2df7877

liurenjie1024 approved these changes Oct 17, 2025

View reviewed changes

liurenjie1024 merged commit fa07ec6 into apache:main Oct 17, 2025
16 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: fix read parquert file when schema change #1750

fix: fix read parquert file when schema change #1750

Uh oh!

chenzl25 commented Oct 16, 2025 •

edited

Loading

Uh oh!

liurenjie1024 left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

fix: fix read parquert file when schema change #1750

fix: fix read parquert file when schema change #1750

Uh oh!

Conversation

chenzl25 commented Oct 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

What changes are included in this PR?

Are these changes tested?

Uh oh!

liurenjie1024 left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

chenzl25 commented Oct 16, 2025 •

edited

Loading