Leverage Iceberg-Rust for all the transforms #1833

Fokko · 2025-03-23T20:42:46Z

Rationale for this change

Testing the use of Iceberg-Rust for all of the transforms. I think there is a rounding error, see apache/iceberg-rust#1128

Closes #1591
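For readers unfamiliar with the kind of rounding issue mentioned above, here is a minimal sketch of how an hour transform can disagree for timestamps before the epoch. It assumes the transform is "hours since 1970-01-01" on microsecond timestamps. The function names are hypothetical and are not taken from PyIceberg or Iceberg-Rust, and this is not necessarily the exact bug fixed in the linked PR.

```python
# Hypothetical illustration: floor division versus truncation toward zero
# when converting microseconds-since-epoch into hours-since-epoch.
MICROS_PER_HOUR = 3_600_000_000


def hours_floor(micros: int) -> int:
    """Round toward negative infinity (Python's // operator)."""
    return micros // MICROS_PER_HOUR


def hours_truncated(micros: int) -> int:
    """Round toward zero, as integer division does in many languages."""
    return int(micros / MICROS_PER_HOUR)


# One microsecond before the epoch belongs to hour -1 when flooring,
# but truncation places it in hour 0.
print(hours_floor(-1))       # -1
print(hours_truncated(-1))   # 0
```

The two variants agree for non-negative timestamps, so a difference like this only shows up in tests that cover pre-1970 values.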

Are these changes tested?

Are there any user-facing changes?


tests/table/test_partitioning.py

pyiceberg/transforms.py


kevinjqliu

LGTM! I fixed CI and added pyiceberg-core to the pyarrow install group for poetry

[tool.poetry.extras]
pyarrow = ["pyarrow", "pyiceberg-core"]
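As a rough sketch of why bundling the dependency into the extra avoids confusion: code that relies on an optional package usually has to check for it and point the user at the right install group. The helper below is hypothetical and is not PyIceberg's actual implementation. The import name `pyiceberg_core` is an assumption about how the `pyiceberg-core` distribution is imported.

```python
import importlib.util


def require_module(module_name: str, extra: str) -> None:
    """Raise a helpful error if an optional top-level module is not installed.

    Hypothetical helper for illustration only.
    """
    if importlib.util.find_spec(module_name) is None:
        raise ModuleNotFoundError(
            f"{module_name} is required for this operation; "
            f'install it with: pip install "pyiceberg[{extra}]"'
        )


# Assumed import name for the pyiceberg-core distribution.
try:
    require_module("pyiceberg_core", "pyarrow")
    print("pyiceberg_core is available")
except ModuleNotFoundError as exc:
    print(exc)
```

With `pyiceberg-core` listed under the `pyarrow` extra, anyone who installs that extra would not hit such an error when writing to a partitioned table.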

Fokko · 2025-06-07T18:28:16Z

pyproject.toml

@@ -289,7 +289,7 @@ generate-setup-file = false
 script = "build-module.py"

 [tool.poetry.extras]
-pyarrow = ["pyarrow"]
+pyarrow = ["pyarrow", "pyiceberg-core"]


Technically this is not required, because we only use the transforms when writing to a partitioned table. But I think it might cause a lot of confusion if we don't do this.

Agreed, I was considering this too and came to the same conclusion :)

Also I think the pyarrow_transform functions are used on the read path too

iceberg-python/pyiceberg/io/pyarrow.py

Line 2696 in a67c559

name, partition.transform.pyarrow_transform(source_field.field_type)(arrow_table[source_field.name])

> Also I think the pyarrow_transform functions are used on the read path too

For completeness, I don't think that's true. `_determine_partitions` is only used by `_dataframe_to_data_files`. When reading, we often have a single value, as in `SELECT * FROM tbl WHERE created_at > '2025-01-01 19:25:00'`, so performance is not that important there, and we use the non-Arrow transform (example for the MonthPartitioning).
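To make the distinction concrete, here is a minimal standard-library sketch of the two usage patterns: transforming a single literal from a filter (read path) versus transforming every row of a column (write path). The function `months_since_epoch` is hypothetical and only stands in for a month transform. It is not the PyIceberg or Iceberg-Rust implementation.

```python
from datetime import datetime


def months_since_epoch(ts: datetime) -> int:
    """Hypothetical month transform: whole months elapsed since 1970-01."""
    return (ts.year - 1970) * 12 + (ts.month - 1)


# Read path: the predicate carries one literal, so a plain scalar call is enough.
literal = datetime(2025, 1, 1, 19, 25, 0)
month_of_literal = months_since_epoch(literal)
print(month_of_literal)  # 660

# Write path: every row needs a partition value, which is where a
# vectorized (Arrow-based) implementation pays off on large tables.
rows = [
    datetime(2024, 12, 31, 23, 59),
    datetime(2025, 1, 1, 0, 0),
    datetime(2025, 2, 15, 12, 0),
]
partitions = [months_since_epoch(row) for row in rows]
print(partitions)  # [659, 660, 661]
```

The per-row loop here is only for illustration. On the write path the cost scales with the number of rows, whereas the read path evaluates the transform once per literal.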

Fokko added 3 commits March 23, 2025 21:40

Test with Iceberg-Rust

bf695e6

Merge branch 'main' of github.com:apache/iceberg-python into fd-use-i…

ae3ba82

…ceberg-rust

Lint

881aecd

Fokko mentioned this pull request Mar 23, 2025

Fix rounding of negative hour transform apache/iceberg-rust#1128

Merged

Fokko added 7 commits March 24, 2025 22:24

WIP

3bc74ee

Merge branch 'main' of github.com:apache/iceberg-python into fd-use-i…

b076077

…ceberg-rust

WIP

16778a7

Merge branch 'main' of github.com:apache/iceberg-python into fd-use-i…

4999aae

…ceberg-rust

Bump

059e067

Merge branch 'main' of github.com:apache/iceberg-python into fd-use-i…

2f56e2f

…ceberg-rust

Less is more!

0d8ec94

kevinjqliu reviewed Mar 27, 2025


tests/table/test_partitioning.py

tests/table/test_partitioning.py

pyiceberg/transforms.py (outdated)

Fokko changed the title ~~Test with Iceberg-Rust~~ Leverage Iceberg-Rust for all the transforms Mar 28, 2025

kevinjqliu mentioned this pull request Apr 1, 2025

feat: Support `TimestampNs` and `TimestampTzNs` in bucket transform apache/iceberg-rust#1150

Merged

Fokko and others added 5 commits June 4, 2025 20:16

Merge branch 'main' of github.com:apache/iceberg-python into fd-use-i…

68a0dae

…ceberg-rust

check all transforms

6e445ec

Add poetry.lock now 0.5.1 has been released

3416dc4

Nice

957b7cb

Revert poetry lock

edbac4d

Fokko marked this pull request as ready for review June 6, 2025 09:18

Fokko force-pushed the fd-use-iceberg-rust branch from fab99e3 to edbac4d Compare June 6, 2025 09:19

Cleanup

d4702a4

This was referenced Jun 7, 2025

upgrade pyiceberg-core to 0.5.1 #2067

Closed

[test] Run partition transform tests for all transforms #1592

Closed

kevinjqliu added 3 commits June 7, 2025 09:43

poetry lock

fbc9738

fix error msg

add0a17

pyarrow should also install pyiceberg-core

6c279de

kevinjqliu approved these changes Jun 7, 2025


Fokko commented Jun 7, 2025


kevinjqliu merged commit f507dbd into apache:main Jun 7, 2025
10 checks passed

Fokko deleted the fd-use-iceberg-rust branch June 8, 2025 18:41

Fokko mentioned this pull request Jun 13, 2025

DayTransform issues with pyarrow timestamp[ms] #1980

Closed
