Skip to content

[python] Support blob-external-storage-path for writing BLOB data to external storage#7967

Merged
JingsongLi merged 1 commit into
apache:masterfrom
zhoulii:feature/pypaimon-blob-external-storage
May 26, 2026
Merged

[python] Support blob-external-storage-path for writing BLOB data to external storage#7967
JingsongLi merged 1 commit into
apache:masterfrom
zhoulii:feature/pypaimon-blob-external-storage

Conversation

@zhoulii
Copy link
Copy Markdown
Member

@zhoulii zhoulii commented May 26, 2026

Purpose

close #7966

Tests

Copy link
Copy Markdown
Contributor

@leaves12138 leaves12138 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for the update. I reviewed the latest version again.

The external-storage writer now matches the Java-side behavior well: the configured external fields are validated against BLOB descriptor fields, the writer rewrites descriptor fields before normal data-file writing, and the offset/length calculation follows the blob format layout. The added tests cover validation, round-trip reads, multiple fields, nulls, mixed inline/external BLOBs, and descriptor input rewriting.

I also ran the focused Python test locally:

PYTHONPATH=. python -m unittest pypaimon.tests.external_storage_blob_test

It passed. LGTM.

@JingsongLi
Copy link
Copy Markdown
Contributor

+1

@JingsongLi JingsongLi merged commit 52a362c into apache:master May 26, 2026
6 checks passed
@zhoulii zhoulii deleted the feature/pypaimon-blob-external-storage branch May 26, 2026 10:13
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[Feature] adds support for blob-external-storage-path in pypaimon

3 participants