[python] Support value stats with truncate mode by default by XiaoHongbo-Hope · Pull Request #7701 · apache/paimon

XiaoHongbo-Hope · 2026-04-26T10:24:56Z

Purpose

Python-written append tables have no value stats in data files, preventing predicate pushdown from skipping irrelevant files during upsert-by-key reads. This PR enables default value stats for append table pruning. A follow-up PR will use these stats in the upsert_by_key lookup path.

Skip us/ns/tz timestamps: _serialize_timestamp only supports ms precision (8-byte millis). Java's TIMESTAMP(4-9) uses a compound millis+nanos format that requires a different serialization path. Timezone is also not yet supported in serialization.

Tests

JingsongLi

The binary column in Java does not generate min/max stats. If Java reads the manifest written in Python and pushes predicates down to the binary column, it may result in incorrect file skipping.

XiaoHongbo-Hope · 2026-05-14T15:55:22Z

The binary column in Java does not generate min/max stats. If Java reads the manifest written in Python and pushes predicates down to the binary column, it may result in incorrect file skipping.

Thanks, fixed

SteNicholas · 2026-05-15T06:51:41Z

                max_seq_number=max_seq_number(),
-                options=options)
+                options=options,
+                write_cols=self.write_cols)


The Java KeyValueDataFileWriter does not support writeCols, therefore is it necessary to pass write_cols?

The Java KeyValueDataFileWriter does not support writeCols, therefore is it necessary to pass write_cols?

Removed

JingsongLi

We should not enable this stats by default. Ideally, these statistical information should be obtained from the format.

XiaoHongbo-Hope · 2026-05-18T11:28:49Z

We should not enable this stats by default. Ideally, these statistical information should be obtained from the format.

Updated default config as none

XiaoHongbo-Hope marked this pull request as draft April 26, 2026 10:25

XiaoHongbo-Hope marked this pull request as ready for review May 7, 2026 06:49

XiaoHongbo-Hope requested a review from JingsongLi May 7, 2026 11:49

XiaoHongbo-Hope marked this pull request as draft May 7, 2026 12:52

XiaoHongbo-Hope marked this pull request as ready for review May 8, 2026 10:26

JingsongLi requested changes May 14, 2026

View reviewed changes

XiaoHongbo-Hope force-pushed the support_value_stats branch 5 times, most recently from 488ae1c to 23134b3 Compare May 15, 2026 02:30

SteNicholas reviewed May 15, 2026

View reviewed changes

XiaoHongbo-Hope marked this pull request as draft May 15, 2026 08:27

XiaoHongbo-Hope marked this pull request as ready for review May 15, 2026 08:29

JingsongLi requested changes May 18, 2026

View reviewed changes

XiaoHongbo-Hope force-pushed the support_value_stats branch 2 times, most recently from 16e4f9a to 90efd97 Compare May 18, 2026 10:17

[python] Support value stats with truncate mode by default

c1a4ac0

XiaoHongbo-Hope force-pushed the support_value_stats branch from 90efd97 to c1a4ac0 Compare May 18, 2026 10:20

fix REST test: explicitly set metadata.stats-mode for value stats test

08632b0

XiaoHongbo-Hope closed this May 18, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[python] Support value stats with truncate mode by default#7701

[python] Support value stats with truncate mode by default#7701
XiaoHongbo-Hope wants to merge 2 commits into
apache:masterfrom
XiaoHongbo-Hope:support_value_stats

XiaoHongbo-Hope commented Apr 26, 2026 •

edited

Loading

Uh oh!

JingsongLi left a comment

Uh oh!

XiaoHongbo-Hope commented May 14, 2026

Uh oh!

SteNicholas May 15, 2026 •

edited

Loading

Uh oh!

XiaoHongbo-Hope May 15, 2026

Uh oh!

JingsongLi left a comment

Uh oh!

XiaoHongbo-Hope commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

XiaoHongbo-Hope commented Apr 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Purpose

Tests

Uh oh!

JingsongLi left a comment

Choose a reason for hiding this comment

Uh oh!

XiaoHongbo-Hope commented May 14, 2026

Uh oh!

SteNicholas May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

XiaoHongbo-Hope May 15, 2026

Choose a reason for hiding this comment

Uh oh!

JingsongLi left a comment

Choose a reason for hiding this comment

Uh oh!

XiaoHongbo-Hope commented May 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

XiaoHongbo-Hope commented Apr 26, 2026 •

edited

Loading

SteNicholas May 15, 2026 •

edited

Loading