feat: optimize projected read buffer copies by QuakeWang · Pull Request #5 · apache/paimon-mosaic

QuakeWang · 2026-05-18T14:11:03Z

Projected reads already avoided unnecessary IO for sparse paged buckets, but the reader still copied buffers at several internal boundaries.

Before this PR:

read_ranges coalesced IO ranges, then copied each logical range into its own Vec<u8>.
Paged projection copied selected slot bytes into an intermediate map before parsing.
parse_column_slot copied the decompressed page payload after parsing its header.

This PR keeps the existing format and read_ranges API, but adds a shared range buffer path for projected reads. The reader now borrows slices from coalesced buffers during parsing, stores paged slot locations instead of slot bytes, and lets ColumnPageReader parse from an offset inside the owned decompressed page buffer.

Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com>

JingsongLi · 2026-05-18T15:01:20Z

Nice Catch!

JingsongLi

+1

Optimize projected read buffer copies

97b7c0c

Signed-off-by: QuakeWang <wangfuzheng0814@foxmail.com>

JingsongLi approved these changes May 19, 2026

View reviewed changes

JingsongLi merged commit c838f62 into apache:main May 19, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: optimize projected read buffer copies#5

feat: optimize projected read buffer copies#5
JingsongLi merged 1 commit into
apache:mainfrom
QuakeWang:feat/read-copy-opt

QuakeWang commented May 18, 2026

Uh oh!

JingsongLi commented May 18, 2026

Uh oh!

JingsongLi left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

QuakeWang commented May 18, 2026

Uh oh!

JingsongLi commented May 18, 2026

Uh oh!

JingsongLi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants