[GLUTEN-10920][VL] Allow disabling hash/sort shuffle reader buffer by wForget · Pull Request #10922 · apache/gluten

wForget · 2025-10-22T05:41:39Z

What changes are proposed in this pull request?

Add buffer to the shuffle read input stream only if readerBufferSize is greater than 0.
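
The conditional described above could be sketched roughly as follows. This is a minimal illustration and not the code from this patch: the class name ShuffleReadBuffering and the method maybeBuffer are hypothetical, and the buffer size is assumed to already be parsed into a byte count (the real conf accepts values such as 1MB).

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.InputStream;

// Hypothetical helper; names are illustrative and not taken from Gluten.
final class ShuffleReadBuffering {
    private ShuffleReadBuffering() {}

    // Assumption: readerBufferSize is a byte count already parsed from the conf.
    // Wraps the stream in a buffer only when the size is positive.
    static InputStream maybeBuffer(InputStream in, int readerBufferSize) {
        if (readerBufferSize > 0) {
            return new BufferedInputStream(in, readerBufferSize);
        }
        // A size of 0 (or less) disables buffering: return the raw stream unchanged.
        return in;
    }

    public static void main(String[] args) {
        InputStream raw = new ByteArrayInputStream(new byte[] {1, 2, 3});
        // With size 0 the very same stream object comes back.
        System.out.println(maybeBuffer(raw, 0) == raw);
        // With a positive size the stream is wrapped in a BufferedInputStream.
        System.out.println(maybeBuffer(raw, 1024 * 1024) instanceof BufferedInputStream);
    }
}
```

Returning the original stream when the size is 0 avoids the extra copy through an intermediate buffer, which is the behavior this change makes available.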

How was this patch tested?

Tested manually with an internal test case.

With spark.gluten.sql.columnar.shuffle.readerBufferSize=1MB:

With spark.gluten.sql.columnar.shuffle.readerBufferSize=0:

Related issue: #10920

zuston

LGTM.

FelixYBW · 2025-10-22T20:06:38Z

@marin-ma why is it an on-heap copy? Shouldn't the reducer use Netty to load data into direct memory?

FelixYBW · 2025-10-22T20:19:46Z

@wForget did you set spark.shuffle.io.preferDirectBufs=False?

zuston · 2025-10-23T01:53:00Z

> @wForget did you set spark.shuffle.io.preferDirectBufs=False?

The issue occurs when using Uniffle, rather than the vanilla shuffle.

wForget · 2025-10-23T02:20:54Z

> @marin-ma why is it an on-heap copy? Shouldn't the reducer use Netty to load data into direct memory?

I filed #10923 for this issue

wForget · 2025-10-23T02:30:24Z

Thanks @zuston @FelixYBW for the review, merged to main

feat: Allow disabling hash/sort shuffle reader buffer

80f37e5

github-actions bot added the VELOX label Oct 22, 2025

wForget requested a review from marin-ma October 22, 2025 05:45

wForget mentioned this pull request Oct 22, 2025

[VL] Performance regression on uniffle hash shuffle reader #10920

Closed

zuston approved these changes Oct 22, 2025

FelixYBW approved these changes Oct 22, 2025

wForget merged commit 7b7ef95 into apache:main Oct 23, 2025
139 of 143 checks passed

PHILO-HE mentioned this pull request Nov 27, 2025

High deserialize time when doing shuffle read #10214

Open

warrenzhu25 pushed a commit to warrenzhu25/gluten that referenced this pull request Jan 10, 2026

[GLUTEN-10920][VL] Allow disabling hash/sort shuffle reader buffer (a…

99f4bb1

…pache#10922)

wForget merged 1 commit into apache:main from wForget:GLUTEN-10920
