
perf: add spark.comet.exec.shuffle.maxBufferedBatches config#3800

Closed
andygrove wants to merge 2 commits into apache:main from andygrove:shuffle-max-buffered-batches

Conversation

@andygrove
Member

@andygrove andygrove commented Mar 26, 2026

Which issue does this PR close?

Closes #.

Rationale for this change

When shuffle spills only when the memory pool is exhausted, peak memory usage on executors can be very high — especially with many concurrent tasks. Spilling earlier, before memory pressure is critical, reduces peak memory at the cost of slightly more disk I/O.

What changes are included in this PR?

  • Adds spark.comet.exec.shuffle.maxBufferedBatches config (default 0 = disabled). When set, the native shuffle repartitioner spills once it has buffered this many batches, rather than waiting for the memory pool to refuse an allocation.
  • Fixes a file descriptor leak: spill files are now closed after each spill event and reopened in append mode for the next, so FD usage is proportional to active writes rather than to the number of partitions that have ever spilled.
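A minimal sketch of the early-spill trigger described above. The type and field names here (`PartitionBuffer`, `buffered_batches`, `max_buffered_batches`) are illustrative, not the actual Comet repartitioner types; the only behaviour taken from the PR is that 0 disables the check and that spilling triggers once the buffered-batch count reaches the configured limit.

```rust
// Hypothetical sketch of the batch-count spill trigger; names are
// illustrative, not Comet's real internals.
struct PartitionBuffer {
    buffered_batches: usize,
    // 0 = disabled: spill only when the memory pool is exhausted.
    max_buffered_batches: usize,
}

impl PartitionBuffer {
    /// Returns true when the opt-in batch limit is set and reached,
    /// i.e. the buffer should spill before any memory-pool pressure.
    fn should_spill_early(&self) -> bool {
        self.max_buffered_batches > 0
            && self.buffered_batches >= self.max_buffered_batches
    }
}

fn main() {
    // Default (0) preserves existing behaviour: never spill early.
    let disabled = PartitionBuffer { buffered_batches: 100, max_buffered_batches: 0 };
    assert!(!disabled.should_spill_early());

    // With a limit set, reaching it triggers an early spill.
    let capped = PartitionBuffer { buffered_batches: 8, max_buffered_batches: 8 };
    assert!(capped.should_spill_early());
    println!("ok");
}
```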

How are these changes tested?

Existing shuffle tests cover the spill path. The new config defaults to 0 (disabled), so no existing behaviour changes without opt-in.

Add a new configuration option to limit the number of batches buffered
in memory before spilling during native shuffle. Setting a small value
causes earlier spilling, reducing peak memory usage on executors at the
cost of more disk I/O. The default of 0 preserves existing behavior
(spill only when the memory pool is exhausted).

Also fix a too-many-open-files issue where each partition held one spill
file descriptor open for the lifetime of the task. The spill file is now
closed after each spill event and reopened in append mode for the next,
keeping FD usage proportional to active writes rather than total partitions.
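The FD fix can be sketched as follows. This is not the actual Comet code; it only illustrates the pattern the commit message describes: open the spill file just for the duration of a spill event, in append mode, and let the descriptor close when the handle is dropped, so later spills extend the same file without holding it open.

```rust
use std::fs::OpenOptions;
use std::io::Write;
use std::path::Path;

/// Append one spill's worth of bytes to `path`. The file handle is
/// created inside this function and dropped (closed) on return, so no
/// descriptor outlives the spill event.
fn spill_batch(path: &Path, bytes: &[u8]) -> std::io::Result<()> {
    let mut f = OpenOptions::new()
        .create(true)  // first spill creates the file
        .append(true)  // later spills extend it
        .open(path)?;
    f.write_all(bytes)
}

fn main() -> std::io::Result<()> {
    let path = std::env::temp_dir().join("comet_spill_demo.bin");
    let _ = std::fs::remove_file(&path);

    spill_batch(&path, b"batch-1;")?;
    // Second spill reopens the same file in append mode.
    spill_batch(&path, b"batch-2;")?;

    let contents = std::fs::read(&path)?;
    assert_eq!(contents, b"batch-1;batch-2;");
    std::fs::remove_file(&path)
}
```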
@andygrove
Member Author

This does not work in practice

@andygrove andygrove closed this Mar 27, 2026
@andygrove andygrove deleted the shuffle-max-buffered-batches branch March 27, 2026 15:51
