perf: sort replace free()->try_grow() pattern with try_resize() to reduce memory pool interactions by mbutrovich · Pull Request #20729 · apache/datafusion

mbutrovich · 2026-03-05T17:55:10Z

Which issue does this PR close?

Closes perf: Sort memory reservation causes performance regression with custom MemoryPool implementations #20728.

Rationale for this change

See full discussion in #20728, but the tl;dr is:

Commit 3f0b342 ("Improve sort memory resilience", #19494) added per-chunk reservation.try_grow(total_sorted_size) calls inside the sort's unfold closure (sort.rs:748-750). I believe this causes a performance regression for projects that implement custom MemoryPool backends where try_grow is not trivially cheap.

What changes are included in this PR?

Replace the pattern of free()->try_grow() with a try_resize() that in the common case doesn't do much work since a sorted output batch is likely the same size as the input batch.

Are these changes tested?

Existing tests, and we are running some longer benchmarks with Comet at the moment.

When I profile TPC-H Q21 SF100 locally I see the giant stack of memory allocations that occur on DF52...

...completely disappear...

... and TPC-H Q21 looks back to DF51 performance, at least locally.

Are there any user-facing changes?

No.

…educe memory pool interactions.

EmilyMatt

LGTM, Code also looks much cleaner!

andygrove

Thanks for the fix @mbutrovich!

comphead · 2026-03-05T20:48:07Z

Thanks @mbutrovich I just checked that try_resize doesn't do the same free+try_grow

Reg to memory fragmentation resize should be at least not worse than free+grow

…size() to reduce memory pool interactions (#20732) Backport #20729 to `branch-52`.

…size() to reduce memory pool interactions (#20733) Backport #20729 to `branch-53`.

Replace reservation.free()->try_grow() pattern with try_resize() to r…

bb030f5

…educe memory pool interactions.

github-actions bot added the physical-plan Changes to the physical-plan crate label Mar 5, 2026

mbutrovich changed the title ~~perf: sort replace reservation.free()->try_grow() pattern with try_resize() to reduce memory pool interactions~~ perf: sort replace free()->try_grow() pattern with try_resize() to reduce memory pool interactions Mar 5, 2026

mbutrovich mentioned this pull request Mar 5, 2026

perf: Sort memory reservation causes performance regression with custom MemoryPool implementations #20728

Closed

mbutrovich marked this pull request as ready for review March 5, 2026 19:02

mbutrovich mentioned this pull request Mar 5, 2026

Release DataFusion 52.3.0 (minor/) Release (Mar 2026) #20681

Open

12 tasks

EmilyMatt approved these changes Mar 5, 2026

View reviewed changes

mbutrovich requested a review from andygrove March 5, 2026 19:14

mbutrovich mentioned this pull request Mar 5, 2026

Release DataFusion 53.0.0 (Feb 2026 / Mar 2026) #19692

Open

26 tasks

mbutrovich added the performance Make DataFusion faster label Mar 5, 2026

mbutrovich requested a review from comphead March 5, 2026 19:47

andygrove approved these changes Mar 5, 2026

View reviewed changes

mbutrovich added this pull request to the merge queue Mar 5, 2026

This was referenced Mar 5, 2026

[branch-52] perf: sort replace free()->try_grow() pattern with try_resize() to reduce memory pool interactions #20732

Merged

[branch-53] perf: sort replace free()->try_grow() pattern with try_resize() to reduce memory pool interactions #20733

Merged

Merged via the queue into apache:main with commit 631c918 Mar 5, 2026
39 checks passed

mbutrovich deleted the sort_mem branch March 5, 2026 20:43

mbutrovich added a commit that referenced this pull request Mar 5, 2026

[branch-52] perf: sort replace free()->try_grow() pattern with try_re…

9797095

…size() to reduce memory pool interactions (#20732) Backport #20729 to `branch-52`.

mbutrovich added a commit that referenced this pull request Mar 6, 2026

[branch-53] perf: sort replace free()->try_grow() pattern with try_re…

3574960

…size() to reduce memory pool interactions (#20733) Backport #20729 to `branch-53`.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: sort replace free()->try_grow() pattern with try_resize() to reduce memory pool interactions#20729

perf: sort replace free()->try_grow() pattern with try_resize() to reduce memory pool interactions#20729
mbutrovich merged 1 commit intoapache:mainfrom
mbutrovich:sort_mem

mbutrovich commented Mar 5, 2026 •

edited

Loading

Uh oh!

EmilyMatt left a comment

Uh oh!

andygrove left a comment

Uh oh!

Uh oh!

comphead commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

mbutrovich commented Mar 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

Uh oh!

EmilyMatt left a comment

Choose a reason for hiding this comment

Uh oh!

andygrove left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

comphead commented Mar 5, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

mbutrovich commented Mar 5, 2026 •

edited

Loading