[SPARK-56244][PYTHON][TEST][FOLLOWUP] Lazy scenario building for accurate peakmem benchmarks by Yicong-Huang · Pull Request #55059 · apache/spark

Yicong-Huang · 2026-03-27T10:50:35Z

What changes were proposed in this pull request?

Refactor all benchmark mixin classes to build scenario data lazily instead of eagerly at import time. Also consolidate duplicated type pool definitions into MockDataFactory.NAMED_TYPE_POOLS.

Why are the changes needed?

Follow-up to #55040. The eager _scenarios = _build_scenarios() pattern pre-built all scenario data at class definition time, inflating peak RSS. Since ASV's peakmem only increases and never decreases, all scenarios showed the same inflated reading regardless of actual benchmark memory usage.

Does this PR introduce any user-facing change?

No.

How was this patch tested?

Ran all existing ASV benchmarks via python/asv run. Time results are consistent with the eager version; peakmem now shows per-scenario differentiation instead of a single inflated value.

Was this patch authored or co-authored using generative AI tooling?

No.

zhengruifeng · 2026-03-30T11:31:26Z

merged to master

Yicong-Huang force-pushed the SPARK-56244/followup/peakmem-gc branch from 3e1f517 to 100bd83 Compare March 27, 2026 10:53

test: lazy scenario building for accurate peakmem benchmarks

f6b32f2

Yicong-Huang force-pushed the SPARK-56244/followup/peakmem-gc branch from 100bd83 to f6b32f2 Compare March 27, 2026 10:56

Yicong-Huang mentioned this pull request Mar 27, 2026

[SPARK-56120][PYTHON][TEST] Add ASV micro-benchmarks for SQL_WINDOW_AGG_ARROW_UDF #55056

Closed

zhengruifeng approved these changes Mar 30, 2026

View reviewed changes

zhengruifeng closed this in 1c807ad Mar 30, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[SPARK-56244][PYTHON][TEST][FOLLOWUP] Lazy scenario building for accurate peakmem benchmarks#55059

[SPARK-56244][PYTHON][TEST][FOLLOWUP] Lazy scenario building for accurate peakmem benchmarks#55059
Yicong-Huang wants to merge 1 commit intoapache:masterfrom
Yicong-Huang:SPARK-56244/followup/peakmem-gc

Yicong-Huang commented Mar 27, 2026

Uh oh!

zhengruifeng commented Mar 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Yicong-Huang commented Mar 27, 2026

What changes were proposed in this pull request?

Why are the changes needed?

Does this PR introduce any user-facing change?

How was this patch tested?

Was this patch authored or co-authored using generative AI tooling?

Uh oh!

zhengruifeng commented Mar 30, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants