Add a by-benchmark longitudinal plot by mdboom · Pull Request #429 · faster-cpython/bench_runner

mdboom · 2025-04-29T16:36:22Z

This produces by-benchmark longitudinal plots, for example:

(Image: Plot by benchmark)

Copilot

Pull Request Overview

This PR adds support for generating benchmark longitudinal plots to show performance changes over time per benchmark. Key changes include:

Updates to documentation in README files to introduce the new longitudinal plot.
Modifications in generate_results.py to include the benchmark longitudinal plot in the plot generation process.
Implementation of the new benchmark_longitudinal_plot function and its configuration in plot.py.

Reviewed Changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 1 comment.

File	Description
bench_runner/templates/README.md	Documentation update to add a link to the new longitudinal plot.
bench_runner/scripts/generate_results.py	Updated to include benchmark_longitudinal_plot plot generation.
bench_runner/plot.py	Added new function and configuration for benchmark longitudinal plots.
README.md	Updated documentation sections to include benchmark longitudinal plot.

Comments suppressed due to low confidence (1)

bench_runner/plot.py:561

Ensure that 'r.flags' is consistently sorted prior to comparing with 'cfg["head_flags"]' so that order discrepancies do not lead to unexpected mismatches; consider sorting both values or using set comparison if order is not significant.

if r.version.startswith(cfg["version"]) and r.flags == cfg["head_flags"]:

Copilot · 2025-04-29T16:37:02Z

+
+    base = None
+    for r in results:
+        if r.version == cfg["base"] and r.flags == cfg["base_flags"]:


Consider ensuring that 'r.flags' is consistently sorted before comparison with 'cfg["base_flags"]' to guarantee reliable equality checking; if the order is not guaranteed, using a sorted version or a set comparison might be more robust.

Suggested change

if r.version == cfg["base"] and r.flags == cfg["base_flags"]:

if r.version == cfg["base"] and set(r.flags) == set(cfg["base_flags"]):
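To illustrate the point of the suggestion, here is a minimal sketch of how list equality and set equality differ when flags arrive in a different order. The `Result` class, the `flags_match` helper, and the sample version and flag values are hypothetical stand-ins, not bench_runner's actual types or data; only the `cfg["base"]` and `cfg["base_flags"]` keys come from the diff above.

```python
from dataclasses import dataclass, field


@dataclass
class Result:
    # Hypothetical stand-in for a bench_runner result object.
    version: str
    flags: list[str] = field(default_factory=list)


def flags_match(a, b):
    """Return True if both flag lists hold the same flags, ignoring order.

    Note: converting to sets also ignores duplicates.
    """
    return set(a) == set(b)


# Hypothetical config and result; the flag names are made up for illustration.
cfg = {"base": "3.13.0", "base_flags": ["JIT", "NOGIL"]}
r = Result(version="3.13.0", flags=["NOGIL", "JIT"])

# List equality is order-sensitive, so this comparison fails.
strict = r.version == cfg["base"] and r.flags == cfg["base_flags"]

# Set equality ignores order, so this comparison succeeds.
loose = r.version == cfg["base"] and flags_match(r.flags, cfg["base_flags"])
```

If duplicate flags should count as a difference, comparing `sorted(r.flags) == sorted(cfg["base_flags"])` keeps multiplicity while still ignoring order.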

Yhg1s · 2025-04-29T22:28:21Z

+
+    cfg = get_benchmark_longitudinal_plot_config()
+
+    results = [r for r in results if r.fork == "python" and r.nickname == cfg["runner"]]


Maybe this is better suited for a follow-up PR, but how about making 'runners' a list so we can plot different runners in a single graph?

mdboom · 2025-04-30

Yeah, that's a good idea.
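One way the multi-runner idea could look is sketched below. It assumes a hypothetical `cfg["runners"]` list in place of the single `cfg["runner"]` string, and uses a stand-in `Result` class with made-up runner names; the code in the later "Support multiple runners" commit may differ.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class Result:
    # Hypothetical stand-in for a bench_runner result object.
    fork: str
    nickname: str
    version: str


def group_by_runner(results, runners):
    """Keep upstream ("python" fork) results and group them by runner nickname."""
    wanted = set(runners)
    grouped = defaultdict(list)
    for r in results:
        if r.fork == "python" and r.nickname in wanted:
            grouped[r.nickname].append(r)
    return dict(grouped)


# Hypothetical config: "runners" is a list rather than a single "runner".
cfg = {"runners": ["linux", "darwin"]}

results = [
    Result(fork="python", nickname="linux", version="3.14.0a1"),
    Result(fork="python", nickname="darwin", version="3.14.0a1"),
    Result(fork="python", nickname="windows", version="3.14.0a1"),
    Result(fork="someone", nickname="linux", version="3.14.0a1"),
]

# One list of results per configured runner, ready to plot as separate series.
by_runner = group_by_runner(results, cfg["runners"])
```

Grouping by nickname up front lets each runner be drawn as its own line on a shared axis.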

Add a by-benchmark longitudinal plot

d4f283f

mdboom requested review from Yhg1s and Copilot April 29, 2025 16:36

Copilot AI reviewed Apr 29, 2025

mdboom added 2 commits April 29, 2025 13:18

Fix tests

52f948d

Make plot skippable

4e9f4e0

Yhg1s approved these changes Apr 29, 2025

mdboom added 2 commits April 30, 2025 09:29

Be tighter about excluding insignificant results

af13ef6

Support multiple runners

8bfd244

Yhg1s approved these changes Apr 30, 2025

Fix tests

c816981

mdboom merged commit cb183f8 into faster-cpython:main May 1, 2025
6 checks passed

mdboom merged 6 commits into faster-cpython:main from mdboom:benchmark_longitudinal

3 participants
