Add a helper script to bisect a regression by mdboom · Pull Request #430 · faster-cpython/bench_runner

mdboom · 2025-04-30T14:27:55Z

A helper script to bisect a regression, usage:

Usage: __main__.py [-h] [--pgo] [--flags FLAGS] benchmark good bad

Run bisect on a benchmark to find the first regressing commit. A full checkout of CPython should be in the cpython
directory. If it doesn't exist, it will be cloned.

Positional Arguments:
  benchmark      The benchmark to run bisect on.
  good           The good commit hash for the bisect.
  bad            The bad commit hash for the bisect.

Options:
  -h, --help     show this help message and exit
  --pgo
  --flags FLAGS

Follow-on (but not part of this PR) will be to connect it to GitHub Actions so users can easily kick it off on one of the runners.

Copilot

Pull Request Overview

This PR adds a helper script for bisecting regressions in CPython by automating the checkout, build, and benchmark run steps. Key changes include:

Introducing a new helper function (format_seconds) for human-readable time formatting.
Updating compile_unix to add a reconfigure flag.
Implementing the bisect script that automates bisect operations and logs results.

Reviewed Changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
bench_runner/util.py	Added format_seconds for time formatting with a loop using magic numbers.
bench_runner/scripts/workflow.py	Updated compile_unix signature with a reconfigure flag to clean builds appropriately.
bench_runner/scripts/bisect.py	Implemented the bisect flow including result parsing, logging, and git bisect automation.
bench_runner/git.py	Added a simple checkout helper function.
bench_runner/main.py	Registered bisect as a new command in the command entry point.

Comments suppressed due to low confidence (1)

bench_runner/scripts/bisect.py:206

Parsing the 'pgo' argument by comparing against the string "True" can be error-prone. Consider converting the argument to a boolean explicitly during argument parsing for clarity.

timing = get_result(args.benchmark, args.pgo == "True", args.flags, cpython=cpython)

Copilot · 2025-04-30T14:29:45Z

+    for i in range(2, -9, -1):
+        if value >= 10.0**i:
+            break
+    else:
+        i = -9


The use of magic numbers in the loop (range from 2 to -9) reduces readability. Consider defining named constants or adding inline comments to clarify the purpose of these bounds.

Suggested change

for i in range(2, -9, -1):

if value >= 10.0**i:

break

else:

i = -9

MAX_POWER_OF_TEN = 2 # Maximum power of ten to check (10^2 = 100 seconds)

MIN_POWER_OF_TEN = -9 # Minimum power of ten to check (10^-9 = 1 nanosecond)

for i in range(MAX_POWER_OF_TEN, MIN_POWER_OF_TEN - 1, -1):

if value >= 10.0**i:

break

else:

i = MIN_POWER_OF_TEN

mdboom · 2025-04-30T15:39:27Z


    sys.argv = [sys.argv[0], *sys.argv[2:]]
-    mod = importlib.import_module(f"bench_runner.scripts.{command}")
+    mod = importlib.import_module(f".{command}", "bench_runner.scripts")


This is just due to picking the unfortunate name bisect that conflicts with the name of a stdlib module and not being careful about how things are run elsewhere.

Yhg1s · 2025-05-02T21:58:20Z

+    print("Bisect log:")
+
+    with get_log_file().open("r") as f:
+        for line in f.readlines():


Is there a reason to use readlines instead of just iterating over the file?

Suggested change

for line in f.readlines():

for line in f:

Yhg1s · 2025-05-02T22:06:13Z

    with contextlib.chdir(cpython):
-        subprocess.check_call(["./configure", *args], env=env)
+        if reconfigure:
+            subprocess.check_call(["./configure", *args], env=env)


Even if you don't explicitly run configure, the Makefile may decide to rerun configure when jumping through revisions... It's probably a good idea to pass -C (or --config-cache) to configure to significantly reduce configure time.

Yhg1s · 2025-05-02T22:08:37Z


 COMMANDS = {
    "backfill": "Schedule benchmarking a number of commits",
+    "bisect": "Run a bisect to find the commit that caused a regression",


Maybe add a comment that it won't work on Windows?

Yhg1s · 2025-05-02T22:09:59Z

+
+
+if __name__ == "__main__":
+    # This is the entry point when we are called from `git bisect run` itself


This feels a little magical and I would've probably just made this a separate script instead, but as long as it works...

Add a helper script to bisect a regression

09760ca

mdboom requested review from Yhg1s and Copilot April 30, 2025 14:27

Copilot AI reviewed Apr 30, 2025

View reviewed changes

mdboom added 2 commits April 30, 2025 10:46

Lint

ec0eed3

Fix tests

faa685f

mdboom commented Apr 30, 2025

View reviewed changes

Yhg1s approved these changes May 2, 2025

View reviewed changes

mdboom merged commit dfaf4ea into faster-cpython:main May 5, 2025
6 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add a helper script to bisect a regression#430

Add a helper script to bisect a regression#430
mdboom merged 3 commits intofaster-cpython:mainfrom
mdboom:bisect

mdboom commented Apr 30, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Copilot AI Apr 30, 2025

Uh oh!

mdboom Apr 30, 2025

Uh oh!

Yhg1s May 2, 2025

Uh oh!

Yhg1s May 2, 2025

Uh oh!

Yhg1s May 2, 2025

Uh oh!

Yhg1s May 2, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

-    for i in range(2, -9, -1):
-        if value >= 10.0**i:
-            break
-    else:
-        i = -9
+    MAX_POWER_OF_TEN = 2  # Maximum power of ten to check (10^2 = 100 seconds)
+    MIN_POWER_OF_TEN = -9  # Minimum power of ten to check (10^-9 = 1 nanosecond)
+    for i in range(MAX_POWER_OF_TEN, MIN_POWER_OF_TEN - 1, -1):
+        if value >= 10.0**i:
+            break
+    else:
+        i = MIN_POWER_OF_TEN



		if __name__ == "__main__":
		# This is the entry point when we are called from `git bisect run` itself

Conversation

mdboom commented Apr 30, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Copilot AI Apr 30, 2025

Choose a reason for hiding this comment

Uh oh!

mdboom Apr 30, 2025

Choose a reason for hiding this comment

Uh oh!

Yhg1s May 2, 2025

Choose a reason for hiding this comment

Uh oh!

Yhg1s May 2, 2025

Choose a reason for hiding this comment

Uh oh!

Yhg1s May 2, 2025

Choose a reason for hiding this comment

Uh oh!

Yhg1s May 2, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants