
Integrate benchmarking #200

Merged
johannahaffner merged 16 commits into patrick-kidger:dev from johannahaffner:pytest-benchmark on Dec 27, 2025

Conversation

@johannahaffner (Collaborator) commented on Dec 24, 2025

This follows up on #128.

Two small things:

  • The plotting script stays: having the benchmarks produce these plots directly would entail customising the fixture provided by pytest-benchmark, and plotting is something one does every few runs rather than on every run, in particular whenever one wants to compare solvers, or compare implementations with other libraries.
  • We'll continue to require benchmarking runs to specify `pytest benchmarks --benchmark-only`: if we put `--benchmark-only` into benchmarks/conftest.py, it would be picked up whenever pytest is invoked and would override our default `--benchmark-skip` (see the sketch after this list).
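
For concreteness, here is a hypothetical sketch of the default-skip behaviour described in the second bullet; this is illustrative only and not this repository's actual configuration. It assumes pytest-benchmark is installed, since `--benchmark-only` and `--benchmark-skip` are its built-in flags.

```python
# conftest.py -- illustrative only: skip benchmarks unless explicitly requested,
# so that a plain `pytest` run ignores them and
# `pytest benchmarks --benchmark-only` runs them.
import pytest


def pytest_collection_modifyitems(config, items):
    if config.getoption("--benchmark-only"):
        return  # explicit benchmark run: leave everything in place
    skip_benchmarks = pytest.mark.skip(reason="pass --benchmark-only to run benchmarks")
    for item in items:
        # pytest-benchmark tests are the ones requesting the `benchmark` fixture.
        if "benchmark" in getattr(item, "fixturenames", ()):
            item.add_marker(skip_benchmarks)
```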

And one weird thing: if I remove the OrderedDict, the results get jumbled between solvers, even though, as you (@patrick-kidger) note, it should not be needed. Playing with it for a bit, I can remove it in two of the three places, but the underlying logic is not apparent to me, and I am not confident this would keep working for a different number of solvers or problems.
I think the pragmatic thing to do here is to simply document the puzzling necessity of OrderedDict in this script. (Low-key itching to check whether this is an upstream thing...)
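
To make the workaround concrete, the grouping in the plotting script is roughly of the following shape (hypothetical field names; the real script and pytest-benchmark's saved JSON layout may differ):

```python
import json
from collections import OrderedDict


def group_by_solver(benchmark_json_path):
    """Group saved pytest-benchmark results by solver for plotting."""
    with open(benchmark_json_path) as f:
        data = json.load(f)

    # A plain dict preserves insertion order on Python 3.7+, so OrderedDict
    # should in principle be redundant here; it is kept because without it the
    # per-solver results were observed to come out jumbled.
    results = OrderedDict()  # solver name -> list of (problem, mean runtime)
    for bench in data["benchmarks"]:
        solver = bench["params"]["solver"]  # hypothetical parameter names
        problem = bench["params"]["problem"]
        results.setdefault(solver, []).append((problem, bench["stats"]["mean"]))
    return results
```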

@johannahaffner (Collaborator, Author)

Tests are failing because JAX 0.8.2 is used with Python 3.12, and a different JAX version is used with Python 3.10, for which JAX has recently dropped support.

johannahaffner marked this pull request as ready for review on December 25, 2025, 16:41
@patrick-kidger (Owner) left a comment

I think this looks reasonable, with only one comment: can we target dev instead of main?

johannahaffner changed the base branch from main to dev on December 27, 2025, 18:59
@johannahaffner (Collaborator, Author)

Now the checks are failing because dev includes a module that triggers the error being addressed in patrick-kidger/lineax#184. Tests pass and benchmarks run locally (benchmarks are skipped in CI, and this PR adds no other tested code).

johannahaffner merged commit b1a908b into patrick-kidger:dev on Dec 27, 2025
0 of 2 checks passed
johannahaffner deleted the pytest-benchmark branch on December 27, 2025, 19:37
patrick-kidger pushed a commit that referenced this pull request on Feb 8, 2026:
* Implement pytest-benchmark based setup for systematic performance evaluation of Optimistix' solvers.

* version bump for sif2jax requirements

* add semi-recent matplotlib version to specify a minimum

* no more monkeypatching

* set EQX_ON_ERROR with os.environ

* give a reason for skipping compilation tests

* Add L-BFGS solvers to benchmark suite

* clarify what --benchmark-autosave will do.

* remove strict dtype promotion rules - benchmarks are not tests, so we don't need them here. We would otherwise have to use context management for any comparison to Optax minimisers.

* state purpose of --scipy flag more clearly.

* improve contribution guidelines, inline decorator, specify pyright errata

* pyproject.toml from main

* add sif2jax

* move benchmark dependencies to tests group

* add benchmark-skip option

* add example to contributing guidelines, document OrderedDict workaround, adapt to sif2jax usage of properties.

---------

Co-authored-by: Johanna Haffner <johanna.haffner@bsse.ethz.ch>
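
(As context for the strict-dtype-promotion commit above: JAX exposes its promotion rules as a context manager, so keeping strict promotion in the benchmarks would have meant wrapping comparisons against Optax minimisers in something like the following sketch; this is not code from this PR.)

```python
import jax

# Sketch only: relax JAX's dtype promotion rules locally so that a comparison
# against an Optax minimiser does not trip strict-promotion errors.
with jax.numpy_dtype_promotion("standard"):
    ...  # run the Optax baseline here
```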
