benchmarks: convergence suite v1 (Ipopt + MadNLP on X gate) by jack-champagne · Pull Request #93 · harmoniqs/DirectTrajOpt.jl

jack-champagne · 2026-05-16T05:06:21Z

Summary

Adds benchmark/convergence/ — a small DTO benchmark environment that
exercises HarmoniqsBenchmarks.jl v0.2.0's new convergence API
(InfidelityConvergence, ipopt_capture, compare_convergence).

Scope is deliberately tight (one IPOPT + one MadNLP testitem on the same
X-gate state-transfer); atoms-demo / spin-qubit-demo / bosonic-demo problems
land in separate follow-up PRs once this harness is proven green.

Depends on: HarmoniqsBenchmarks.jl main (v0.2.0 + convergence schema, not yet registered).

What's included

benchmark/convergence/Project.toml — isolated env with HBJ (main),
Ipopt, MadNLP, NamedTrajectories. DirectTrajOpt pulled in as a path dep
to ../.. so we test the package under PR HEAD, not General.
benchmark/convergence/problem_utils.jl — _make_xgate_prob (bilinear
4D rep + terminal cost + control regularizers) and
_build_convergence_result (rebuilds a BenchmarkResult with a
ConvergenceCriterion attached).
benchmark/convergence/convergence.jl — two @testitems:
- X gate convergence: Ipopt — ipopt_capture() extracts final
  iter_count + inf_pr, builds InfidelityConvergence with
  target_infidelity = 1e-3, feas_tol = 1e-6, asserts converged(...).
- X gate convergence: MadNLP — same problem, MadNLP solver; uses
  post-solve evaluator.constraint_violation as the primal-infeasibility
  proxy (no MadNLP capture analogue yet).
benchmark/convergence/README.md — usage instructions.
.github/workflows/benchmark-convergence.yml — Julia 1.12, 30 min
timeout, JLD2 artifact upload.

Local results

=== X gate convergence (Ipopt) ===
  iters=22  final_inf=4.430e-11  inf_pr=3.307e-09  wall=12.363s  converged=true
=== X gate convergence (MadNLP) ===
  final_inf=3.164e-14  cviol=1.537e-10  wall=13.647s  converged=true
Test Summary: | Pass  Total   Time
Package       |    6      6  37.2s

JLD2 artifacts produced:

xgate_convergence_ipopt_N51_<commit>.jld2
xgate_convergence_madnlp_N51_<commit>.jld2

Test plan

benchmark-convergence workflow green on this PR
Both testitems assert converged(result.convergence) == true
compare_convergence([result]) returns one row per converging benchmark
JLD2 artifacts uploaded as workflow artifact

Out of scope (follow-up PRs)

Atoms-demo convergence (Piccolo / QuantumCollocation downstream)
Spin-qubit-demo convergence
NYU bosonic-demo convergence
MadNLP capture-callback analogue to ipopt_capture()

Adds `benchmark/convergence/` exercising HarmoniqsBenchmarks.jl v0.2.0's new convergence API: `InfidelityConvergence`, `ipopt_capture`, and `compare_convergence`. Two `@testitem`s on a shared bilinear X-gate state-transfer problem (N=51, terminal cost + control regularizers), both asserting `converged(...) == true` against a `target_infidelity = 1e-3`, `feas_tol = 1e-6` criterion. Ipopt path uses `ipopt_capture()` to read final `iter_count` / `inf_pr`; MadNLP falls back to the post-solve evaluator's `constraint_violation` as the primal-infeasibility proxy (no MadNLP capture analogue yet). Adds a dedicated `.github/workflows/benchmark-convergence.yml` workflow (Julia 1.12, 30 min timeout, artifact upload) so this suite runs without piggybacking on the timing-only benchmark workflow from PR #75. Demo problems (atoms / spin-qubit / bosonic) intentionally deferred to follow-up PRs once this harness is proven green. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Strips aligned `=` padding from the new `benchmark/convergence/*.jl` files to satisfy the Formatter CI job (which fails on any diff vs. `JuliaFormatter.format(".")`). No semantic changes — both testitems still pass locally on Julia 1.12. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

The convergence suite under `benchmark/convergence/` uses `@testitem` blocks that depend on `HarmoniqsBenchmarks` — which isn't in the package's main test environment. `@run_package_tests` was picking those up and failing with `ArgumentError: Package HarmoniqsBenchmarks not found in current path` on Julia 1.10/1.11/1.12. Filter out anything whose path includes a `benchmark` component. Match the path component (via `splitpath`) rather than a substring so future test files like `foo_benchmark.jl` aren't accidentally excluded. Mirrors the same filter added in PR #75 (initial benchmark suite).

codecov · 2026-05-20T05:27:02Z

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

Tracking `rev = "main"` makes convergence-suite CI non-reproducible: any commit to HBJ silently changes the schema/extractors for every old DTO CI re-run, and cached CI runs diverge from cold ones. Pin to HBJ's current main HEAD (c38418cb7f — alloc analyzer + Piccolo-aligned CI), matching the pin pattern used in `benchmark/Project.toml` for the initial benchmark suite. Drop in favor of `[compat]` once HBJ registers in General (PR JuliaRegistries/General#155879).

After the deterministic+sweep rework no testitems use :experimental anymore, and the sweeps add ~5 seconds in aggregate on top of a ~10-minute matrix — not worth a parallel filter framework. Drops both INCLUDE_EXPERIMENTAL and INCLUDE_ROBUSTNESS env vars and the ti.tags closure. The robustness sweeps now run on every PR (along with their deterministic baselines). Keeps a `/benchmark/` path filter following Piccolissimo.jl's convention — the benchmark subtree has its own Project.toml + deps + workflow, so its @testItems shouldn't be discovered by `@run_package_tests`. This also preemptively unbreaks PR #93's CI (which fails today because benchmark/convergence/convergence.jl is being picked up). Verified locally on Julia 1.12: 355 pass / 1 broken / 0 fail. TimeConsistency sweep: pass_rate = 1.0 (20/20) NonlinearKnotPoint sweep: pass_rate = 0.8 (16/20, threshold 0.65) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

jack-champagne and others added 3 commits May 16, 2026 01:05

This was referenced May 20, 2026

test: deterministic seeds + multi-seed robustness sweeps for flaky tests #45

Draft

bench: Ipopt + MadNLP allocation profile script #71

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

benchmarks: convergence suite v1 (Ipopt + MadNLP on X gate)#93

benchmarks: convergence suite v1 (Ipopt + MadNLP on X gate)#93
jack-champagne wants to merge 4 commits into
mainfrom
benchmarks/convergence-suite-v1

jack-champagne commented May 16, 2026

Uh oh!

codecov Bot commented May 20, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

jack-champagne commented May 16, 2026

Summary

What's included

Local results

Test plan

Out of scope (follow-up PRs)

Uh oh!

codecov Bot commented May 20, 2026

Codecov Report

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant