v0.5.1
v0.5.1
Benchmark-calibrated default policy and reproducible benchmark tooling.
Changed
- Updated the default EMRG policy using the local benchmark harness.
- Reduced CDR over-selection on cases where ZNE performed better under the benchmarked simulator/noise model.
- Added reproducible benchmark scoring, train/holdout support, and fixed-seed policy search.
- Added
benchmarks/policies/default-v050.jsonanddefault-v051.jsonfor policy comparison. - Clarified README benchmark notes so historical results are not presented as current defaults.
Validation
- 486 tests passing.
- 96% coverage.
- Ruff format/check passing.
- Wheel build and
twine checkpassing. - Clean wheel install passing.
- Policy init/validate passing from installed wheel.
- Local benchmark score improved from 0.7872 to 1.8455 on the full corpus, with no new failures and unchanged skips.
Caveat
Benchmark results are from EMRG’s local simulator/noise-model harness, not hardware claims.