Releases: slowdini/eval-magic
Releases · slowdini/eval-magic
Release list
eval-magic v0.4.1
eval-magic v0.4.0
What's Changed
- docs(isolated-run): resolve design spike & add env/dispatch design note (#77) by @slowdini in #88
- feat(isolated-runs): runbook artifact by @slowdini in #89
- feat(isolated-runs): create isolated env for eval runs by @slowdini in #91
- feat(isolated-runs): full-loop handoff for the isolated session (#79) by @slowdini in #92
- feat(isolated-runs): retire the session-juggling apparatus (#80) by @slowdini in #93
- feat(isolated-runs): focus guard flag for new isolated envs by @slowdini in #94
- feat(claude): hybrid run mode support by @slowdini in #95
- feat(claude): headless run mode support by @slowdini in #96
- docs(isolated-runs): encapsulation bug fixes and docs update by @slowdini in #97
- feat(run): setup-time isolation grouping for multi-run batches (#90) by @slowdini in #98
- fix(run): walk per-(group,condition) Cli envs for guard + plugin-shadow (#99) by @slowdini in #101
- chore(docs): retire isolated-run.md (#100) by @slowdini in #103
- fix(codex): fix flag order by @slowdini in #105
- chore(cli): rename artifacts directories by @slowdini in #106
Full Changelog: v0.3.4...v0.4.0
eval-magic v0.3.4
What's Changed
Full Changelog: v0.3.3...v0.3.4
eval-magic v0.3.3
Full Changelog: v0.3.2...v0.3.3
eval-magic v0.3.1
What's Changed
- Merge pull request #62 from slowdini/dev by @slowdini in #63
- feat(headless): add model selection support to headless run mode by @slowdini in #67
- fix(parity): add run capability matrix, to replace hard-coded codex ref by @slowdini in #68
- fix(promote-baseline): fix promote baseline for multi-run gradings by @slowdini in #71
Full Changelog: v0.3.0...v0.3.1
eval-magic v0.3.0
What's Changed
- Merge pull request #28 from slowdini/dev by @slowdini in #29
- chore(docs): document run mode intention by @slowdini in #30
- chore(dx): agent memory files by @slowdini in #46
- feat(cli): init command for fresh evals/eval.json by @slowdini in #47
- refactor(pipeline): enumerate per-condition run slots by @slowdini in #48
- feat(run): N runs per condition cell (--runs + per-eval override) by @slowdini in #49
- feat(cli): simplify and improve param default values by @slowdini in #50
- feat(cli): warn that guard is armed between iterations by @slowdini in #51
- feat(cli): record run provenance in manifest, handle notes.md on promote by @slowdini in #52
- feat(opencode): add foundational harness selection and staging by @slowdini in #53
- fix(cli): sharpen run/ingest diagnostics and next-step commands by @slowdini in #54
- fix(claude): provide correct handling and guidance for claude skill-staging gotcha by @slowdini in #55
- feat(claude): auto-resolve suagents dir by @slowdini in #56
- fix(codex): fix codex run friction by @slowdini in #57
- refactor(adapters): HarnessAdapter trait + run-mode foundation (PR1 of #58) by @slowdini in #59
- refactor(run): organize dispatch + ingest around DispatchMechanism (PR2 of #58) by @slowdini in #60
- docs(parity): reframe parity around run mode; comment sweep (PR3 of #58) by @slowdini in #61
Full Changelog: v0.2.1...v0.3.0
eval-magic v0.2.1
Release notes
- Rename binary to eval-magic
eval-magic v0.2.0
Release notes
- Codex to full parity
eval-magic v0.1.3
What's Changed
- Main by @slowdini in #2
- feat(rewrite): phase 1 - core - complete by @slowdini in #3
- feat(rewrite): phase 2 - validation - complete by @slowdini in #4
- feat(rewrite): phase 3 - adapters - complete by @slowdini in #5
- feat(rewrite): phase 4 - sandbox - complete by @slowdini in #6
- feat(rewrite): phase 5 — pipeline by @slowdini in #7
- feat(rewrite): phase 6 - workspace - complete by @slowdini in #8
- feat(rewrite): phase 7 — cli/run orchestrator by @slowdini in #9
- refactor: enforce <500-line file guideline across the rewrite by @slowdini in #10
- Cutover: binary distribution, evergreen docs, roadmap retirement by @slowdini in #15
- Release v0.1.1 by @slowdini in #16
- Merge pull request #16 from slowdini/dev by @slowdini in #17
- fix(ci): push release tag with PAT so dist workflow triggers by @slowdini in #19
Full Changelog: v0.1.0...v0.1.3
eval-magic v0.1.0
What's Changed
New Contributors
Full Changelog: https://github.com/slowdini/eval-magic/commits/v0.1.0