Releases: slowdini/eval-magic

eval-magic v0.4.1

@github-actions github-actions released this 25 Jun 03:38
f7a8242

What's Changed

  • Merge pull request #107 from slowdini/dev by @slowdini in #108
  • fix(cli-mode): fix runbook framing and permission issues for cli mode by @slowdini in #110

Full Changelog: v0.4.0...v0.4.1

eval-magic v0.4.0

@github-actions github-actions released this 21 Jun 20:43
5e5aba7

What's Changed

  • docs(isolated-run): resolve design spike & add env/dispatch design note (#77) by @slowdini in #88
  • feat(isolated-runs): runbook artifact by @slowdini in #89
  • feat(isolated-runs): create isolated env for eval runs by @slowdini in #91
  • feat(isolated-runs): full-loop handoff for the isolated session (#79) by @slowdini in #92
  • feat(isolated-runs): retire the session-juggling apparatus (#80) by @slowdini in #93
  • feat(isolated-runs): focus guard flag for new isolated envs by @slowdini in #94
  • feat(claude): hybrid run mode support by @slowdini in #95
  • feat(claude): headless run mode support by @slowdini in #96
  • docs(isolated-runs): encapsulation bug fixes and docs update by @slowdini in #97
  • feat(run): setup-time isolation grouping for multi-run batches (#90) by @slowdini in #98
  • fix(run): walk per-(group,condition) Cli envs for guard + plugin-shadow (#99) by @slowdini in #101
  • chore(docs): retire isolated-run.md (#100) by @slowdini in #103
  • fix(codex): fix flag order by @slowdini in #105
  • chore(cli): rename artifacts directories by @slowdini in #106

Full Changelog: v0.3.4...v0.4.0

eval-magic v0.3.4

@github-actions github-actions released this 19 Jun 00:22
89eb963

Full Changelog: v0.3.3...v0.3.4

eval-magic v0.3.3

@github-actions github-actions released this 19 Jun 00:03
0fc0897

Full Changelog: v0.3.2...v0.3.3

eval-magic v0.3.1

@github-actions github-actions released this 18 Jun 05:45
6ab5511

What's Changed

  • Merge pull request #62 from slowdini/dev by @slowdini in #63
  • feat(headless): add model selection support to headless run mode by @slowdini in #67
  • fix(parity): add run capability matrix, to replace hard-coded codex ref by @slowdini in #68
  • fix(promote-baseline): fix promote baseline for multi-run gradings by @slowdini in #71

Full Changelog: v0.3.0...v0.3.1

eval-magic v0.3.0

@github-actions github-actions released this 17 Jun 03:07
14e29d0

What's Changed

  • Merge pull request #28 from slowdini/dev by @slowdini in #29
  • chore(docs): document run mode intention by @slowdini in #30
  • chore(dx): agent memory files by @slowdini in #46
  • feat(cli): init command for fresh evals/eval.json by @slowdini in #47
  • refactor(pipeline): enumerate per-condition run slots by @slowdini in #48
  • feat(run): N runs per condition cell (--runs + per-eval override) by @slowdini in #49
  • feat(cli): simplify and improve param default values by @slowdini in #50
  • feat(cli): warn that guard is armed between iterations by @slowdini in #51
  • feat(cli): record run provenance in manifest, handle notes.md on promote by @slowdini in #52
  • feat(opencode): add foundational harness selection and staging by @slowdini in #53
  • fix(cli): sharpen run/ingest diagnostics and next-step commands by @slowdini in #54
  • fix(claude): provide correct handling and guidance for claude skill-staging gotcha by @slowdini in #55
  • feat(claude): auto-resolve subagents dir by @slowdini in #56
  • fix(codex): fix codex run friction by @slowdini in #57
  • refactor(adapters): HarnessAdapter trait + run-mode foundation (PR1 of #58) by @slowdini in #59
  • refactor(run): organize dispatch + ingest around DispatchMechanism (PR2 of #58) by @slowdini in #60
  • docs(parity): reframe parity around run mode; comment sweep (PR3 of #58) by @slowdini in #61

Full Changelog: v0.2.1...v0.3.0

eval-magic v0.2.1

@github-actions github-actions released this 10 Jun 17:00
8a24c41

Release notes

  • Rename binary to eval-magic

eval-magic v0.2.0

@github-actions github-actions released this 10 Jun 07:25
f12d870

Release notes

  • Codex to full parity

eval-magic v0.1.3

@github-actions github-actions released this 10 Jun 04:24
b793bf5

What's Changed

  • Main by @slowdini in #2
  • feat(rewrite): phase 1 - core - complete by @slowdini in #3
  • feat(rewrite): phase 2 - validation - complete by @slowdini in #4
  • feat(rewrite): phase 3 - adapters - complete by @slowdini in #5
  • feat(rewrite): phase 4 - sandbox - complete by @slowdini in #6
  • feat(rewrite): phase 5 — pipeline by @slowdini in #7
  • feat(rewrite): phase 6 - workspace - complete by @slowdini in #8
  • feat(rewrite): phase 7 — cli/run orchestrator by @slowdini in #9
  • refactor: enforce <500-line file guideline across the rewrite by @slowdini in #10
  • Cutover: binary distribution, evergreen docs, roadmap retirement by @slowdini in #15
  • Release v0.1.1 by @slowdini in #16
  • Merge pull request #16 from slowdini/dev by @slowdini in #17
  • fix(ci): push release tag with PAT so dist workflow triggers by @slowdini in #19

Full Changelog: v0.1.0...v0.1.3

eval-magic v0.1.0

@github-actions github-actions released this 08 Jun 17:11
51fbe18

Full Changelog: https://github.com/slowdini/eval-magic/commits/v0.1.0