Agent Anvil v0.2.59

dKosarevsky released this 09 Jun 03:21
· 19 commits to main since this release
c0f1796

Summary

  • Reject versioned results.json files when summary trial counts or pass rate do not match grades.
  • Document summary-vs-grades validation for persisted results artifacts.
  • Add parametrized storage tests for tampered aggregate summaries.
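
The summary-vs-grades check described above can be sketched roughly as follows. This is a minimal illustration, not Agent Anvil's actual implementation: the field names (`grades`, `passed`, `summary`, `total_trials`, `passed_trials`, `pass_rate`), the `ResultsValidationError` class, and the `validate_summary` function are all assumptions made for the example, since the release notes do not show the results.json schema.

```python
import math
from typing import Any


class ResultsValidationError(ValueError):
    """Hypothetical error raised when a summary disagrees with its grades."""


def validate_summary(results: dict[str, Any], tol: float = 1e-9) -> None:
    """Recompute aggregates from per-trial grades and compare to the summary.

    All keys used here are assumed for illustration; the real schema may differ.
    """
    grades = results["grades"]
    summary = results["summary"]

    # Recompute the aggregates from the per-trial grades.
    total = len(grades)
    passed = sum(1 for grade in grades if grade["passed"])
    expected_rate = passed / total if total else 0.0

    if summary["total_trials"] != total:
        raise ResultsValidationError(
            f"total_trials is {summary['total_trials']}, grades contain {total}"
        )
    if summary["passed_trials"] != passed:
        raise ResultsValidationError(
            f"passed_trials is {summary['passed_trials']}, grades contain {passed}"
        )
    # Compare floats with a tolerance rather than exact equality.
    if not math.isclose(summary["pass_rate"], expected_rate, abs_tol=tol):
        raise ResultsValidationError(
            f"pass_rate is {summary['pass_rate']}, expected {expected_rate}"
        )


def example_payload() -> dict[str, Any]:
    """Build a small, internally consistent payload (3 of 4 trials passed)."""
    return {
        "grades": [
            {"passed": True},
            {"passed": True},
            {"passed": False},
            {"passed": True},
        ],
        "summary": {"total_trials": 4, "passed_trials": 3, "pass_rate": 0.75},
    }
```

With a loader built this way, a file whose summary was edited by hand (for example, `pass_rate` raised to 1.0 while one grade still records a failure) would be rejected at load time instead of being reported as is.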

Verification

  • PR #191 CI passed: Demo Eval, Test Python 3.12, Test Python 3.14.
  • Release PR #192 CI passed: Demo Eval, Test Python 3.12, Test Python 3.14.