v0.7.1
Fixed
- Fixed six engine correctness gaps: early
--formatvalidation before side effects, explicit--warm-context 1.0handling, friction-aware assigned-agent METR re-checks, Python 3.10-compatible Z timestamp parsing, UTC-normalized calibration cohorts/order, andplan_wavesduplicate/negative input validation (#31, #33, #36, #37, #38, #39). - README hero example now uses the real CLI surface with real captured output — the previous block invoked a nonexistent
--modelflag, passed an unsupported second positional task, and showed fabricated numbers (#28). - README Action workflow example now grants
issues: read, required for issue fetching on private repos where an explicitpermissions:block zeroes unlisted scopes (#40). - Claude Code plugin exposes the
/estimateskill again:plugin.jsonnow points skill discovery atskills/estimate/claude/, which the v0.7.0 multi-runtime restructure had moved out of the defaultskills/<name>/SKILL.mdscan path (#29). - Dropped the phantom
/validate-estimateand/calibrateslash commands (documented but never installable); validation and calibration now route through the installed skill as/estimate validateand/estimate calibrate(#29). - Refreshed stale METR model keys in
examples/multi-agent.mdoutput (gpt_5_3→gpt_5_4,gemini_3_pro→gemini_3_1_pro) to match current output (#28).
Full Changelog: v0.7.0...v0.7.1