v0.7.0
What's New in v0.7.0
Added
- Frontend/UI task category with separate content-patch (15/25/40) and page-build (40/60/90) bands.
- App-development task category with a generic cold L-style prior and app/UI human-comparison multiplier.
3-roundreview mode with a 35 minute additive review tier.- METR threshold entries for Opus 4.7 (current) and GPT-5.5;
opus_4_xretained as a forward-compatible alias. - Opt-in structured audit logging via
AGENT_ESTIMATE_AUDIT_*environment variables, emitting secret-scrubbed JSON events to stdout, stderr, or a file.
Changed
- Research-grounded brainstorms now route to the research band instead of the flat brainstorm band.
- Codex model-key alias now resolves to the GPT-5.5 METR threshold; GPT-5.4 remains available.
- Corrected the Codex skill install path in
skills/estimate/README.mdto.codex/skills/.... - Version bumped to v0.7.0 across package, plugin, action, issue template, and tests.
- Claude runtime
/estimateskill refreshed to v0.7.0 parity with the Codex slice (frontend/app_dev types,3-roundreview mode, refreshed METR keys). claude/claude_opusmodel-key aliases now resolve toopus_4_7(Opus 4.7);opus_4_6retained for backward compatibility.
This release also brings the repository structure up to date: a Makefile for dev shortcuts, a scripts/preflight.py pre-PR check, and the multi-runtime skills/estimate/ layout (shared spec + per-runtime Claude/Codex slices).