v0.29.0
What's New in v0.29.0
New Features
- Per-task skill directories (#156): Tasks can now override eval-level skill_directories with their own, enabling multi-skill eval suites. (@LarryOsterman)
- Disable skill loading (#126): New disabled_skills config field and --no-skills CLI flag. Use disabled_skills: ["*"] to disable all skills for baseline/comparison testing. (@richardpark-msft)
- Debug workspace preservation (#123): New --keep-workspace flag preserves temp workspace directories after execution for debugging fixture and agent file issues. (@richardpark-msft)
- Version update notifications (#104): waza run now checks for new versions in the background (cached 24h, non-blocking). Disable with --no-update-check or WAZA_NO_UPDATE_CHECK=1. (@RickWinter)
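
As a rough illustration of how the new flags might be combined for a baseline run, here is a minimal sketch. The flag names come from the notes above; the eval file name `eval.yaml`, the argument order, and the assumption that `waza run` accepts all three flags together are illustrative guesses, not confirmed usage.

```shell
#!/bin/sh
# Hypothetical eval spec path (an assumption; not named in these release notes).
EVAL_FILE="eval.yaml"

# Baseline run: skip skill loading, keep the temp workspace for inspection,
# and skip the background version check. Flag names are from the release notes.
# Note: this simple string form assumes EVAL_FILE contains no spaces.
BASELINE_CMD="waza run $EVAL_FILE --no-skills --keep-workspace --no-update-check"

if command -v waza >/dev/null 2>&1; then
  # waza is installed: execute the command.
  $BASELINE_CMD
else
  # waza is not on PATH: just show what would have been run.
  echo "waza not found; would run: $BASELINE_CMD"
fi
```

Setting WAZA_NO_UPDATE_CHECK=1 in the environment should be equivalent to passing --no-update-check, which may be more convenient in CI.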
Test Coverage
- Copilot log parsing edge cases (#115) — 23 new tests covering malformed JSON, truncated logs, binary data, unknown event types, and more. (@richardpark-msft)
Dependencies
- Bumped astro + @astrojs/starlight in site
Full Changelog: v0.28.0...v0.29.0