v0.28.0

spboyer released this 21 Apr 21:06

· 90 commits to main since this release

b1acf61

What's New in v0.28.0

New Features

waza models command (#141) — List available Copilot models with their IDs and capabilities. Requires authentication via copilot login. (@richardpark-msft)
Trigger test early termination (#188) — Trigger tests now cancel the agent session as soon as a skill invocation is detected, instead of waiting for the full timeout. Implemented at the execution layer via CancelOnSkillInvocation flag. (@JasonYeMSFT)
Follow-up prompts (#189) — Support multi-turn eval tasks with follow_up_prompts in task YAML. Follow-ups reuse the same session and workspace, enabling tests for conversational workflows where the agent pauses for confirmation. (@JasonYeMSFT)
Quick Start guide — New focused 5-minute quick start page on the docs site, with Mermaid workflow diagram and tabbed install options. Added as the first sidebar item.

Bug Fixes

CI integration test (#210) — Fixed the root cause of persistent ubuntu-latest CI failures. PR #203 (v0.27.0) wired up evaluateExpectations() which made output_contains checks execute for the first time — the mock executor's generic output didn't match. CI now correctly allows eval failures with mock while still catching crashes.
YAML validation audit (#132) — Verified all 10 user-facing config loaders use strict KnownFields(true) parsing. Added regression test for unknown field rejection. (@LarryOsterman)

Infrastructure

CODEOWNERS simplified to @spboyer
Branch protection rulesets updated with proper bypass actors and streamlined required checks (Lint, CLA, test)

Documentation

Quick Start page with install → auth → first eval workflow
waza models added to CLI reference
follow_up_prompts and trigger early termination documented in eval-yaml guide

Full Changelog: v0.27.0...v0.28.0

Contributors

spboyer, LarryOsterman, and 2 other contributors

Assets 2