v0.28.0

@spboyer spboyer released this 21 Apr 21:06
· 90 commits to main since this release
b1acf61

What's New in v0.28.0

New Features

  • waza models command (#141) — List available Copilot models with their IDs and capabilities. Requires authentication via copilot login. (@richardpark-msft)

  • Trigger test early termination (#188) — Trigger tests now cancel the agent session as soon as a skill invocation is detected, instead of waiting for the full timeout. Implemented at the execution layer via the CancelOnSkillInvocation flag. (@JasonYeMSFT)

  • Follow-up prompts (#189) — Support multi-turn eval tasks with follow_up_prompts in task YAML. Follow-ups reuse the same session and workspace, enabling tests for conversational workflows where the agent pauses for confirmation. (@JasonYeMSFT)

  • Quick Start guide — A new, focused 5-minute Quick Start page on the docs site, with a Mermaid workflow diagram and tabbed install options. It is now the first item in the sidebar.

Bug Fixes

  • CI integration test (#210) — Fixed the root cause of persistent ubuntu-latest CI failures. PR #203 (v0.27.0) wired up evaluateExpectations(), which made output_contains checks execute for the first time, and the mock executor's generic output did not match them. CI now tolerates eval failures when running with the mock executor while still catching crashes.

  • YAML validation audit (#132) — Verified all 10 user-facing config loaders use strict KnownFields(true) parsing. Added regression test for unknown field rejection. (@LarryOsterman)
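
For readers unfamiliar with strict parsing, the sketch below shows what rejecting unknown fields looks like in Go. To keep it dependency-free it swaps in the standard library's encoding/json decoder and its DisallowUnknownFields method as a stand-in for the YAML decoder's KnownFields(true). The TaskConfig struct and loadStrict helper are hypothetical and do not reflect waza's actual config loaders.

```go
package main

import (
	"encoding/json"
	"fmt"
	"strings"
)

// TaskConfig is a hypothetical config shape used only for this sketch.
type TaskConfig struct {
	Name   string `json:"name"`
	Prompt string `json:"prompt"`
}

// loadStrict decodes input and fails if it contains any field that is not
// declared on TaskConfig, so a misspelled key is an error, not a silent no-op.
func loadStrict(input string) (TaskConfig, error) {
	var cfg TaskConfig
	dec := json.NewDecoder(strings.NewReader(input))
	dec.DisallowUnknownFields() // JSON analog of KnownFields(true)
	if err := dec.Decode(&cfg); err != nil {
		return TaskConfig{}, err
	}
	return cfg, nil
}

func main() {
	// "promt" is a typo for "prompt" and is rejected instead of ignored.
	_, err := loadStrict(`{"name": "demo", "promt": "typo in key"}`)
	fmt.Println(err != nil) // prints "true"

	cfg, err := loadStrict(`{"name": "demo", "prompt": "hello"}`)
	fmt.Println(cfg.Name, err) // prints "demo <nil>"
}
```

A regression test for unknown-field rejection, as described in #132, would assert the first case: a config with an unrecognized key must produce an error.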

Infrastructure

  • CODEOWNERS simplified to @spboyer
  • Branch protection rulesets updated with proper bypass actors and streamlined required checks (Lint, CLA, test)

Documentation

  • Quick Start page with install → auth → first eval workflow
  • waza models added to CLI reference
  • follow_up_prompts and trigger early termination documented in eval-yaml guide

Full Changelog: v0.27.0...v0.28.0