Waza v0.37.0
What's Changed
- ci: Deploy Pages after release workflow by @spboyer in #265
- docs: fix AGENTS root path guidance by @drvoss in #269
- fix: guard empty sandbox prompts by @spboyer in #278
- fix: respect signal cancellation in waza run by @spboyer in #279
- docs: map OpenAI Evals modelgraded YAML to waza graders by @spboyer in #280
- Fix custom-agent eval example schema by @spboyer in #282
- docs: fix binary release links for #276 by @spboyer in #284
- docs: align custom agent eval docs with skill field by @spboyer in #283
- docs: update integration testing guide by @spboyer in #281
- feat: add per-trial usage to results JSON by @spboyer in #277
- fix: use SDK approval kind for tool permissions by @drvoss in #268
- feat: wire BYOK providers by @slbug in #240
- Add waza update command by @spboyer in #288
- Add forbidden skills to skill invocation grader by @spboyer in #291
- Add skill body injection opt-out by @spboyer in #292
- Add agent-friendly PR and issue templates by @spboyer in #293
- Improve
waza runconcurrency: shared Copilot client + auto-sized workers (#135) by @spboyer with @Copilot in #221 - Release v0.34.0 by @spboyer in #294
- Fix skill best practices reference by @spboyer in #298
- Fix installer latest release selection by @spboyer in #299
- fix: prevent Copilot CLI PATH fallback by @spboyer in #300
- Simplify AgentEngine cancellation around caller contexts by @spboyer with @Copilot in #290
- fix: pass --model via CLIArgs to override user settings and experiment flights by @sebastienlevert in #263
- feat: support git worktree resources in task inputs (#121) by @spboyer in #302
- fix: skip --model CLI startup arg when BYOK provider is configured (#305) by @spboyer in #306
- feat(pricing): model-aware cost calculation for dashboard by @spboyer in #310
- Upgrade Copilot SDK to v1.0.0 and surface premium-request credits on the dashboard by @spboyer in #311
- Release v0.35.0 by @spboyer in #312
- chore(deps): Bump esbuild, @tailwindcss/vite, @vitejs/plugin-react and vite in /web by @dependabot[bot] in #317
- fix(execution): add a first-event watchdog to catch session-start hangs by @sebastienlevert in #321
- fix(graders): make prompt-grader timeout configurable via WAZA_PROMPT_GRADER_TIMEOUT by @sebastienlevert in #319
- feat: Upgrade Squad framework from 0.8.25 to 0.10.0 by @spboyer in #323
- Phase 1 & 2: Squad workflows + failure detection infrastructure #322 by @spboyer in #324
- [WIP] Add support for custom agents in waza check by @spboyer with @Copilot in #314
- Release v0.36.0 by @spboyer in #325
- feat: complete issue #322 with triage automation and regression loop by @spboyer in #326
- Surface engine failures in
waza suggestby @spboyer with @Copilot in #330 - Materialize task-level context fixtures in workspaces by @spboyer with @Copilot in #329
- chore(deps): Bump js-yaml from 4.1.1 to 4.2.0 in /site by @dependabot[bot] in #327
- chore(deps): Bump astro from 6.3.2 to 6.4.7 in /site by @dependabot[bot] in #331
- feat: drive interactive skills via an LLM responder (#303) by @adamdougal in #304
- chore: upgrade copilot-sdk to v1.0.2 and re-bundle embedded CLI to 1.0.64-0 (fixes session.idle hang) by @sebastienlevert in #333
- chore: refresh dependencies by @spboyer in #335
- Release v0.37.0 by @spboyer in #334
New Contributors
- @slbug made their first contribution in #240
- @adamdougal made their first contribution in #304
Full Changelog: v0.33.0...v0.37.0