docs: Add CI/CD integration guide (GitHub Actions, Azure DevOps) #100
Open
Pull request overview
This PR replaces #97 (which had leaked .squad/ files) and adds a comprehensive CI/CD integration guide to the waza docs site at site/src/content/docs/guides/ci-cd.mdx. The guide covers installing waza in CI, running evaluations in GitHub Actions, Azure DevOps, and GitLab CI, along with best practices and troubleshooting tips.
Changes:
- Rewrites the existing `ci-cd.mdx` guide from a simple GitHub Actions-focused page into a comprehensive multi-platform CI/CD integration guide covering GitHub Actions, Azure DevOps, and GitLab CI
- Adds sections for token budget checks, caching, path filtering, advanced workflows (baseline comparison, multi-stage pipelines), and troubleshooting
- Adds tabbed UI components for cross-platform secret configuration examples
Comments suppressed due to low confidence (2)
site/src/content/docs/guides/ci-cd.mdx:267
- This uses `waza tokens diff` with non-existent `--ref1` and `--ref2` flags. The correct command is `waza tokens compare`, which takes positional arguments: `waza tokens compare origin/main HEAD --format json > token-diff.json`. Additionally, the `jq '.exceeded'` check is wrong: the JSON output has no top-level `exceeded` field. The correct check would be `jq '.summary.exceededCount'` (an integer, not a boolean), so the condition should be something like `if [ "$EXCEEDED" -gt 0 ]`.
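Putting the corrections from this comment together, the Azure DevOps step might look like the sketch below. It uses only the command, arguments, and JSON field named in the comment itself; it assumes `waza` and `jq` are already installed on the agent and that `origin/main` has been fetched, neither of which is confirmed here. The step as originally submitted is quoted after the sketch.

```yaml
# Sketch only: assumes waza and jq are available on the build agent
# and that origin/main is present in the checkout.
- script: |
    # Positional refs, per the review comment (no --ref1/--ref2 flags)
    waza tokens compare origin/main HEAD --format json > token-diff.json
    # .summary.exceededCount is described as an integer, not a boolean
    EXCEEDED=$(jq '.summary.exceededCount' token-diff.json)
    if [ "$EXCEEDED" -gt 0 ]; then
      echo "##vso[task.logissue type=error;]Token budget exceeded"
      exit 1
    fi
  displayName: 'Check token budget'
```

The integer comparison with `-gt 0` replaces the string comparison against `"true"`, which could never match a numeric count.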
```yaml
- script: |
    waza tokens diff \
      --ref1 origin/main \
      --ref2 HEAD \
      --format json > token-diff.json
    EXCEEDED=$(jq '.exceeded' token-diff.json)
    if [ "$EXCEEDED" == "true" ]; then
      echo "##vso[task.logissue type=error;]Token budget exceeded"
      exit 1
    fi
  displayName: 'Check token budget'
```
site/src/content/docs/guides/ci-cd.mdx:379
- Same `waza tokens diff` issue here: the correct command is `waza tokens compare --strict`.
```yaml
- name: Check token budget
  run: waza tokens diff --strict
```
Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
- Fix waza tokens diff -> waza tokens compare (3 occurrences)
- Fix JSON field paths (results.passed -> results.summary.succeeded)
- Fix waza compare example (takes JSON files, not YAML)
- Remove incorrect Windows PowerShell install example
- Fix waza tokens check to use directory path not eval YAML
- Remove unnecessary Python dependency from Azure DevOps example
- Remove unused Aside import

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>
ca73446 to ccea6a2
chlowell pushed a commit to chlowell/waza that referenced this pull request (Mar 5, 2026)
Closes microsoft#98

Implements the behavior grader type for validating agent behavior metrics (tool usage, token consumption, forbidden/required tools, duration limits).

## Changes
- New `internal/graders/behavior_grader.go` with BehaviorGrader
- Added `TypeBehavior` constant and factory registration
- Extended `graders.Context` with `Session` field for SessionDigest data
Closes #89
Adds CI/CD integration guide to the GitHub Pages docs site.
Replaces #97 (closed — had leaked .squad/ files).