Reduce Daily Syntax Error Quality workflow token churn without changing cadence #31586
Co-authored-by: pelikhan <4175913+pelikhan@users.noreply.github.com>

    name: Daily Syntax Error Quality Check
    description: Tests compiler error message quality by introducing syntax errors in workflows, evaluating error clarity, and suggesting improvements
    on:
      schedule: daily

Pull request overview
This PR aims to reduce token/turn amplification in the Daily Syntax Error Quality workflow while keeping the existing daily schedule, primarily by tightening the prompt’s execution guidance and narrowing how candidate workflows are staged for the agent.
Changes:
- Adds explicit batching/tool-call budget rules to the prompt to encourage fewer, chained bash/tool calls.
- Introduces a pre-step that copies 5 randomly selected candidate workflow files into `/tmp/gh-aw/agent/candidates` and updates prompt/tool examples to use that directory.
- Reduces prompt size by removing unused/verbose example sections.
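
As an illustration of the batching guidance, the sketch below gathers a per-file summary for every staged candidate in one bash invocation instead of issuing one tool call per file. `summarize_candidates` is a hypothetical helper, and the line count only stands in for whatever per-file check the agent actually runs; neither is taken from the workflow prompt.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper: emit one "<file> <line count>" row per candidate,
# so a single tool call returns the whole batch.
summarize_candidates() {
  local dir="$1" file
  for file in "$dir"/*.md; do
    [ -e "$file" ] || continue   # skip the literal glob when the dir is empty
    printf '%s %s\n' "$(basename "$file")" "$(wc -l < "$file" | tr -d ' ')"
  done
}

# Demo against a throwaway directory so the sketch runs anywhere.
demo_dir="$(mktemp -d)"
printf 'a\nb\n' > "$demo_dir/one.md"
printf 'c\n' > "$demo_dir/two.md"
summary="$(summarize_candidates "$demo_dir")"
echo "$summary"
```

In the workflow itself the directory would presumably be `/tmp/gh-aw/agent/candidates`; the demo uses a temporary directory so it has no side effects.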
| File | Description |
|---|---|
| .github/workflows/daily-syntax-error-quality.md | Updates candidate staging, tool allowlist prefixes, and prompt guidance to reduce tool-call/token churn. |
| .github/workflows/daily-syntax-error-quality.lock.yml | Regenerated compiled workflow reflecting the updated prompt and candidate-copy step. |
Copilot's findings
- Files reviewed: 2/2 changed files
- Comments generated: 1

    mkdir -p /tmp/gh-aw/agent/candidates
    find .github/workflows -name '*.md' -type f ! -name 'daily-*.md' ! -name '*-test.md' \
      | shuf -n 5 \
      > /tmp/gh-aw/agent/candidates.txt
    while IFS= read -r workflow; do
      cp "$workflow" /tmp/gh-aw/agent/candidates/
    done < /tmp/gh-aw/agent/candidates.txt

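
The staging step above can also be exercised in isolation. The sketch below wraps the same `find` / `shuf` / `cp` pipeline in a hypothetical helper, `stage_candidates` (not part of the PR), so it can be pointed at a scratch directory. It assumes GNU `shuf` is available and that filenames contain no newlines. Because the destination is flat, files with the same basename in different subdirectories would overwrite one another.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Hypothetical helper (not from the PR): copy up to COUNT randomly chosen
# workflow files from SRC into DEST, using the same filters as the pre-step.
stage_candidates() {
  local src="$1" dest="$2" count="$3"
  mkdir -p "$dest"
  find "$src" -name '*.md' -type f ! -name 'daily-*.md' ! -name '*-test.md' \
    | shuf -n "$count" \
    | while IFS= read -r workflow; do
        cp "$workflow" "$dest/"
      done
}

# Demo against a throwaway tree: six eligible files plus two excluded ones.
demo_root="$(mktemp -d)"
mkdir -p "$demo_root/workflows"
for name in alpha beta gamma delta epsilon zeta daily-report smoke-test; do
  printf '# %s\n' "$name" > "$demo_root/workflows/$name.md"
done

stage_candidates "$demo_root/workflows" "$demo_root/candidates" 5
staged_count="$(find "$demo_root/candidates" -type f -name '*.md' | wc -l)"
echo "staged $staged_count candidate(s)"
```

The demo tree is invented for illustration; only the filter expressions and the sample size of 5 come from the snippet above.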
✨ Enhancement
What does this improve?
The Daily Syntax Error Quality workflow was averaging 99 turns against a stated 20-turn budget, making prompt replay the main token cost driver. This change keeps the workflow on its existing daily cadence, tightens how the agent accesses candidate workflows, and makes the prompt more explicit about batching work into fewer tool calls.
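
To see why turn count dominates, consider a simplified model, assumed here purely for illustration, in which every turn replays the full prompt, so replayed prompt tokens grow linearly with the number of turns. The 99-turn and 20-turn figures come from the description above; the 4,000-token prompt size is a made-up placeholder, not a measurement.

```shell
#!/usr/bin/env bash
set -euo pipefail

# Simplified model (an assumption): each turn re-sends the whole prompt.
replay_tokens() {
  local turns="$1" prompt_tokens="$2"
  echo $(( turns * prompt_tokens ))
}

prompt_tokens=4000   # hypothetical prompt size, for illustration only
observed="$(replay_tokens 99 "$prompt_tokens")"
budgeted="$(replay_tokens 20 "$prompt_tokens")"

echo "observed (99 turns): $observed tokens"
echo "budgeted (20 turns): $budgeted tokens"
echo "excess replay:       $(( observed - budgeted )) tokens"
```

Under this model the observed-to-budgeted ratio is 99/20, roughly 5x, whatever the prompt size, which is why cutting turns and shrinking the prompt compound.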
Why is this valuable?
The workflow is intended to sample compiler error quality every day without unnecessary prompt and tool-call overhead. Preserving the daily schedule maintains the existing coverage, while stricter execution guidance and a smaller prompt should cut per-run token amplification without changing the workflow’s core evaluation behavior.
Implementation approach:
- Schedule and run volume
- Prompt execution constraints
- Candidate access narrowing: the agent works from `/tmp/gh-aw/agent/candidates` rather than the full `.github/workflows` tree.
- Prompt size reduction