[audit-workflows] π Agentic Workflow Audit β 2026-06-14 (rebound to 87%, PR Sous Chef recovered) #39289
Replies: 1 comment
-
|
Smoke poke. Cave bot see latest discussion. Run 27515546240 grunt good. Warning Firewall blocked 6 domainsThe following domains were blocked by the firewall during workflow execution:
network:
allowed:
- defaults
- "accounts.google.com"
- "android.clients.google.com"
- "clients2.google.com"
- "contentautofill.googleapis.com"
- "safebrowsingohttpgateway.googleapis.com"
- "www.google.com"See Network Configuration for more information.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
π Agentic Workflow Audit β 2026-06-14
Window: ~7h (14:40β21:35Z) Β· 100 terminal runs analyzed (logs MCP timed out at 120s β partial window, consistent with recent days)
Headline: Strong rebound to 87.0% success after 06-13's 68.4% (worst since 05-23). PR Sous Chef fully recovered (10/10, was 6 fails β fix landed). All remaining genuine prod-main failures are known/recurring β no new prod-main failure class.
Key Metrics
Engines: copilot 66 Β· claude 21 Β· codex 8 Β· antigravity 2 Β· gemini 2 Β· pi 2
π Trends
Success rate snapped back to 87% from the 06-13 trough (68.4%), landing close to the 30-day baseline (~85%). The rebound was driven mostly by PR Sous Chef recovering its 6-run failure cluster; the residual failures are a stable, well-characterized tail.
Token usage (54.5M this window) sits near the 7-day moving average β below the 06-12 126M spike. Top consumers were all successful: [aw] Failure Investigator (5.9M/$4.75), Safe Output Tool Optimizer (4.2M/$3.92), Daily Code Metrics (3.0M/$3.02).
β Resolved / Improved
copilot/aw-fix-pr-sous-chef-failappears landed.Details
avenger-err-config-no-structured-logs(day2)27502877407created PR, $2.06/48 turns) then run reddened;27508639846explicit ERR_CONFIG 0-tok. Fix branchcopilot/aw-avenger-failed-fixnot yet effective.copilot-sdk-drivertool-perm-lockout (day8)forecast-specification.md), permissionDeniedCount=11, 5/5 abort, 24m30s wasted, not retriedcopilot-sdk-drivertool-perm-lockout (day8)doc-unbloat-empty-outputπ§ͺ PR/branch failures (7) β all by-design smoke noise
Details
gemini-3.1-flash-tts-previewhas no AI-credits pricing (RECUR, count 4).27507898911) burned 3.9M tok then CAPIError 429 Maximum AI credits exceeded (1015/1000) β daily-AI-credits cap resurfaced, but only on a heavy PR probe (account-wide cap).π― Top Open Recommendations
copilot-sdk-drivertool-permission-lockout (day8, HIGH) β longest-running unresolved prod-main class. The sdk-driver denies routine read-only ops the workflow legitimately needs (read of.md/spec/source files) then aborts at 5 denials with no retry, wasting 19β24 min/run. claude tolerates the same denials. Fix: allowlist these reads or relax the guard / retry-with-grant.copilot/aw-avenger-failed-fixin flight but not yet effective.References: Β§27502877407 Β· Β§27504260264 Β· Β§27508844367
Beta Was this translation helpful? Give feedback.
All reactions