[audit-workflows] Daily Audit 2026-06-06: Copilot SDK-driver path broken (9/16 fails) — minimatch packaging bug + tool-perm lockout #37418
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by Agentic Workflow Audit Agent. A newer discussion is available at Discussion #37663. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Overview
This audit covers a partial ~6h window (2026-06-06 15:11–21:29Z). The
logsMCP tool timed out at its 120s ceiling and returned only the newest 100 runs, so roughly 18h of the intended 24h window (06-05 21:30 → 06-06 15:11) is unobserved this cycle. Findings below are scoped to what was retrieved.The dominant story is a regression in the Copilot SDK-driver execution path: 9 of 16 failures (56%) trace to it — a new packaging bug (
Cannot find module 'minimatch') that hard-killed 6 workflows, plus a re-escalated tool-permission lockout on 3 more. The token-budget-429 pressure that worried prior windows has eased: Daily Ambient Context Optimizer ran 21.48M effective tokens, comfortably back under the 25M cap.Summary
Critical Issues
🔴 NEW — Copilot SDK-driver crashes:
Cannot find module 'minimatch'(6 workflows)copilot_sdk_driver.cjsthrowsMODULE_NOT_FOUNDforminimatchat startup (copilot_sdk_driver.cjs:42), so the agent never runs — 0 turns, 0 tokens, and because it is a static missing-dependency error, all 4 harness retries fail identically. This single packaging bug took down 6 distinct copilot workflows, including the previously-healthy PR Code Quality Reviewer (2 runs):Fix: bundle/declare
minimatchin the sdk-driver action's dependencies (or inline-vendor the glob it needs), and add a CI smoke test thatnode -e "require('.../copilot_sdk_driver.cjs')"to catch missing deps before release.🔴 RE-ESCALATED — Copilot SDK-driver tool-permission lockout (3 prod-main schedules)
The sdk-driver denies
shell(ls/cat)andreadon/tmp/gh-aw/cache-memorythat the workflows legitimately configure, aborting after 5 denials (turns=1). Was marked resolved 06-05; back today on Daily SPDD Spec Planner (permissionDeniedCount=10), GEO Optimizer Daily Audit, Daily Safe Output Integrator. The 23allow-toolentries derived fromGH_AW_COPILOT_SDK_SERVER_ARGSdon't reconcile with the workflows' declared tools.🟠 PERSISTING (day 2) — Model/param config drift, 2 prod-main schedules
400 'does not support the effort parameter'(run 27073730816)404 'Model not found adelie-alpha-2026-02-19'(run 27070791283)Both exhaust all retries with 0 tokens — static config errors. Unfixed for 2 consecutive days. A compile/activation-time model+param preflight would convert these into a clear early error.
Other failures (5) and notes
ERR_PARSE: No THREAT_DETECTION_RESULT found in detection logdespitecontinue-on-error=true. The detection model occasionally omits the required marker; the parser should treat a missing marker as inconclusive/non-red.safe_outputsjob failed on a singlesub_agent_error(insufficient_contexton PR feat: two-phase Copilot SDK driver for threat detection job #37133). One bad item still reds the whole job.agent_output({"items":[]}), turns=0, no stdio artifact — on the same branch where PR Code Quality Reviewer hit the minimatch crash, so likely the same sdk-driver-startup family.📊 Trends (30 days)
Workflow Health
Today's 83.5% is the lowest since the 06-03 partial window and sits below the 90% guide line, driven almost entirely by the copilot sdk-driver regression rather than agent-quality issues (errors=0, missing-tools=0). The fleet otherwise held 85–96% across the prior two weeks; the 05-23 trough (41.6%) remains the outlier.
Token Usage
Daily tokens (41.6M) sit near the 7-day moving average and well below the 05-31 peak (68.8M). Note this is a partial window, so true 24h volume is higher. Heaviest completed consumers: Matt Pocock Skills Reviewer (22.8M eff, 773 AIC), Contribution Check (22.2M), Daily Ambient Context Optimizer (21.5M).
Recommendations
minimatchin the copilot sdk-driver action and add arequire()smoke test — unblocks 6 workflows.shell/readaccess is granted — unblocks 3 prod-main schedules.effortparam andadelie-alpha-2026-02-19model id (now 2 days failing).THREAT_DETECTION_RESULTmarker undercontinue-on-error(don't red the run when the agent succeeded).safe_outputsexit non-red on partial success when the agent succeeded and ≥1 item landed.Context
known-issues.json(2 new),anomalies.json(3 new + 1 de-escalated),recommendations.json(2 new),workflow-trends.json,metrics-summary.json,audit-history.jsonl. Validated at 59 KB.References:
Beta Was this translation helpful? Give feedback.
All reactions