[copilot-cli-research] Copilot CLI Deep Research - 2026-06-15 #39328
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it expired on 2026-06-16T05:46:05.568Z.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Analysis Date: 2026-06-15 | Run: §27525865107
Scope: 246 total workflows, 133 using Copilot engine (54%)
📊 Executive Summary
This is the 7th deep-research run. Three metrics regressed vs the previous run (06-10); persistent feature gaps remain unchanged for 18+ consecutive runs.
engine.agentmax-ai-creditssandbox🔴 High Priority
1. Investigate regressions —
engine.agentdropped 34→21,max-ai-creditsdropped 14→6, andsandboxdropped 20→15 without a corresponding change in total workflow count (+1 net). Likely caused by recent bulk workflow updates. Audit commits between 06-10 and 06-15.2.
max-tool-denialsmissing from ALL 63 SDK workflows — Everycopilot-sdk: trueworkflow lacks the SDK-specific guard against infinite tool-denial loops. The default is 5 but must be set explicitly.3. 66/133 workflows (50%) have no security config — No
sandbox, nonetwork. Half of all Copilot workflows allow unrestricted outbound access, enabling prompt injection exfiltration.4. Budget controls nearly absent — 127/133 (95%) workflows lack
max-ai-credits. Runaway agents can exhaust monthly budgets.🟡 Medium Priority
5.
engine.args— 18 consecutive zero-usage runs (PERSISTENT). Custom CLI args fully supported since v0.36 but never used in production.["--share"]would enable session tracking.6.
engine.api-target— 18 consecutive zero-usage runs (PERSISTENT). Required for GHE/GHES/GHEC-with-data-residency deployments but never configured.7.
engine.harness— Never used. Custom startup hooks and retry logic beyond the built-in CAPIError 400 retry have never been exercised.8. 5 orphaned agent files (unchanged for 7 runs):
grumpy-reviewer.agent.mdinteractive-agent-designer.agent.mdw3c-specification-writer.agent.mdcreate-safe-output-type.agent.mdcustom-engine-implementation.agent.mdWire these to workflows or remove them to reduce confusion.
9.
min-integrityonly 17% — Only 22/133 Copilot workflows require PR approval before acting. PR-triggered workflows without this are vulnerable to prompt injection.🟢 Low Priority
10.
engine.token-weights— Never used. BYOK workflows using non-standard models get inaccurate AIC cost accounting.11.
web-fetchonly 8% — 10/133 workflows. Research/documentation workflows frequently fetch external content but don't declare the capability.12.
engine.versionpin — 0 Copilot CLI pins. The 12version:fields found are Node/Go/Python runtime versions, not engine pins. Production workflows may break on new Copilot CLI releases.📈 Feature Usage Matrix (133 Copilot workflows)
strict: truenetworkconfigcopilot-sdkengine.agentbaremin-integritymax-continuationsmax-ai-creditssandboxweb-fetchengine.argsengine.api-targetengine.harnessengine.token-weightsmax-tool-denials📅 7-Run Historical Trend
Persistent zero-usage features:
engine.args(18 runs),engine.api-target(18 runs),engine.harness(all runs),engine.token-weights(all runs).✅ Action Items
This week:
engine.agentdropped 34→21 (check commits 06-10 to 06-15)max-ai-creditsdropped 14→6 (restore where removed inadvertently)max-tool-denials: 5to all 63copilot-sdk: trueworkflowsThis month:
max-ai-credits: 500to the 127 workflows missing itnetwork: allowed: [defaults]to the 66 workflows with no security config.agent.mdfilesThis quarter:
engine.argshas 18 consecutive zero-usage runsengine.harnesswith a smoke workflowengine.api-targetfor GHE/GHES use casesengine.versionpins to production-critical workflowsReferences: §27525865107 | Research history in
memory/copilot-cli-researchbranchBeta Was this translation helpful? Give feedback.
All reactions