[copilot-cli-research] Copilot CLI Deep Research - 2026-04-15 #26497
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by Copilot CLI Deep Research Agent. A newer discussion is available at Discussion #26727. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Analysis Date: 2026-04-15 | Repository: github/gh-aw | Run: §24478512391
Previous Analysis: 2026-04-14 (§24423068207)
📊 Executive Summary
Scope: 191 total workflows — 101 explicit Copilot (91 string form + 10 block form) + 22 default (no engine specified) = ~123 effective Copilot workflows
Key Finding: The repository has a rich set of Copilot CLI capabilities but most advanced features remain consistently unused across all three days of tracking. Three persistent gaps stand out: zero version pinning (0% for 3+ days), minimal autopilot adoption (2%), and 8 of 11 custom agent files unwired. Meanwhile, two positive trends emerge: playwright adoption jumped from 4% → 10% and cache-memory usage grew from 19% → 29%.
Primary Recommendation: Enable
engine.versionpinning for at least critical production workflows to prevent silent regressions when the Copilot CLI auto-updates.🔴 Critical Findings (High Priority)
1. Zero Version Pinning — Persistent 3-Day Gap
Every single Copilot workflow runs with
version: latest(implicit default). With 101 Copilot workflows and no version pins, any breaking Copilot CLI release affects all workflows simultaneously.2. Bash Wildcard Overuse — Security Surface
37 of 191 workflows (19%) use
bash: ["*"]which compiles to--allow-all-tools, granting the agent unrestricted shell access. Every such workflow is more permissive than necessary.🟡 Medium Priority Opportunities
3.
max-continuationsAutopilot at 2% (Copilot-Exclusive Feature)Only 2 Copilot workflows use this feature (
smoke-copilotwithmax-continuations: 2,test-quality-sentinelwithmax-continuations: 40). This is a Copilot-exclusive capability enabling multi-phase autonomous runs — ideal for complex daily analysis or multi-step code improvement workflows.4. 8 of 11 Custom Agent Files Unused
.github/agents/contains 11 specialized agent files, but only 2 are wired viaengine.agent:ci-cleaner(hourly-ci-cleaner.md)technical-doc-writer(technical-doc-writer.md)adr-writeragentic-workflowscontribution-checkercreate-safe-output-typecustom-engine-implementationgrumpy-reviewerinteractive-agent-designerw3c-specification-writerdeveloper.instructionsEach unused agent file represents a specialized Copilot persona that could enable better focused behavior in matching workflows.
1️⃣ Feature Inventory & Usage Matrix
engine.versionengine.modelengine.agentengine.api-targetengine.argsengine.envengine.token-weightsengine.baremax-continuationssandbox.agent: awfbash: ["*"]wildcardweb-fetchplaywrightcache-memorysafe-outputsfeatures.copilot-requestsfeatures.mcp-gatewayfeatures.copilot-integration-idtools.timeoutstrict: trueshared/mcp/2️⃣ Detailed Missed Opportunities
🔴 High Priority Opportunities
Opportunity 1: Engine Version Pinning
engine.versionfor Copilot CLIOpportunity 2: Bash Wildcard Security Surface
bash: ["*"]→ compiles to--allow-all-tools(unrestricted shell)craft.md,smoke-copilot.md,smoke-copilot-arm.md, and 34 others"*"with specific commands:🟡 Medium Priority Opportunities
Opportunity 3: Autopilot (
max-continuations) for Multi-Phase Workflowsdaily-repo-chronicle.md— multi-phase analysis workflowweekly-blog-post-writer.md— multi-step creative workflowrepository-quality-improver.md— iterative improvement workflowOpportunity 4: Wire Up Custom Agent Files
What: 8 of 11
.github/agents/files never referenced viaengine.agentWhy It Matters: These files define specialized agent personas and behaviors that could improve output quality
Specific Recommendations:
contribution-checker.agent.mdcontribution-check.mdadr-writer.agent.mdgrumpy-reviewer.agent.mdpr-nitpick-reviewer.mdw3c-specification-writer.agent.mdspec-extractor.md,spec-librarian.mdagentic-workflows.agent.mdcraft.md,workflow-generator.mdHow to Implement:
Opportunity 5: Model Selection for Cost Optimization
gpt-5.1-codex-mini(×5),gpt-5(×1),gpt-4.1-mini(×1),claude-haiku-4-5(×1)gpt-4.1-miniorgpt-5.1-codex-minigpt-5Opportunity 6: Sandbox (AWF) Adoption for Internet-Facing Workflows
sandbox.agent: awfweb-fetch,playwright, external API MCP servers (Tavily, Brave, DeepWiki)🟢 Low Priority Opportunities
Opportunity 7:
token-weightsfor Accurate Cost Tracking (0% adoption)Opportunity 8:
tools.timeoutfor Long-Running Tool Calls (3% adoption)bashcommand can exhaust the entiretimeout-minutesbudgetmake build,make test, or complex scriptsOpportunity 9:
features.mcp-gateway(0% adoption)Opportunity 10:
bare: truefor Narrow-Scope Workflows--no-custom-instructions)firewall.md,smoke-*tests, validation workflows3️⃣ Workflow-Specific Recommendations
View Workflow Recommendations
contribution-check.md— Wire Up Matching Agent Fileengine: copilotwith GitHub MCPengine.agent: contribution-checkercontribution-checker.agent.mdfile in.github/agents/is purpose-built for this workflowcraft.md— Tighten Tool Permissionsbash: ["*"](unrestricted shell)git,gh,cat,finddaily-repo-chronicle.md— Enable Autopilotmax-continuations: 2-3withtimeout-minutes: 90research.md— Add Version Pinengine: copilotwith no version pinningdaily-*analysis workflows — Model Selectionmodel: gpt-4.1-minifor simple read + report workflows4️⃣ Trends & Historical Insights
View 3-Day Trend Analysis
Observation: The fluctuations in some metrics suggest different counting methodologies across runs (total Copilot pool vs explicit-only). The persistent gaps (version pinning, autopilot, custom agents) are the most reliable signals requiring action.
Notable positive trend: Playwright adoption recovery (+6pp day-over-day) suggests new visual testing workflows are being added.
5️⃣ Best Practice Guidelines
Based on 3 days of analysis, here are recommended best practices for Copilot workflows:
bash: ["git", "gh"]instead ofbash: ["*"]max-continuationsfor complex multi-phase workflows — This is a Copilot-exclusive feature that enables autonomous multi-run pipelines.github/agents/*.agent.mdfiles to their corresponding workflowsstrict: truebroadly — Already at 58% adoption, should be standard for all non-trivial workflowstools.timeout— Prevent single hung commands from consuming the entire job budget6️⃣ Action Items
Immediate (this week):
engine.version: "1.0.21"to at least the 3 smoke test workflowscontribution-checker.agent.mdtocontribution-check.mdbash: ["*"]incraft.mdwith specific command listShort-term (this month):
max-continuationsto 2-3 complex daily workflowstools.timeout: 300on workflows that run builds/testsLong-term (this quarter):
View Methodology & References
Research Methodology
*.mdfiles in.github/workflows/(191 total)pkg/workflow/copilot_engine.go,copilot_engine_execution.go,copilot_engine_tools.go,copilot_mcp.gopkg/constants/engine_constants.go,pkg/constants/feature_constants.godocs/src/content/docs/reference/engines.mdReferences
docs/src/content/docs/reference/engines.mdpkg/workflow/copilot_engine*.go.github/aw/github-agentic-workflows.mdReferences:
Beta Was this translation helpful? Give feedback.
All reactions