[copilot-cli-research] Copilot CLI Deep Research - 2026-04-21 #27681
Replies: 2 comments
-
|
Greetings all. I came across an interesting resource that can be used as a Google Scholar alternative when bulk access to scholarly literature is needed. ScholarAPI (scholarapi.net?via=mhsvo0) is essentially an API over a large corpus of academic literature (metadata +PDFs), aggregated from thousands of journals and repositories worldwide. If you’re doing large-scale literature work (bibliography searches, text mining& AI, reviewer support), it might save a lot of time compared with manual searches or scraping Google Scholar. It also has guides to typical work flows like literature monitoring, AI fine-tuning, and plagiarism checking, which could be useful for research tooling or library/IT projects. scholarapi.net?via=mhsvo0 |
Beta Was this translation helpful? Give feedback.
-
|
This discussion has been marked as outdated by Copilot CLI Deep Research Agent. A newer discussion is available at Discussion #27897. |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Analysis Date: 2026-04-21
Repository: github/gh-aw
Scope: 197 total workflows — 111 using Copilot engine (87 explicit + 24 via default), 46 Claude, 10 Codex
Previous Analysis: 2026-04-20 Run §24690376692 ← compare trends below
📊 Executive Summary
This is the 5th consecutive daily run of this research agent. The persistent gaps from previous analyses remain unchanged:
engine.versionis still at 0%,api-targetat 0%,blocked-domainsat 0%, andmax-continuationsat only 2 workflows (1%). Meanwhile, positive trends continue:mcp-clifeature is at 80%,copilot-requestsis at 23%, andcache-memoryusage is strong.The biggest newly-surfaced finding this run: 45 Copilot workflows have no network configuration at all — neither
network:restrictions norsandbox: agent: awf. This means those workflows have unrestricted outbound network access, which is a security posture gap especially for workflows triggered by external events (issues, PR comments, slash commands).Primary Recommendation: Address the 45 unrestricted workflows by adding at minimum
network: defaultsor upgrading them to use AWF sandbox. This is the clearest, highest-impact, actionable improvement available today.Critical Findings
🔴 High Priority
1. 45 Copilot Workflows With Zero Network Restrictions
Workflows triggered by external events (issues, PRs, slash commands) that have no
network:config and nosandbox:config have unrestricted outbound access. An adversary who can trigger the workflow (e.g., via issue body injection) could potentially exfiltrate context.2.
engine.versionStill 0% After 5 Days (Critical Stability Risk)No production workflow pins the Copilot CLI version. A breaking release could simultaneously break all 111 Copilot-powered workflows with no rollback path. This has been flagged in every prior run.
🟡 Medium Priority
3.
max-continuationsUsed by Only 2/111 Workflows (1%)Eight workflows have
timeout-minutes≥ 60, suggesting complex long-running tasks — yet none usemax-continuations(Copilot's unique autopilot mode). This feature allows iterative task continuation across multiple agent sessions.4. 5 Custom Agent Files Completely Unused
grumpy-reviewer,w3c-specification-writer,create-safe-output-type,custom-engine-implementation, andinteractive-agent-designerare available in.github/agents/but referenced by zero workflows.View Full Analysis
1️⃣ Current State Analysis
View Copilot CLI Capabilities Inventory
Copilot CLI Capabilities Inventory
Engine Configuration Options (
engine:block):engine.id: copilotengine.versionengine.modelengine.agent.github/agents/engine.argsengine.envengine.bare--no-custom-instructions)engine.commandengine.api-targetengine.drivermax-continuations--autopilot --max-autopilot-continuesengine.token-weightsengine.concurrencyCLI Flags Automatically Applied (by gh-aw compiler):
--add-dir /tmp/gh-aw/— always added--disable-builtin-mcps— always added (built-in MCP servers disabled)--no-ask-user— added for v1.0.19+ (fully autonomous mode)--allow-all-tools— whenbash: ["*"]orbash: [":*"]--allow-tool <name>— per-tool permissions (granular)--allow-all-paths— whenedit:tool is enabled--no-custom-instructions— whenbare: true--autopilot --max-autopilot-continues N— whenmax-continuations > 1--agent <id>— whenengine.agentis set--prompt-file— always (prompt passed via file)--log-level all --log-dir— always (structured logging)Sandbox Options:
sandbox: agent: awf— Agent Workload Firewall (process isolation + network firewall)sandbox: agent: srt— Sandbox Runtime (experimental)Network Configuration:
network: defaults— Use default ecosystem domainsnetwork: allowed: [defaults, github, node, python, go, ...]— Custom allowlistnetwork: {}— Deny all (except defaults implicitly)Features Flags:
features.mcp-cli: true— Mount MCP servers as CLI commandsfeatures.copilot-requests: true— Usegithub.tokeninstead ofCOPILOT_GITHUB_TOKENView Usage Statistics
Usage Statistics (2026-04-21 vs 2026-04-20)
engine: copilot)engine.versionpinnedengine.modeloverrideengine.agentcustom fileengine.baremax-continuationssandbox: agent: awfmcp-scripts:features.mcp-cli: truefeatures.copilot-requests: truecache-memory:strict: truenetwork:explicitly configuredweb-fetch:toolweb-search:tooltoolsets: [default]onlyMost Common GitHub Toolset Combinations:
[default]— 45 workflows (broadest, least specific)[default, actions]— used for CI/log analysis[default, discussions]— used for community workflows[pull_requests, repos, issues]— used for PR review workflows[context, pull_requests]— security/code scanning workflows2️⃣ Feature Usage Matrix
3️⃣ Missed Opportunities
View High Priority Opportunities
🔴 Opportunity 1: Network Security for 45 Unrestricted Workflows
network:config norsandbox: agent: awfslash_command,issues,pull_request,issue_commentnetwork: defaults, or upgrade to AWF:🔴 Opportunity 2:
engine.versionPinning (Stability)View Medium Priority Opportunities
🟡 Opportunity 3:
max-continuationsfor Long-Running Tasksmax-continuations). Eight workflows have 60+ minute timeouts — these could benefit.max-continuationsenables iterative task completion where one session ends and another picks up, allowing complex tasks to complete reliably without hitting context limits.agent-persona-explorer.md(180 min timeout) — ideal candidateaw-failure-investigator.md(60 min) — diagnostic tasks benefit from continuationdaily-team-evolution-insights.md(90 min) — data analysis benefits from iterationorg-health-report.md(60 min) — large report generation🟡 Opportunity 4: Unused Custom Agent Files
.github/agents/are unused.grumpy-reviewer.agent.md— could power a strict code review workfloww3c-specification-writer.agent.md— could power spec-writing workflowscreate-safe-output-type.agent.md— could automate new output type creationcustom-engine-implementation.agent.md— could guide engine developmentinteractive-agent-designer.agent.md— could power interactive agent designgrumpy-revieweras an alternative agent option in code review workflows for more thorough critique.🟡 Opportunity 5: Over-Provisioned GitHub Toolsets
toolsets: [default]which provides broad GitHub access (repos, issues, pull_requests, context). Many only need a subset.View Low Priority Opportunities
🟢 Opportunity 6:
bareMode for Analytical/Creative Workflowsbare: true(disables custom instructions loading). Many creative, analytical, and standalone workflows could use this.AGENTS.mdor.github/copilot-instructions.mdwhich are designed for development workflows, not analytical ones.poem-bot.md,daily-fact.md,agent-persona-explorer.md,constraint-solving-potd.md(already uses it),daily-news.md(already uses it)bare: trueto workflow frontmatter🟢 Opportunity 7:
blocked-domainsfor Defense in Depthnetwork.blockedto explicitly deny specific domains.allowedlists, blocking known bad or unnecessary domains adds another security layer.🟢 Opportunity 8:
mcp-scriptsfor Dynamic Tool Accesssecurity-review.mdusesmcp-scripts. This feature allows runtime-configurable MCP tools without pre-compiling new MCP server configs.🟢 Opportunity 9:
web-searchvsweb-fetchfor Researchweb-fetchbut only 2 useweb-search. Research-oriented workflows usingweb-fetchto manually construct URLs may benefit from semantic search.research.md,blog-auditor.md, workflows that search documentation🟢 Opportunity 10: Model Override for Cost Optimization
engine.modeloverride. For simple/quick tasks, a smaller model (e.g.,gpt-4.1-mini) can reduce costs significantly.auto-triage-issues.md(already usesgpt-4.1-mini✅), classification workflows, simple generation tasks4️⃣ Specific Workflow Recommendations
View High-Value Workflow-Specific Recommendations
agent-persona-explorer.md(180 min timeout)max-continuationsmax-continuations: 10if migrating to Copilot, or verify Claudemax-turnsis set appropriatelyaw-failure-investigator.md(60 min timeout, Copilot)engine: copilot, nomax-continuationsmax-continuations: 3to allow iterative investigation when initial investigation times outorg-health-report.md(60 min timeout)max-continuations: 3for large organization analysisdaily-security-red-team.md(60 min timeout, no network config)network:config, nosandbox:code-simplifier.md(uses onlytoolsets: [default])[default]toolsetstoolsets: [repos]since it only needs file accessglossary-maintainer.mdandtechnical-doc-writer.md5️⃣ Trends & Insights
View Historical Trends (5-Day Analysis)
Key Observations:
engine.versionhas been at 0% for every single analysis run — this is the most persistent gapmax-continuationsat 2 workflows — Copilot's most unique differentiator feature is barely usedstrict modedeclined slightly from previous run (66%→59%) — needs investigationengine.argsandengine.envadoption started on Apr 20 and stabilized6️⃣ Best Practice Guidelines
Based on 5 daily research runs, these are the recommended best practices for Copilot workflows:
network: allowed: [defaults, github]at minimum, and AWF sandbox for workflows processing untrusted content[context],[issues],[repos]) rather than[default]engine.versionfor daily/weekly production workflows; uselatestonly for test/smoke workflowsbare: truefor creative, analytical, or content-generation workflows that don't need dev-environment context.github/agents/*.agent.mdfilemax-continuationsfor complex tasks: Any workflow withtimeout-minutes: 60+should considermax-continuations: 3-5copilot-requests: truefor internal/trusted workflows (usesgithub.tokeninstead ofCOPILOT_GITHUB_TOKENsecret)7️⃣ Action Items
Immediate Actions (this week):
network: defaultsto the ~45 Copilot workflows without any network configengine.versionfor at least the most critical daily/weekly workflowsdaily-security-red-team.md(security workflow should not have unrestricted network)Short-term (this month):
max-continuations: 3-5toaw-failure-investigator.md,org-health-report.md,daily-team-evolution-insights.mdgrumpy-reviewer.agent.mdandinteractive-agent-designer.agent.mdtoolsets: [default]in the 45 over-provisioned workflowsLong-term (this quarter):
mcp-scriptsfor dynamic tool access patterns in complex research workflowsengine.versionpinning as a required best practiceView Supporting Evidence & Methodology
📚 References
pkg/workflow/copilot_engine.gopkg/workflow/copilot_engine_execution.godocs/src/content/docs/reference/engines.mdmemory/copilot-cli-researchResearch Methodology
copilot_engine.go,copilot_engine_execution.go,copilot_engine_tools.go,copilot_mcp.goandengine.goto catalog all available Copilot CLI features and configuration options.mdworkflows in.github/workflows/with grep patterns to count adoption of each feature/tmp/gh-aw/repo-memory/default/copilot-research-latest.json) to compare with today's findingsData sources: Go source files, workflow markdown frontmatter,
.github/agents/directory, previous repo-memory JSON.References:
Beta Was this translation helpful? Give feedback.
All reactions