[prompt-clustering] Copilot Agent Prompt Clustering — Daily Analysis (2026-06-13) #39060
Closed
Replies: 1 comment
This discussion has been marked as outdated by Copilot Agent Prompt Clustering Analysis. A newer discussion is available at Discussion #39212.
Summary
NLP clustering of 991 Copilot-agent task prompts (PRs from the last 30 days, window 2026-05-26 → 2026-06-13). The agent's overall merge (success) rate held steady at 78.5%, in line with the trailing two-week average of ~79%.
Key Findings
.lock.yml outputs drive the most churn.
Success & Effort by Cluster
Trend (overall success rate)
Success rate has been stable in a 75–80% band for two weeks; no regression or improvement signal this run.
Per-cluster detail & representative PRs
1. Workflow & Prompt Engineering — 146/177 merged (82%), 3.53 commits/PR. Terms: workflow, prompt, guidance, workflows, skill, removed. Ex: #36748 Add portable agentic-workflow-designer skill, route wor...; #36727, #37326.
2. Code Refactor & Shared Helpers — 108/143 merged (76%), 3.68 commits/PR. Terms: package, function, string, helpers, line, behavior. Ex: #36012 Refactor ParseWorkflowFile orchestration into focused h...; #36177, #36144.
3. Token Budgeting & Cost (AIC) — 101/125 merged (81%), 3.97 commits/PR. Terms: credits, aic, token, effective, budget, cost. Ex: #37265 Update safe-output health failure messaging to AI Credits; #37101, #36042.
4. CI Steps & Job Configuration — 100/124 merged (81%), 3.61 commits/PR. Terms: step, job, workflow, env, conclusion, artifact. Ex: #37976 Derive omitted GitHub App owners from effective checkout...; #35270, #37408.
5. Safe-Outputs Tooling — 76/102 merged (75%), 3.58 commits/PR. Terms: safe, safe output, output, safe outputs, outputs, tool. Ex: #37122 Tighten safe-outputs noop contract for prompt-omission sc...; #36901, #36963.
6. Model Aliases & Regression Coverage — 74/93 merged (80%), 3.39 commits/PR. Terms: model, alias, coverage, regression, entries, regression coverage. Ex: #36388 Update 2026-06-02 model inventory: add missing Gemini pre...; #35826, #36226.
7. Engine SDK / Driver & Permissions — 59/79 merged (75%), 4.7 commits/PR. Terms: sdk, driver, permission, mode, command, behavior. Ex: #37322 Fix Copilot SDK headless auth/driver path and tool-permis...; #36538, #36731.
8. Root-Cause Bug Fixes (failing CI) — 38/54 merged (70%), 2.56 commits/PR. Terms: fix, root cause, actions, root, cause, job. Ex: #38397 [WIP] Fix failing GitHub Actions job 'js-typecheck'; #37674, #36647.
9. Firewall & Network Domains — 41/49 merged (84%), 6.18 commits/PR. Terms: domains, firewall, blocked, version, smoke, network. Ex: #37708 [awf] Bump firewall images to v0.25.66 and MCPG to v0.3.24; #35973, #35117.
10. Sous-Chef Generated Workflows — 35/45 merged (78%), 5.76 commits/PR. Terms: sous, chef, sous chef, generated sous, generated sous chef, generated. Ex: #35573 chore: update changeset workflow model to gpt-5.4; #37273, #37162.
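As a cross-check, the per-cluster counts listed above can be summed to reproduce the overall merge rate quoted in the Summary. The snippet below is a minimal sketch: the `clusters` dictionary and the `merge_rate` helper are illustrative names, not part of the analysis pipeline, and the counts are copied from the ten entries above.

```python
# (merged, total) PR counts per cluster, copied from the list above.
clusters = {
    "Workflow & Prompt Engineering": (146, 177),
    "Code Refactor & Shared Helpers": (108, 143),
    "Token Budgeting & Cost (AIC)": (101, 125),
    "CI Steps & Job Configuration": (100, 124),
    "Safe-Outputs Tooling": (76, 102),
    "Model Aliases & Regression Coverage": (74, 93),
    "Engine SDK / Driver & Permissions": (59, 79),
    "Root-Cause Bug Fixes (failing CI)": (38, 54),
    "Firewall & Network Domains": (41, 49),
    "Sous-Chef Generated Workflows": (35, 45),
}


def merge_rate(merged: int, total: int) -> float:
    """Return the merge (success) rate as a percentage."""
    return 100.0 * merged / total


total_merged = sum(merged for merged, _ in clusters.values())
total_prs = sum(total for _, total in clusters.values())
overall = merge_rate(total_merged, total_prs)

# List clusters from lowest to highest merge rate.
for name, (merged, total) in sorted(
    clusters.items(), key=lambda item: merge_rate(*item[1])
):
    print(f"{name:38s} {merged:3d}/{total:3d}  {merge_rate(merged, total):5.1f}%")

print(f"overall: {total_merged}/{total_prs} = {overall:.1f}%")
```

The totals come to 778 merged out of 991 PRs, which matches the 78.5% figure and the 991 usable prompts reported elsewhere in this post.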
Methodology & data quality
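The usable-prompt rule described in this section (strip code, URLs, and HTML, then keep prompts of at least 40 characters) could look roughly like the sketch below. The regular expressions and the names `clean_prompt` and `usable_prompts` are assumptions made for illustration; the report does not show the actual cleaning rules, and agent-preamble stripping is left out because its format is not given.

```python
import re

MIN_CHARS = 40  # usable-prompt threshold stated in this section

# Hypothetical patterns; the real pipeline's rules are not shown in the report.
FENCED_CODE = re.compile(r"`{3}.*?`{3}", re.DOTALL)  # fenced code blocks
URL = re.compile(r"https?://\S+")
HTML_TAG = re.compile(r"<[^>]+>")


def clean_prompt(body: str) -> str:
    """Strip fenced code, URLs, and HTML tags, then collapse whitespace."""
    text = FENCED_CODE.sub(" ", body)
    text = URL.sub(" ", text)
    text = HTML_TAG.sub(" ", text)
    return " ".join(text.split())


def usable_prompts(bodies):
    """Return cleaned prompts that meet the minimum length."""
    cleaned = (clean_prompt(body) for body in bodies)
    return [text for text in cleaned if len(text) >= MIN_CHARS]
```

Under this sketch an empty PR body cleans to an empty string and is dropped, which is consistent with the empty bodies the report says were excluded.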
<issue_title>/<issue_description>, stripped agent preamble, code, URLs, HTML. 991/1000 PRs had a usable prompt (≥40 chars); 9 empty bodies dropped.
min_df=3, max_df=0.6, 400 features, extended stop-list.
Recommendations
References: §27464239047
Generated by Prompt Clustering Analysis (Run: 27464239047)