[prompt-clustering] Copilot Agent Prompt Clustering — 994 PRs, 7 themes (last 30d) #40288
Closed
This discussion has been marked as outdated by Copilot Agent Prompt Clustering Analysis. A newer discussion is available at Discussion #40457.
Summary
Analysis period: last 30 days (2026-05-30 → 2026-06-19)
Copilot agent PRs analyzed: 994 (of 1,000 fetched; 6 had no usable task text)
Clusters identified: 7 (k chosen over 5–7 by silhouette)
Overall merge-success rate: 80.4% (794 merged / 199 closed / 7 open)
Avg commits per PR: 3.82 · Avg files changed: 39.7
All PRs were authored by app/copilot-swe-agent in github/gh-aw. Task prompts were extracted from PR bodies (Copilot's "Thanks for asking..." boilerplate stripped; the original task usually survives as a blockquote), vectorized with TF-IDF (1–2 grams), and clustered with K-means.
Key findings
[WIP] debugging PRs are frequently abandoned/closed rather than iterated to a merge.
max-ai-credits / AIC budgets across workflows.
Full cluster analysis & data
Methodology & caveat
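The summary describes the pipeline as: pull the task out of the PR body, vectorize it over 1–2 grams, cluster, and pick k by silhouette. The sketch below illustrates three of those steps using only the standard library. It is not the analysis script itself: the function names, the sample PR body, and the assumption that the task lives in Markdown blockquote lines (`> ...`) are all illustrative. The TF-IDF weighting and K-means steps are omitted.

```python
import re


# Assumption: the task text is carried in Markdown blockquote lines ("> ...").
def extract_task(pr_body: str) -> str:
    """Join the blockquoted lines of a PR body into one task string ('' if none)."""
    quoted = []
    for line in pr_body.splitlines():
        match = re.match(r"\s*>\s?(.*)", line)
        if match and match.group(1).strip():
            quoted.append(match.group(1).strip())
    return " ".join(quoted)


def one_two_grams(text: str) -> list[str]:
    """Lowercase and tokenize, then return unigrams followed by bigrams."""
    tokens = re.findall(r"[a-z0-9][a-z0-9_-]*", text.lower())
    bigrams = [f"{a} {b}" for a, b in zip(tokens, tokens[1:])]
    return tokens + bigrams


def pick_k(silhouette_by_k: dict[int, float]) -> int:
    """Return the candidate k with the highest silhouette score."""
    return max(silhouette_by_k, key=silhouette_by_k.get)


# Hypothetical PR body: boilerplate first, then the original task as a blockquote.
body = "Thanks for asking...\n\n> Fix the failing lint job\n> in the release workflow\n"

print(extract_task(body))
print(one_two_grams("fix lint job"))
print(pick_k({5: 0.022, 6: 0.025, 7: 0.028}))
```

With the scores reported in this section, `pick_k` returns 7, which matches the cluster count in the summary. The margin over k = 5 and k = 6 is small, so the choice should be read as a weak preference.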
Silhouette by k: {5: 0.022, 6: 0.025, 7: 0.028}. Separation is weak (silhouette ≈ 0.03), which is expected for short, jargon-dense technical prompts that share vocabulary ("workflow", "agent", "schema"). Treat clusters as themes, not hard partitions.
gh-aw logs does not apply.
commits_count is used as the iteration proxy instead.
Clusters (largest first)
C3 — Schema / validation / docs (catch-all)
C4 — Prompt & safe-output / engine guidance
C1 — Workflow jobs (sous-chef, footer, forecast)
C0 — AI credits / budget guardrails
C6 — Copilot SDK driver / harness
C2 — Firewall / network domains ⭐ highest success
C5 — Fix failing Actions jobs ⚠️ lowest success
Success rate by cluster
Recommendations
[WIP] PRs often stall. Give them richer up-front context (failing log excerpt + suspected root cause in the prompt, not just a check-run URL) before dispatch, and consider auto-closing stale ones.
References: §27822078061