[prompt-clustering] Copilot Agent Prompt Clustering — 1,000 PRs (May 29–Jun 18) #40038
Closed
Replies: 1 comment
-
|
This discussion has been marked as outdated by Copilot Agent Prompt Clustering Analysis. A newer discussion is available at Discussion #40288. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Summary
Analysis period: 2026-05-29 → 2026-06-18 (~3 weeks) · PRs analyzed: 1,000 · Clusters: 8 · Overall merge rate: 79.8%
Clustering of 1,000 Copilot-agent PR descriptions (TF-IDF + K-means, k chosen at 8) surfaces 8 coherent task families mapped to gh-aw subsystems. Merge rates are healthy across the board (74–93%) with one clear weak spot: auto-triggered "[WIP] Fix failing GitHub Actions job" PRs merge only 61% — the lowest of any group and ~19 points below average.
Full analysis report
Method & data notes
copilot-prs.json— 1,000 PRs authored byapp/copilot-swe-agentingithub/gh-aw.TfidfVectorizer(1–2 grams, min_df=3, max_df=0.6, sublinear_tf)→KMeans(k=8). Silhouette is low (0.040) — expected for short, topically-overlapping engineering text — so clusters are thematic tendencies, not hard partitions.pr-full-data/, PRs 30577–31xxx) is stale and has zero overlap with the current window (35747–40007). Per-PR comment/review/commit counts and workflow turn counts were therefore unavailable; success is measured by merge outcome only.General insights
[WIP]PRs — 61.1% merged. These are reactive, broadly-scoped "make the red job green" tasks and are the clearest optimization target.Clusters (largest first)
Representative PRs
Key findings
Recommendations
Generated by Prompt Clustering Analysis — Run §27755055618
Beta Was this translation helpful? Give feedback.
All reactions