You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Iteration proxy: commits/PR — Copilot-agent PRs don't map 1:1 to gh-aw runs, so turn-counts aren't joined.
Key findings
~62% of work sits in two clusters: generic tests/schema/docs (C2, 39%) and workflow/prompt engineering (C4, 23%). The agent is mostly used for in-repo maintenance and for building more workflows.
Merge rate is flat (73–83%) across themes — task type barely predicts outcome; nothing is failing badly.
Weakest clusters are the most intricate subsystems:safe-outputs/MCP (C3, 73%) and SDK/harness (C6, 74%). Best: observability (C7, 83%) and AI-credit (C1, 82%).
Effort ≠ size. Dependency bumps (C0) touch ~151 files but need the fewest commits (3.1, mechanical); the SDK cluster (C6) has small diffs yet the most iterations (5.3 commits, 5.4 comments) — genuinely hard.
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
Uh oh!
There was an error while loading. Please reload this page.
-
🧩 Copilot Agent Prompt Clustering — Daily Analysis
Period: last 30 days (2026-05-27 → 06-16) · Repo:
github/gh-aw· PRs: 1000 (100% full data)Summary
Key findings
Clusters
Representative PRs per cluster
Largest-diff outliers: #36006 (+1.6M, 5272 files) · #36065 · #38369
Recommendations
Methodology: TF-IDF (1–2 grams, 600 feats, min_df=3) over 3×-weighted title + cleaned body; firewall/code blocks stripped; clusters validated by manual term/example review. Low silhouette is expected for sparse text.
References: §27614318724
Beta Was this translation helpful? Give feedback.
All reactions