[prompt-clustering] Copilot Agent Prompt Clustering Analysis - 2025-12-10 #6059
Closed
Replies: 1 comment
-
|
⚓ Avast! This discussion be marked as outdated by Copilot Agent Prompt Clustering Analysis. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Daily NLP-based clustering analysis of copilot agent task prompts using K-means clustering and TF-IDF vectorization.
Summary
Analysis Period: Historical data (1,811 PRs)
Total Tasks Analyzed: 1789
Clusters Identified: 6
Overall Success Rate: 75.0%
Merged PRs: 1341
Closed (not merged): 372
Open PRs: 76
Key Findings
Full Analysis Report
Cluster Analysis
Cluster 4: New Features
Size: 806 tasks (45.1% of total)
Success Metrics:
Complexity Metrics:
Top Keywords: update, add, agent, step, github, firewall, use, make
Characteristics: Tasks in this cluster typically involve new features
Example PRs:
Cluster 6: Bug Fixes
Size: 398 tasks (22.2% of total)
Success Metrics:
Complexity Metrics:
Top Keywords: gh, gh aw, aw, issue, section, githubnext gh aw, githubnext gh, githubnext
Characteristics: Tasks in this cluster typically involve bug fixes
Example PRs:
Cluster 3: Updates & Modifications
Size: 216 tasks (12.1% of total)
Success Metrics:
Complexity Metrics:
Top Keywords: agentic, agentic workflow, workflow, workflows, update, create, shared, use
Characteristics: Tasks in this cluster typically involve updates & modifications
Example PRs:
Cluster 1: Updates & Modifications
Size: 201 tasks (11.2% of total)
Success Metrics:
Complexity Metrics:
Top Keywords: cli, version, code, changes, duplicate, update, analysis, duplicate code
Characteristics: Tasks in this cluster typically involve updates & modifications
Example PRs:
Cluster 2: CI/CD & Workflows
Size: 109 tasks (6.1% of total)
Success Metrics:
Complexity Metrics:
Top Keywords: mcp, server, mcp server, tool, github, safe, tools, json
Characteristics: Tasks in this cluster typically involve ci/cd & workflows
Example PRs:
Cluster 5: Testing & Test Coverage
Size: 59 tasks (3.3% of total)
Success Metrics:
Complexity Metrics:
Top Keywords: fix, tests, format, javascript, test, workflows, issues, reference
Characteristics: Tasks in this cluster typically involve testing & test coverage
Example PRs:
Success Rate by Cluster
Sample Data Table
Sample of analyzed PRs with cluster assignments:
Detailed Insights
1. Task Complexity Patterns
2. Success Rate Patterns
3. Review and Collaboration Patterns
4. Task Distribution
Distribution of tasks across clusters:
Recommendations
Based on the clustering analysis, here are actionable recommendations:
1. Leverage High-Success Patterns
Tasks in the Testing & Test Coverage cluster show 84.7% success rate.
Consider applying similar prompt patterns to other task types.
2. Improve Low-Success Clusters
The CI/CD & Workflows cluster has only 65.1% success rate.
Investigate common failure patterns and refine prompts or provide more context for these tasks.
3. Optimize for Complexity
Tasks in the CI/CD & Workflows cluster are most complex (avg 29.1 files).
Consider breaking down complex tasks into smaller, more manageable subtasks.
4. Focus on Dominant Task Types
New Features represents 45.1% of all tasks.
Optimizing prompts for this category will have the largest impact on overall success rates.
Methodology
NLP Techniques Used:
Data Sources:
Generated by Prompt Clustering Analysis Workflow on 2025-12-10 19:23 UTC
Beta Was this translation helpful? Give feedback.
All reactions