[prompt-clustering] Copilot Agent Prompt Clustering Analysis - 2026-02-13 #15296
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it expired on 2026-02-20T05:03:58.699Z.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Daily NLP-based clustering analysis of copilot agent task prompts to identify patterns, success factors, and opportunities for optimization.
Executive Summary
Analyzed 1,354 copilot-created PRs from the last 30 days using advanced NLP clustering techniques (TF-IDF vectorization + K-means). Identified 7 distinct task patterns with varying success rates (56%-72%). Overall agent performance shows 67.1% success rate with notable differences across task types.
Key Findings:
Analysis Metrics
Cluster Performance Overview
Legend: ✅ High (≥70%) |⚠️ Moderate (60-69%) | ❌ Low (<60%)
Detailed Cluster Analysis
Cluster 2: Mixed (Workflows) - 25.0% of tasks
Performance
Why It Works
Representative Examples
Cluster 1: Test - 19.6% of tasks
Performance
Why It Works
Representative Examples
Cluster 5: Fix - 22.3% of tasks⚠️ NEEDS ATTENTION
Performance
Why It Struggles
Improvement Opportunities
Representative Examples
Cluster 4: Mixed (MCP) - 18.3% of tasks
Performance
Representative Examples
Cluster 7: Update - 5.5% of tasks
Performance
Representative Examples
Cluster 6: Docs - 5.1% of tasks
Performance
Representative Examples
Cluster 3: Remove - 4.2% of tasks
Performance
Why More Iterations
Representative Examples
Recommendations
🎯 Priority 1: Improve Fix Task Success Rate (Currently 56%)
Problem: Fix tasks have the lowest success rate at 56%, representing 302 tasks (22.3% of all work).
Root Causes:
Action Items:
Template for Fix Tasks:
Two-Phase Approach:
Better Scoping: Break "fix multiple issues" into separate focused PRs
✅ Priority 2: Replicate Success Patterns
Successful Clusters: Test (72.1%) and Mixed Workflows (72.3%)
Success Factors:
Action Items:
📊 Priority 3: Monitor Iteration Counts
Observation: Remove tasks require 5.1 commits on average (42% more than average).
Action Items:
🔍 Priority 4: Improve Prompt Quality
General Observations:
Action Items:
Prompt Checklist:
Prompt Engineering Training: Share successful prompt examples
Methodology
Data Collection
Text Processing
Clustering Analysis
Metrics
Next Steps
Analysis Run: §21974945192
Beta Was this translation helpful? Give feedback.
All reactions