Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 1 month ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Agentic Workflow Lock File Statistics - 2025-10-19
Executive Summary
File Size Distribution
Statistics:
Interpretation
The majority of lock files (94.6%) are substantial in size, ranging from 100 KB to over 346 KB. This indicates that most agentic workflows are feature-rich with comprehensive agent instructions, multiple safe output configurations, and extensive MCP integrations. The absence of small lock files (<50 KB) suggests that even the simplest agentic workflows require significant configuration.
Trigger Analysis
Most Popular Triggers
Note: Percentages sum to more than 100% because workflows can have multiple triggers.
Common Trigger Combinations
/brave,/plan)Schedule Patterns
0 10 * * *0 9 * * 00 9 * * 1-50 9 * * *0 3 * * *Insight: Most scheduled workflows run during morning hours (9-10 AM UTC), likely designed to provide daily summaries or updates at the start of the workday.
Safe Outputs Analysis
Safe Output Types Distribution
Key Findings
Universal Tool Reporting: 97.3% of workflows enable the
missing_toolsafe output, allowing agents to report when they encounter missing capabilities.Discussion vs Issue Creation: Create discussion (32.4%) and create issue (32.4%) are equally popular, suggesting workflows are split between using GitHub Discussions for broader conversations and Issues for actionable tracking.
PR Automation: 24.3% of workflows can create pull requests autonomously, demonstrating significant code modification capabilities.
Multi-modal Outputs: Many workflows combine multiple safe output types, enabling agents to interact through various GitHub interfaces (comments + issues + discussions).
Structural Characteristics
Job Complexity
Distribution Analysis
Average Lock File Structure
Based on statistical analysis, a typical .lock.yml file has:
{}),read-allfor agent jobStructural Insights
Standard Pattern: The 6-job pattern appears in 54% of workflows, representing a canonical agentic workflow structure:
Complexity Outliers:
Step Efficiency: Average 9 steps per job indicates well-organized job definitions with focused responsibilities.
Permission Patterns
Permission Configuration
permissions: {})read-allSecurity Model
Principle of Least Privilege: 100% of workflows use empty top-level permissions, restricting access by default. The agent job then explicitly grants
read-allpermissions (94.6% of cases), enabling agents to:Write Operations: All write operations (creating issues, comments, PRs) are handled through safe-output MCP tools, which use separate authentication mechanisms, maintaining a security boundary between agent read access and write capabilities.
Notable Exception: Two workflows (test-jqschema, test-post-steps) don't specify
read-allfor the agent job, likely because they're minimal test workflows with limited requirements.Tool & MCP Patterns
Most Used MCP Servers
Engine Distribution
Insight: GitHub Copilot dominates as the primary agent engine (67.6%), with Claude as the secondary option (24.3%). This suggests the repository is optimized for Copilot-based agentic workflows.
Common Tool Configurations
Core Toolset (present in ~95% of workflows):
Extended Toolset (present in research/web-enabled workflows):
Specialized Tools (workflow-specific):
Timeout Configuration
Average Timeout: 12.16 minutes
Analysis:
Interesting Findings
Standardization Achievement: Despite 37 diverse workflows with different purposes, there's remarkable consistency in structure:
The "Missing Tool" Pattern: 97.3% of workflows enable the
missing_toolsafe output, indicating a culture of continuous improvement where agents can report gaps in their capabilities back to developers.Size-Complexity Correlation: There's a strong correlation between file size and structural complexity:
The Poem Bot Anomaly: The
poem-bot.lock.ymlworkflow is an outlier in multiple dimensions:Morning Automation Pattern: 75% of scheduled workflows run between 9-11 AM UTC, suggesting these agentic workflows are designed to provide morning briefings, daily summaries, or workday preparation tasks.
Workflow Dispatch Dominance: 78.4% of workflows support manual triggering, indicating that while automation is valuable, human-in-the-loop control remains a priority.
Historical Trends
Note: This is the first comprehensive analysis. Future analyses will compare trends over time to track:
This baseline data has been stored in
/tmp/gh-aw/cache-memory/history/2025-10-19.jsonfor future comparison.Recommendations
For Workflow Authors
Adopt the Standard Pattern: The 6-job pattern (pre_activation → activation → agent → detection → outputs → missing_tool) has emerged as a best practice used by 54% of workflows. New workflows should consider adopting this pattern for consistency.
Enable Missing Tool Reporting: If not already present, add
missing_toolto your safe outputs configuration to enable agents to report capability gaps.Consider Manual Triggers: 78.4% of workflows include
workflow_dispatch. If your workflow might benefit from manual execution, add this trigger.Optimize Timeouts: The average timeout is 12 minutes. Review your workflow's timeout settings:
For Repository Maintainers
Template Workflow: Create a reference template based on the most common patterns:
Size Guidelines: Lock files naturally range from 100-350 KB. Files outside this range may indicate:
Documentation: Document the canonical 6-job pattern and when to deviate from it (e.g., poem-bot's 14-job structure for specialized use cases).
MCP Server Discovery: Only 13.5% of workflows use web search (Tavily). Consider promoting web-search capabilities for research-oriented workflows.
For Future Development
Consolidate Schedule Times: 11 different cron schedules are in use. Consolidating to a few standard times (e.g., 9 AM, 10 AM) could simplify maintenance and reduce confusion.
Standardize Safe Output Combinations: Some safe output combinations appear frequently (e.g., create_discussion + missing_tool). Consider creating standard configuration presets.
Engine Abstraction: With 67.6% using Copilot and 24.3% using Claude, ensure workflows remain engine-agnostic or clearly document engine requirements.
Methodology
.github/workflows/*.lock.yml/tmp/gh-aw/cache-memory/for script persistence and data storageAnalysis Scripts
All analysis scripts have been preserved in
/tmp/gh-aw/cache-memory/scripts/for reproducibility and future analyses.Data Files Generated
file_sizes_bytes.txt- Raw file size datatrigger_analysis.txt- Trigger combinations per workflowsafe_outputs.txt- Safe output configurationsstructure.txt- Job and step countspermissions.txt- Permission configurationsmcp_servers.txt- MCP server usageengines.txt- Engine type detectiontimeouts.txt- Timeout settingsschedules.txt- Cron schedule patternsGenerated by Lockfile Statistics Analysis Agent on 2025-10-19
Analysis Time: ~5 minutes
Workflows Analyzed: 37
Total Data Processed: 7.21 MB
Beta Was this translation helpful? Give feedback.
All reactions