📊 Agentic Workflow Lock File Statistics - December 2025 #5648
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it was created by an agentic workflow more than 3 days ago. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
📊 Agentic Workflow Lock File Statistics - December 2025
Executive Summary
This comprehensive analysis examined all 107 agentic workflow lock files (
.lock.yml) in the.github/workflows/directory of the githubnext/gh-aw repository. The analysis reveals extensive adoption of agentic workflows with Copilot as the dominant engine (58%), strong preference for create-discussion safe outputs (37%), and consistent use of GitHub MCP server tools across workflows.Key Statistics:
Full Statistical Report
File Size Distribution
Statistics:
.github/workflows/shared/mcp/arxiv.lock.yml(80.23 KB).github/workflows/poem-bot.lock.yml(611.14 KB)The distribution shows remarkably consistent file sizes, with the vast majority (67.3%) falling into the 300-400 KB range. This consistency suggests standardized workflow structures and tooling configurations.
Trigger Analysis
Most Popular Triggers
Based on analysis of workflow trigger configurations in frontmatter:
Key Findings:
workflow_dispatch, enabling on-demand executionSchedule Patterns
Top 10 most common cron schedules:
0 9 * * *0 14 * * 1-50 11 * * 1-50 8 * * *0 13 * * 1-50 9 * * 10 10 * * 1-50 3 * * *0 15 * * 1-50 0 * * *Observations:
1-5) to avoid weekend executionSafe Outputs Analysis
Safe Output Types Distribution
Safe outputs enable agents to interact with GitHub safely without destructive permissions:
Total Safe Output Declarations: 127 (some workflows use multiple types)
Key Insights:
Discussion Categories
Distribution of discussion categories used by
create-discussionworkflows:Note: "audits" and "Audits" are treated as separate categories due to case sensitivity.
Finding: Strong preference for the "audits" category (37.5%), indicating these workflows primarily perform analysis and reporting functions.
Structural Characteristics
Workflow Structure Overview
poem-bot.lock.yml)Average Lock File Structure
Based on statistical analysis, a typical
.lock.ymlfile has:Step Complexity Distribution
The analysis reveals significant variation in workflow complexity:
High-Complexity Workflows (>100 steps):
poem-bot.lock.yml- 116 stepsEngine Distribution
Distribution of AI engines powering the agentic workflows:
Total Workflows with Engine Declaration: 81 out of 107 (75.7%)
Observations:
Permission Patterns
Most Common Permissions
Top 20 permissions requested across all workflows:
Key Security Observations:
Permission Philosophy
The repository demonstrates a least-privilege security model:
Tool & MCP Patterns
Most Used MCP Servers
MCP (Model Context Protocol) servers provide specialized capabilities to agents:
Total MCP Server Tool Calls: 4,039 across all workflows
Key Findings:
Common Tool Configurations
Native tools enabled across workflows:
Observations:
Timeout Patterns
Timeout Statistics
Common Timeout Values
Most workflows use standard timeout durations:
Finding: The average ~18 minute timeout suggests most workflows are designed for quick, focused tasks rather than long-running operations.
Interesting Findings
Standardized Structure: 67.3% of lock files fall into the 300-400 KB size range, indicating strong structural consistency across workflows
Copilot-First Strategy: With 58% market share among declared engines, Copilot is the clear platform default, likely due to native GitHub integration
Discussion-Driven Output: 37.4% of safe outputs are discussions, suggesting a preference for persistent, threaded reporting over ephemeral comments
GitHub MCP Dominance: The GitHub MCP server accounts for 89.6% of all MCP tool calls (3,619 out of 4,039), making it the critical infrastructure component
Business Hours Automation: Scheduled workflows heavily favor business hours (8 AM - 3 PM UTC) and weekdays, indicating these workflows support human workflows rather than 24/7 automation
Single Job Pattern: 100% of workflows use exactly 1 job per workflow, suggesting a design pattern of focused, single-purpose workflows
High Step Complexity: Average of 63.27 steps per workflow (with max of 116) indicates sophisticated multi-stage agent operations
Firewall Usage: Many workflows include network firewall rules, demonstrating security-conscious design
Cache Memory Adoption: 29% of workflows use cache-memory tool, enabling state persistence across workflow runs
Weekday-Only Schedules: Many cron schedules explicitly exclude weekends (
1-5), respecting developer work patternsRecommendations
For Workflow Authors
Standardize Category Names: Consolidate "audits", "Audits", and "audit" categories to reduce fragmentation
Consider Engine Selection:
Optimize Timeout Values: The 18-minute average suggests most tasks complete quickly - consider reducing timeouts for faster failure detection
Leverage Cache Memory: Only 29% of workflows use cache-memory; consider adopting for workflows with repeated data fetching
For Platform Improvements
MCP Server Documentation: Given GitHub MCP's dominance (89.6% of calls), comprehensive documentation and examples are critical
Discussion Category Management: Provide tooling to standardize and manage discussion categories
Timeout Monitoring: Track actual runtime vs. timeout to identify optimization opportunities
Engine Performance Metrics: Publish comparative metrics for engine selection guidance
For Repository Maintenance
Consolidate Small Workflows: Consider combining very small workflows (<100 KB) to reduce overhead
Review Large Workflows: Investigate the 4 workflows >400 KB for refactoring opportunities
Schedule Optimization: Current schedules create load spikes at 9 AM, 11 AM, and 2 PM UTC - consider spreading load
Permission Audit: Regularly review the 101 workflows with
contentspermissions to ensure least-privilegeMethodology
Data Collection
.github/workflows/*.lock.ymland.github/workflows/**/*.lock.ymlAnalysis Techniques
ducommand with awk aggregationon:frontmatter sectionsmcp__[server]__tool call patternsCache Memory Usage
Analysis scripts stored in
/tmp/gh-aw/cache-memory/scripts/:analyze_lockfiles.sh- Comprehensive bash analysisanalyze.py- Python-based data extractioncount_jobs.py- Job and step countingHistorical data saved to
/tmp/gh-aw/cache-memory/history/.last_analysisfor future trend tracking.Limitations
Future Analysis Opportunities
Conclusion
The githubnext/gh-aw repository demonstrates mature, production-ready agentic workflow practices with:
The repository serves as an excellent reference implementation for organizations adopting agentic workflows in GitHub Actions.
Analysis Metadata:
/tmp/gh-aw/cache-memory/scripts/Beta Was this translation helpful? Give feedback.
All reactions