📊 Agentic Workflow Lock File Statistics Analysis - February 2026 #13875
Closed
Replies: 1 comment
-
|
This discussion was automatically closed because it expired on 2026-02-12T08:33:43.874Z.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Comprehensive statistical analysis of 145 agentic workflow lock files in the github/gh-aw repository, revealing usage patterns, structural characteristics, and interesting insights into how agentic workflows are configured and deployed.
Key Findings:
workflow_dispatch)File Size Distribution
The majority of lock files fall into a consistent size range, indicating standardized workflow structure with moderate complexity.
Size Statistics:
codex-github-remote-mcp-test.lock.yml(21.7 KB) - Test workflowsmoke-claude.lock.yml(104.6 KB) - Comprehensive smoke testingTrigger Analysis
Most Popular Triggers
Workflows are primarily designed for manual invocation with optional automation through scheduling.
workflow_dispatchscheduleissue_commentpull_requestissuespull_request_review_commentdiscussion_commentdiscussionworkflow_runpushCommon Trigger Combinations
Most workflows combine manual and automated triggers for flexible execution.
schedule+workflow_dispatchworkflow_dispatchonlypull_request+schedule+workflow_dispatchissuesonlyTotal Unique Combinations: 18 different trigger patterns
Schedule Patterns
104 workflows use scheduled execution with diverse cron patterns for load distribution:
View Schedule Distribution
Peak Hours (UTC):
Most Common Schedules:
0 13 * * 1-5- 4 workflows (1 PM on weekdays)0 14 * * 1-5- 4 workflows (2 PM on weekdays)0 11 * * 1-5- 4 workflows (11 AM on weekdays)*/4,*/6,*/12) - 7 workflowsFrequency Distribution:
Insight: Schedules are intentionally scattered across different times to avoid resource contention, with a preference for business hours (9 AM - 3 PM UTC).
Safe Outputs Analysis
Safe outputs enable workflows to interact with GitHub (create issues, discussions, comments, PRs).
Safe Output Usage
Safe Output Type Distribution:
add-commentcreate-discussioncreate-issuecreate-pull-requestNote: The analysis detected explicit
add-commentusage. Other safe output types may be configured dynamically through the safeoutputs MCP server (detected in 138 workflows) but not hard-coded in lock files.Workflows Using Safe Outputs
View Interactive Workflows (24 workflows)
PR/Issue Comment Responders:
smoke-codex.lock.yml,smoke-copilot.lock.yml,smoke-claude.lock.yml- Smoke testing with feedbackchangeset.lock.yml- PR changeset analysisgrumpy-reviewer.lock.yml,security-review.lock.yml- Code review agentspr-nitpick-reviewer.lock.yml- Detailed PR reviewsscout.lock.yml,q.lock.yml,cloclo.lock.yml- Universal assistant workflowsIssue Management:
issue-classifier.lock.yml- Automatic issue classificationcraft.lock.yml- Workflow generation from issuesworkflow-generator.lock.yml- Dynamic workflow creationpdf-summary.lock.yml- PDF content summarizationnotion-issue-summary.lock.yml- Notion integrationSpecialized Responders:
mergefest.lock.yml- Merge conflict resolutionplan.lock.yml- Planning assistancebrave.lock.yml- Web search integrationarchie.lock.yml- Architecture documentationpoem-bot.lock.yml- Creative content generationtidy.lock.yml- Code cleanupunbloat-docs.lock.yml- Documentation optimizationmcp-inspector.lock.yml- MCP server analysissmoke-opencode.lock.yml- OpenCode testingStructural Characteristics
Job Complexity
Workflows are structured with multiple jobs for parallel execution and clear separation of concerns.
Job Distribution:
Statistics:
Step Complexity
Steps represent individual actions within jobs, including tool calls, data processing, and output generation.
Statistics:
daily-copilot-token-report.lock.yml)Most Complex Workflows
Top 5 workflows by step count, representing the most comprehensive agentic workflows:
daily-copilot-token-report.lock.ymlcopilot-pr-nlp-analysis.lock.ymlunbloat-docs.lock.ymlpoem-bot.lock.ymldeep-report.lock.ymlSimplest Workflows
Minimal workflows used for testing and specific focused tasks:
chroma-issue-indexer.lock.ymlcodex-github-remote-mcp-test.lock.ymlfirewall.lock.ymltest-workflow.lock.ymlexample-permissions-warning.lock.ymlTypical Lock File Structure
Based on statistical analysis, a typical .lock.yml file has:
schedule+workflow_dispatch(65% of workflows)Tool & MCP Patterns
MCP Server Usage
MCP (Model Context Protocol) servers provide specialized capabilities to agentic workflows.
githubsafeoutputsbraveKey Insight:
Permission Patterns
Note: Permission analysis showed empty results in most workflows, likely due to minimal permission principle with job-level permissions specified instead of workflow-level.
Common Pattern: Workflows use
permissions: {}at the top level and specify granular permissions at the job level for:contents: read- Repository accessissues: read/write- Issue managementpull-requests: read/write- PR managementdiscussions: read/write- Discussion accessInteresting Findings
1. Universal Manual Trigger Capability
88% of workflows support
workflow_dispatch, enabling developers to manually trigger any workflow on-demand. This provides exceptional flexibility for debugging and ad-hoc analysis.2. Scheduled Load Distribution
The 104 scheduled workflows use intentionally scattered cron times (different minutes and hours) to avoid concurrent execution spikes. This demonstrates thoughtful infrastructure design.
3. Conservative Safe Output Usage
Only 16.6% of workflows explicitly use safe outputs, suggesting most workflows are read-only analyzers and reporters. This aligns with a "observe first, act cautiously" philosophy.
4. Standardized Workflow Structure
67% of workflows fall into a consistent 50-70 KB size range with similar job counts (5-7 jobs), indicating strong architectural patterns and reusable templates.
5. Complexity Concentration
The most complex workflows (90+ steps) focus on:
6. No Permission Bloat
Workflows follow the principle of least privilege with minimal top-level permissions and job-specific grants.
7. Event-Driven Versatility
3 workflows (
pr-nitpick-reviewer,q,cloclo) respond to all 6 major GitHub event types, making them universal assistants.8. One Orphan Workflow
test-dispatcher.lock.ymlhas no triggers, suggesting it's designed to be called as a reusable workflow or is under development.Recommendations
Based on this analysis, here are suggestions for workflow optimization and best practices:
1. Expand Safe Output Usage
With safeoutputs MCP configured in 95% of workflows but only 17% using it, there's opportunity to make more workflows interactive by adding comment-based feedback.
2. Standardize Complexity
The 67% of workflows in the 50-70 KB range represent a "sweet spot" for maintainability. Workflows exceeding 90 KB might benefit from modularization.
3. Document Trigger Combinations
With 18 unique trigger patterns, document best practices for choosing trigger combinations based on workflow purpose.
4. Schedule Optimization
Review the current schedule distribution to ensure optimal resource utilization. Consider consolidating workflows with similar schedules into combined reports.
5. Template Library
Given the structural consistency, create workflow templates for common patterns:
6. MCP Server Expansion
Only 2 workflows use the Brave MCP for web search. Explore opportunities to integrate web search into research-focused workflows.
7. Historical Trend Tracking
Establish baseline metrics from this analysis to track:
Methodology
Analysis Tools:
Data Sources:
.lock.ymlfiles from.github/workflows/Metrics Collected:
Scripts Saved:
/tmp/gh-aw/cache-memory/scripts/analyze_lockfiles.py/tmp/gh-aw/cache-memory/scripts/enhanced_analysis.py/tmp/gh-aw/cache-memory/history/Appendix: Example Workflows by Category
Daily Automation Workflows (Schedule + workflow_dispatch)
cli-version-checker.lock.yml- Monitor CLI tool versionsdaily-workflow-updater.lock.yml- Update workflow definitionsdaily-team-status.lock.yml- Team activity reportingdaily-code-metrics.lock.yml- Code quality metricsdaily-news.lock.yml- Curated news summariesdaily-file-diet.lock.yml- Repository cleanupdaily-firewall-report.lock.yml- Security compliancePR/Issue Interactive Workflows
grumpy-reviewer.lock.yml- Critical code reviewspr-nitpick-reviewer.lock.yml- Detailed PR analysissecurity-review.lock.yml- Security-focused reviewschangeset.lock.yml- PR change analysisissue-classifier.lock.yml- Automatic issue labelingcraft.lock.yml- Workflow generation from issuesSmoke Testing Workflows
smoke-claude.lock.yml- Claude Code testing (104 KB, most comprehensive)smoke-copilot.lock.yml- GitHub Copilot CLI testingsmoke-codex.lock.yml- OpenAI Codex testingsmoke-opencode.lock.yml- OpenCode testingsmoke-test-tools.lock.yml- General tool testingsmoke-project.lock.yml- Project functionality testingAnalysis & Reporting Workflows
agent-performance-analyzer.lock.yml- Agent performance metricscopilot-agent-analysis.lock.yml- Copilot behavior analysisdaily-copilot-token-report.lock.yml- Token usage (most complex: 100 steps)copilot-pr-nlp-analysis.lock.yml- NLP analysis of PRsdeep-report.lock.yml- Comprehensive reportingportfolio-analyst.lock.yml- Portfolio analysisstatic-analysis-report.lock.yml- Code quality reportsAnalysis Complete ✅
This report was generated by the Lockfile Statistics Analysis Agent using comprehensive parsing and statistical analysis of all workflow lock files in the repository.
Beta Was this translation helpful? Give feedback.
All reactions