[repository-quality] 🎯 Repository Quality Improvement Report - Agentic Workflow Runtime Safety & Compliance (2026-05-27) #35222
Closed
Replies: 2 comments
-
|
Smoke test agent was here. Me bonk button. Sparks fly. Checks roar. Warning Firewall blocked 6 domainsThe following domains were blocked by the firewall during workflow execution:
network:
allowed:
- defaults
- "accounts.google.com"
- "android.clients.google.com"
- "clients2.google.com"
- "contentautofill.googleapis.com"
- "safebrowsingohttpgateway.googleapis.com"
- "www.google.com"See Network Configuration for more information.
|
Beta Was this translation helpful? Give feedback.
0 replies
-
|
This discussion was automatically closed because it expired on 2026-05-28T14:07:53.882Z.
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
🎯 Repository Quality Improvement Report - Agentic Workflow Runtime Safety & Compliance
Analysis Date: 2026-05-27
Focus Area: Agentic Workflow Runtime Safety & Compliance
Strategy Type: Custom
Custom Area: Yes — this repository has 236 agentic workflow markdown files with complex runtime configurations (engines, timeouts, A/B experiments, token budgets). No standard category captures these workflow-authoring compliance concerns.
Executive Summary
Analysis of 236 workflow markdown files in
.github/workflows/reveals four actionable compliance gaps affecting production reliability. The most critical: 12 workflows lacktimeout-minutes, including 6 with scheduled triggers (daily or higher frequency) — any of these can run indefinitely if the agent hangs, silently burning GitHub Actions minutes. The second concern ismode: remoteappearing in two non-test production workflows (schema-feature-coverage.mdandgithub-mcp-tools-report.md); per AGENTS.md this mode does not work withGITHUB_TOKENand requires a PAT or GitHub App token, meaning these workflows likely fail their MCP GitHub calls at runtime.A lower-severity but pervasive concern is stale A/B experiments: 6 experiments started ≥ 14 days ago (the
min_samplesthreshold used in most) have no closing deadline in their frontmatter and have not been concluded — they accumulate data indefinitely without a forcing function for analysis. Finally,daily-observability-report.mdsetsmax-effective-tokens: 80000000(80 M tokens) — twice the next-highest value in the repository and scheduled to run daily, with no documented justification for the elevated ceiling.Full Analysis Report
Focus Area: Agentic Workflow Runtime Safety & Compliance
Current State Assessment
Metrics Collected:
.mdfilestimeout-minutestimeout-minutesmode: remotemax-effective-tokensdaily-observability-report.md)Findings
Strengths
timeout-minutes, showing high adoption of the safeguardlist_code_scanning_alertsusages correctly guard withstate: openandseverity: critical,high(seecode-scanning-fixer.md)needs.activation.outputs.text/title/bodyexpressions — full migration tosteps.sanitized.*is completeshared/meta-analysis-base.md,shared/gh.md, etc.) cleanly provide GitHub MCP config to 37+ workflows without duplicationAreas for Improvement
timeout-minutes; agent hangs cost unbounded CI minutesschema-feature-coverage.mdandgithub-mcp-tools-report.mdusemode: remotewhich requires a PAT/App token not available to the standardGITHUB_TOKENrunner environmentmin_samplesthreshold with no deadline/expiry, creating open-ended experimentsdaily-observability-report.mdcarriesmax-effective-tokens: 80000000with no inline justification comment, making future review difficultDetailed Analysis
Missing
timeout-minutes— Scheduled Workflows (Critical Subset)The following 6 scheduled workflows have no
timeout-minutesdeclaration and therefore no upper bound on execution time:constraint-solving-potd.mdcontribution-check.mddaily-astrostylelite-markdown-spellcheck.mddaily-semgrep-scan.mddaily-sentrux-report.mdotlp-data-quality-validator.mdSix additional non-scheduled workflows also lack the field:
ace-editor.md,dependabot-burner.md,dependabot-repair.md,smoke-ci.md(runs on PR + push),test-dispatcher.md,test-project-url-default.md.The
smoke-ci.mdcase is particularly notable — it triggers on everypull_requestopen/sync/reopen event and hasconcurrency: cancel-in-progress: true, but without a timeout a leaked agent process could hold the concurrency slot open.mode: remotein Non-Test Production WorkflowsFour workflows use
mode: remotefor GitHub MCP:codex-github-remote-mcp-test.md— ✅ test-only (workflow_dispatch)github-remote-mcp-auth-test.md— ✅ test-only, engine unsetgithub-mcp-tools-report.md—on: workflow_dispatch— production useschema-feature-coverage.md—Per AGENTS.md: "Never use
mode: remote— it does not work with the GitHub Actions token (GITHUB_TOKEN) and requires a special PAT or GitHub App token."schema-feature-coverage.mdis particularly risky as it runs weekly and attempts to create pull requests — if MCP auth silently fails, the schema coverage check produces no output.Stale A/B Experiments
Experiments with
start_date≥ 14 days before today (2026-05-27) andmin_samples: 14:issue-arborist.mddeep-report.mddaily-issues-report.mddaily-fact.mddaily-news.mddaily-security-red-team.mdNone of these have an
end_dateordeadlinefield. Without a forcing function, experiments accumulate indefinitely.🤖 Tasks for Copilot Agent
NOTE TO PLANNER AGENT: Split the following tasks into individual work items.
Improvement Tasks
Task 1: Add
timeout-minutesto All Scheduled Workflows Missing ItPriority: High
Estimated Effort: Small
Focus Area: Agentic Workflow Runtime Safety
Description: Six scheduled agentic workflows and six additional high-frequency workflows lack
timeout-minutes. Add appropriate per-engine defaults: copilot/claude → 30 min, unset-engine shell-heavy → 20 min,every 4 hoursfrequency → 15 min (contribution-check.md).Acceptance Criteria:
timeout-minutesadded to their frontmattermake recompilepasses with zero errors after changesmake buildpassesCode Region:
.github/workflows/constraint-solving-potd.md,.github/workflows/contribution-check.md,.github/workflows/daily-astrostylelite-markdown-spellcheck.md,.github/workflows/daily-semgrep-scan.md,.github/workflows/daily-sentrux-report.md,.github/workflows/otlp-data-quality-validator.md,.github/workflows/ace-editor.md,.github/workflows/dependabot-burner.md,.github/workflows/dependabot-repair.md,.github/workflows/smoke-ci.md,.github/workflows/test-dispatcher.md,.github/workflows/test-project-url-default.mdTask 2: Fix
mode: remotein Production WorkflowsPriority: Medium
Estimated Effort: Small
Focus Area: Agentic Workflow Runtime Compliance
Description:
schema-feature-coverage.md(weekly scheduled, codex) andgithub-mcp-tools-report.md(claude, workflow_dispatch) usemode: remotefor their GitHub MCP tool, which per AGENTS.md "does not work with the GitHub Actions token (GITHUB_TOKEN) and requires a special PAT or GitHub App token." Replace withmode: gh-proxy(the safe alternative that works with GITHUB_TOKEN) or remove themode:key entirely (default is gh-proxy).Acceptance Criteria:
schema-feature-coverage.mdnorgithub-mcp-tools-report.mdcontainmode: remotegithub:tools section with appropriate toolsetscodex-github-remote-mcp-test.mdandgithub-remote-mcp-auth-test.mdare not modified (they intentionally test remote mode)make recompileandmake buildpassCode Region:
.github/workflows/schema-feature-coverage.md,.github/workflows/github-mcp-tools-report.mdChange
mode: remotetomode: gh-proxy(or remove themode:line entirely, sincegh-proxyis the default).In
.github/workflows/github-mcp-tools-report.md, find the equivalentmode: remoteunder thegithub:tools key and make the same change.Do NOT modify
codex-github-remote-mcp-test.mdorgithub-remote-mcp-auth-test.md— those are intentional test workflows for the remote mode feature.After editing, run
make recompileandmake buildto verify.After editing all 6 files, run
make recompileto verify no parse errors.max-effective-tokens: 40000000to align with other complex daily workflows.After editing, run
make recompileandmake buildto verify.Beta Was this translation helpful? Give feedback.
All reactions