Reduce aw-failure-investigator issue churn by prioritizing closure and reusing parent tracking #26795
| Original file line number | Diff line number | Diff line change | ||||||
|---|---|---|---|---|---|---|---|---|
| @@ -1,5 +1,5 @@ | ||||||||
| --- | ||||||||
| description: Investigates [aw] failures from the last 6 hours, correlates with open agentic-workflows issues, and opens a parent report with fix sub-issues | ||||||||
| description: Investigates [aw] failures from the last 6 hours, correlates with open agentic-workflows issues, closes fixed issues, and opens focused fix sub-issues when needed | ||||||||
| on: | ||||||||
| schedule: | ||||||||
| - cron: "every 6h" | ||||||||
|
|
@@ -22,10 +22,13 @@ safe-outputs: | |||||||
| expires: 7d | ||||||||
| title-prefix: "[aw-failures] " | ||||||||
| labels: [agentic-workflows, automation, cookie] | ||||||||
| max: 8 | ||||||||
| max: 2 | ||||||||
| group: true | ||||||||
| update-issue: | ||||||||
| target: "*" | ||||||||
| max: 10 | ||||||||
| link-sub-issue: | ||||||||
| max: 20 | ||||||||
| max: 10 | ||||||||
| noop: | ||||||||
| timeout-minutes: 60 | ||||||||
| imports: | ||||||||
|
|
@@ -49,7 +52,7 @@ Investigate agentic workflow failures from the last 6 hours and produce actionab | |||||||
| 1. Find recent failures from agentic workflows in the last 6 hours. | ||||||||
| 2. Correlate findings with currently open `agentic-workflows` issues. | ||||||||
| 3. Perform large-scale failure analysis using logs + audit + audit-diff. | ||||||||
| 4. Create one parent report issue and linked sub-issues proposing concrete fixes. | ||||||||
| 4. Close fixed/stale issues first, then create only the minimum necessary linked fix sub-issues. | ||||||||
|
|
||||||||
| ## Required Investigation Steps | ||||||||
|
|
||||||||
|
|
@@ -91,16 +94,15 @@ Use `agentic-workflows` MCP `audit-diff` to compare: | |||||||
|
|
||||||||
| Identify regressions and deltas (metrics/tooling/firewall/MCP behavior) that support fix recommendations. | ||||||||
|
|
||||||||
| ### 5) Create parent report issue + sub-issues | ||||||||
| ### 5) Close fixed issues first, then add focused sub-issues | ||||||||
|
|
||||||||
| Create a **single parent report issue** with a temporary ID (format `aw_` + 3-8 alphanumeric characters) summarizing: | ||||||||
| - observed failure clusters in last 6h | ||||||||
| - links to analyzed run IDs | ||||||||
| - evidence from logs/audit/audit-diff | ||||||||
| - mapping to existing open issues (duplicate / related / new) | ||||||||
| - prioritized fix plan | ||||||||
| First, identify currently open `agentic-workflows` issues that are now fixed, stale, or no longer actionable based on fresh evidence, and close them using `update-issue`. | ||||||||
| When closing an issue with `update-issue`, use the safe-output fields `issue_number` and `status: "closed"`. Do **not** use `state`. Do not update the issue body unless you are intentionally revising it as part of the close action. |
`update-issue` is intended to close fixed/stale issues, but the frontmatter configuration does not enable status changes (it only sets `target` and `max`). In compiled workflows, allowing closure typically requires declaring `status:` under `safe-outputs.update-issue`, which propagates to `allow_status: true` in the lock config. Add `status:` (and recompile the lock file) so the workflow can actually close issues as instructed.
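
A minimal sketch of the frontmatter change this comment asks for, assuming the `update-issue` block from the diff. The `target` and `max` values are copied from the diff; the bare `status:` key follows the reviewer's wording, and its exact accepted form is an assumption that should be checked against the workflow compiler before merging.

```yaml
safe-outputs:
  update-issue:
    target: "*"   # from the diff: any issue may be updated
    max: 10       # from the diff: at most 10 updates per run
    status:       # assumption: a bare key enables status changes (allow_status: true in the lock config)
```

After adding the key, the lock file would need to be recompiled so that the compiled workflow picks up the new permission.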