[safe-output-health] Safe Output Health Report - 2026-06-13 #39023

2026-06-13T05:56:55Z

github-actions[bot]
Bot Jun 13, 2026

Executive Summary

Second consecutive CLEAN day for safe outputs. Across the captured early-morning window (11 workflow runs, 04:46Z–05:42Z), 9 safe_outputs jobs executed with 0 hard failures and 0 actuation failures — every real write succeeded. This follows the 2026-06-12 recovery and extends a 2-day clean streak after the 2026-06-11 break (3 hard failures).

The headline positive: the assign_to_agent cluster was exercised and ran clean for the first fully-confirmed time on record — Issue Monster assigned Copilot to three issues using real, resolvable issue numbers, with neither of the two prior failure variants (05-31 guess, 06-10 missing-field) reproducing.

Metric	Value
Runs in window	11 (+ self-monitor)
Safe_outputs jobs executed	9
Safe_outputs jobs failed (hard)	0
Failed messages (actuation)	0
Collection-time rejections	1 (by-design, max=1 noop)
Run-level job success rate	100%
Headline failure cluster	none

⚠️ Window caveat: This is a partial early-morning batch. LintMonster, the smoke-test suite, and all PR-reviewer workflows did not run, so several tracked failure modes were neither exercised nor validated today. Daily totals are a lower bound.

Safe Output Job Statistics

Job Type	Executions	Success Rate
create_issue	2	100%
create_discussion	2	100%
create_pull_request	1	100%
add_comment	5	100%
assign_to_agent	3	100%
add_labels	1	100%
missing_tool	1	100%
noop	3	100%
upload_asset (standalone)	4	100%
Total messages	22	100%

Per-run safe_outputs job detail (9 jobs)

Run	Workflow	Engine	Trigger	Result	Writes
27457637771	Step Name Alignment	claude	schedule	1✓/0✗	noop
27457550710	Smoke CI	copilot	push	1✓/0✗	noop
27457409492	Sergo	claude	schedule	3✓/0✗	create_issue #39016/#39017, create_discussion #39018
27457409699	Deep Research	copilot	schedule	1✓/0✗	missing_tool (handoff)
27457440763	Documentation Noob Tester	copilot	schedule	✓/0✗	create_discussion #39021, upload_asset ×4
27457358263	AI Moderator	codex	issues	1✓/0✗	add_labels #39015 (spam)
27457391975	jsweep	copilot	schedule	1✓/0✗	create_pull_request #39019
27457308504	Issue Monster	copilot/haiku	schedule	6✓/0✗	assign_to_agent ×3, add_comment ×3
27457031940	PR Sous Chef	copilot/gpt-5-mini	schedule	3✓/0✗	noop, add_comment #38911 ×2

Error Clusters

None. No safe_outputs-job hard failures and no failed messages this window. The single non-success disposition was a by-design collection-time rejection (see below), not an error.

Notable Observations

🟢 `assign_to_agent` cluster — exercised + CLEAN (key positive)

Issue Monster (run-27457308504) emitted 3 assign_to_agent messages with explicit, real, resolvable issue_number values (38999/38863/38787 — carried as issue_number, not item_number). The Process Safe Outputs log confirms all three Successfully assigned copilot coding agent, plus the three paired add_comment messages succeeded — Total 6, Successful 6, Failed 0.

This is the first fully-confirmed clean exercise of this cluster (06-05 was outcome-unobserved). Neither prior failure variant reproduced:

05-31 guess/off-by-two (literal predicted numbers → actuation 404): absent — these were pre-existing triaged issues, not create-then-assign-by-guess.
06-10 missing-field (used item_number, rejected at collection): absent — the agent correctly populated issue_number.

🟡 PR Sous Chef misleading "blocked" noop (low-sev, recurrence)

PR Sous Chef (run-27457031940, gpt-5-mini) emitted a noop whose body reads "No GitHub action was completed because write helpers returned permission errors in this environment" — yet the same run successfully posted 2 add_comments to PR #38911 via safeoutputs (Successful 3/Failed 0). It also emitted a second noop that was correctly rejected at collection (max=1).

This is a recurrence of the 2026-05-30 pr_sous_chef_premature_blocked_noop_with_buffered_writes pattern: the agent conflates firewall-blocked direct gh/git calls with safeoutputs being unavailable, so the noop misrepresents the run outcome. Agent-side output-hygiene only — net job = success.

🟢 `add_labels` with triggering context — succeeded again

AI Moderator (run-27457358263) added ['spam','link-spam'] to spam issue #39015 with the triggering issue context present — clean target resolution. Second consecutive day confirming add_labels works with context (after 06-12 #38782); the 06-11 hard-fails were specifically the no-context branch.

⚪ Deep Research missing_tool — clean failure-path handoff

Deep Research (run-27457409699) hit numerous permission-denied errors (agent-side, out of scope) but correctly emitted a missing_tool fallback that the safe_outputs job processed cleanly (Successful 1/Failed 0).

By-design collection-time rejection (not a failure)

PR Sous Chef emitted 2 noop messages; the noop handler is configured max: 1, so the second was dropped at collection (agent_output.errors: ["Line 4: Too many items of type 'noop'. Maximum allowed: 1."]). The accepted noop + 2 add_comments still processed — net job success. This is correct max-enforcement, recorded for completeness.

Out-of-scope agent-side anomalies (2)

Both emitted empty items:[] (no safe-output tool called, no safe_outputs job — not safe-output failures):

AI Moderator run-27457352501 — second moderation activation that determined no action.
Code Simplifier run-27456907583 — recurring agent-side friction (06-04, 06-11, 06-12, and again today).

Standing Gaps (tracked, not exercised today)

These remediations remain unvalidated because the workflows that would exercise them were absent from this partial window:

review_path_unresolved_422 Path-variant fix (pr_review_buffer.cjs:554) — UNVALIDATED for the 16th consecutive audit. No PR-reviewer workflows ran (no submit_pull_request_review, no create_pull_request_review_comment). One-line predicate fix (add || errorMessage.includes("Path could not be resolved")) still outstanding.
update_issue / add_labels no-context hard-fail family — LintMonster (06-11 production offender) and smoke workflows did not run. The proposed lint-monster.md target: "*" fix and the soft-skip-vs-hard-fail unification remain unvalidated.

Recommendations

No new action is required from this audit — the day is clean. The following remain the standing backlog (carried, unchanged):

Land the review_path_unresolved_422 Path-variant one-line predicate fix (pr_review_buffer.cjs:554) + a mirrored unit test. Highest-priority outstanding remediation; the recovery path has still never fired in production.
LintMonster update_issue target fix — add target: "*" to lint-monster.md so scheduled runs can update issues by explicit number; and (system-side) honor an explicit issue_number regardless of the default target: "triggering".
Unify missing-trigger-context handling across update_issue / add_labels / remove_labels / add_comment to soft-skip (⏭) like the review-comment handlers, rather than hard-fail.
(Low priority) Agent-side output hygiene for PR Sous Chef — stop emitting a "permission errors / nothing done" noop when safeoutputs writes actually succeeded; distinguish firewall-blocked direct calls from safeoutputs availability.

Historical Context

Date	Hard failures	Note
2026-06-09	0	8th clean day
2026-06-10	0	9th clean day; assign_to_agent collection-rejection variant noted
2026-06-11	3	streak broken — target/context-resolution family (LintMonster update_issue + smoke add_labels)
2026-06-12	0	recovery
2026-06-13	0	this report — 2-day clean streak; assign_to_agent exercised clean

Trend: Error rate back to stable/zero after the isolated 06-11 break. The 06-11 failures were specifically the missing-trigger-context branch; with-context paths (add_labels, assign_to_agent) have since been positively exercised and confirmed healthy on 06-12 and 06-13. The two genuine remediation gaps (Path-variant 422, no-context target unification) remain open but un-reproduced.

References:

§27457308504 — Issue Monster (assign_to_agent clean exercise)
§27457031940 — PR Sous Chef (misleading noop + successful writes)
§27457409492 — Sergo (create_issue ×2 + create_discussion)

Generated by 🔒 Safe Output Health Monitor · 503.4 AIC · ⌖ 16.1 AIC · ⊞ 10.1K · ◷

expires on Jun 13, 2026, 9:56 PM UTC-08:00

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[safe-output-health] Safe Output Health Report - 2026-06-13 #39023

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

[safe-output-health] Safe Output Health Report - 2026-06-13 #39023

Uh oh!

github-actions[bot] Bot Jun 13, 2026

Executive Summary

Safe Output Job Statistics

Error Clusters

Notable Observations

🟢 assign_to_agent cluster — exercised + CLEAN (key positive)

🟡 PR Sous Chef misleading "blocked" noop (low-sev, recurrence)

🟢 add_labels with triggering context — succeeded again

⚪ Deep Research missing_tool — clean failure-path handoff

Standing Gaps (tracked, not exercised today)

Recommendations

Historical Context

Replies: 0 comments

github-actions[bot]
Bot Jun 13, 2026

🟢 `assign_to_agent` cluster — exercised + CLEAN (key positive)

🟢 `add_labels` with triggering context — succeeded again