Outcome Scorecard β 2026-05-26
| Metric |
Value |
Status |
| Acceptance rate |
54.5% |
π΄ <60% |
| Zero-touch rate |
0% |
π΄ 0% |
| Waste rate |
20% |
π‘ 10-25% |
| Median time to resolution |
343 sec (5.7 min) |
β |
| Accepted |
6 / 25 |
β |
| Rejected |
5 |
β |
| Pending |
14 |
β |
| Runs checked |
12 |
β |
π΄ Action Items
-
Chaos PR Bundle Fuzzer β Critical β 50% rejection rate (5 rejected, 5 pending). This workflow's output quality needs immediate review. Recommend auditing the prompt and testing with smaller sample before re-enabling at scale.
-
Stuck pending items β 3 items pending >12 hours:
Recommend setting timeout policies for pending outcomes.
-
Missing URLs β 2 pending items with no URL (Contribution Check, AI Moderator). These may be silently failing. Investigate error handling.
-
Zero-touch rate at 0% β All 6 accepted items required human review/edits. This suggests the workflows are not generating production-ready output. Consider whether these should be rejected instead.
Per-Workflow Breakdown
| Workflow |
Accepted |
Rejected |
Pending |
Acceptance |
Waste Rate |
| Chaos PR Bundle Fuzzer |
0 |
5 |
5 |
0% |
50% |
| Contribution Check |
0 |
0 |
3 |
0% |
0% |
| Smoke CI |
0 |
0 |
4 |
0% |
0% |
| Instructions Janitor |
0 |
0 |
1 |
0% |
0% |
| AI Moderator |
0 |
0 |
1 |
0% |
0% |
| Agent Performance Analyzer |
2 |
0 |
0 |
100% |
0% |
| Claude Code User Documentation Review |
1 |
0 |
0 |
100% |
0% |
| Terminal Stylist |
1 |
0 |
0 |
100% |
0% |
| Workflow Normalizer |
1 |
0 |
0 |
100% |
0% |
| Dev |
1 |
0 |
0 |
100% |
0% |
Trend Analysis
Comparing to 2026-05-25:
| Metric |
2026-05-25 |
2026-05-26 |
Change |
| Acceptance Rate |
100% |
54.5% |
β¬οΈ -45.5pp |
| Zero-touch Rate |
0% |
0% |
β‘οΈ Stable |
| Waste Rate |
0% |
20% |
β¬οΈ +20pp |
| Runs Checked |
14 |
12 |
-2 runs |
| Total Outcomes |
29 |
25 |
-4 outcomes |
β¬οΈ Regressing β Acceptance rate dropped 45.5 percentage points, primarily due to Chaos PR Bundle Fuzzer's 50% rejection rate. This is the single largest factor driving today's degradation.
Rejected Items
All 5 rejections came from Chaos PR Bundle Fuzzer (run #26451165321):
All were auto-closed by the system, suggesting policy violations or test failures.
Reaction Summary & Recommendations
No positive or negative reactions recorded on any outcomes. This may indicate:
- Items not yet reviewed by team
- Reactions not being captured in evaluation
- Low visibility of the outcomes
Recommendations
- Immediate: Pause or reduce Chaos PR Bundle Fuzzer runs pending prompt audit
- Short-term: Add URL validation and error handling for missing-URL outcomes (Contribution Check, AI Moderator)
- Medium-term: Investigate why zero-touch rate is 0% β consider whether accepting-but-edited items should be classified differently
- Ongoing: Set TTL policies for pending outcomes to flag stuck items automatically
π Measured by Outcome Collector Β· haiku45 39K
Outcome Scorecard β 2026-05-26
π΄ Action Items
Chaos PR Bundle Fuzzer β Critical β 50% rejection rate (5 rejected, 5 pending). This workflow's output quality needs immediate review. Recommend auditing the prompt and testing with smaller sample before re-enabling at scale.
Stuck pending items β 3 items pending >12 hours:
Recommend setting timeout policies for pending outcomes.
Missing URLs β 2 pending items with no URL (Contribution Check, AI Moderator). These may be silently failing. Investigate error handling.
Zero-touch rate at 0% β All 6 accepted items required human review/edits. This suggests the workflows are not generating production-ready output. Consider whether these should be rejected instead.
Per-Workflow Breakdown
Trend Analysis
Comparing to 2026-05-25:
β¬οΈ Regressing β Acceptance rate dropped 45.5 percentage points, primarily due to Chaos PR Bundle Fuzzer's 50% rejection rate. This is the single largest factor driving today's degradation.
Rejected Items
All 5 rejections came from Chaos PR Bundle Fuzzer (run #26451165321):
All were auto-closed by the system, suggesting policy violations or test failures.
Reaction Summary & Recommendations
No positive or negative reactions recorded on any outcomes. This may indicate:
Recommendations