fix(harness): accept zero-failure verifier summaries by shaun0927 · Pull Request #1018 · Q00/ouroboros

shaun0927 · 2026-05-14T05:39:21Z

Summary

Narrow follow-up after #1006/#1010. The fat-harness verifier should not reject common successful test summaries solely because they contain explicit zero-failure text.

What changed

Accepts explicit zero-failure summaries such as:
- 0 failed, 3 passed
- 0 failed, 0 errors, 1 passed
- no errors, 3 passed
- no tests failed, 3 passed
Continues rejecting non-zero failure/error output such as:
- 1 failed, 3 passed
- 2 errors, 1 passed
Adds direct regression coverage for the success parser.
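The accept/reject rules listed above can be sketched as a small parser. This is a minimal illustration, not the code in `parallel_executor.py`: the pattern names, the `looks_successful` helper, and the exact regexes are assumptions made for the example.

```python
import re

# Hypothetical patterns for illustration; the project's real regexes may differ.
ZERO_FAILURE = re.compile(
    r"\b(?:0\s+(?:failed|failures?|errors?)"
    r"|no\s+(?:tests\s+failed|failures|errors))\b",
    re.IGNORECASE,
)
NONZERO_FAILURE = re.compile(
    r"\b[1-9]\d*\s+(?:failed|failures?|errors?)\b", re.IGNORECASE
)
FAILURE_TOKEN = re.compile(r"\b(?:failed|failures?|errors?)\b", re.IGNORECASE)
PASSED = re.compile(r"\b[1-9]\d*\s+passed\b", re.IGNORECASE)


def looks_successful(summary: str) -> bool:
    """Return True only when the summary reports passes and no failures."""
    # A non-zero failure or error count always rejects.
    if NONZERO_FAILURE.search(summary):
        return False
    # Drop explicit zero-failure phrases, then reject any failure word left over.
    remainder = ZERO_FAILURE.sub("", summary)
    if FAILURE_TOKEN.search(remainder):
        return False
    # Require at least one passed test as positive evidence.
    return bool(PASSED.search(summary))


if __name__ == "__main__":
    for text in ("0 failed, 3 passed", "no errors, 3 passed", "1 failed, 3 passed"):
        print(f"{text!r} -> {looks_successful(text)}")
```

Removing the zero-failure phrases before checking for leftover failure words keeps the allowlist from masking other negative signals in the same output.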

Boundary

Does not change the #920/#961 acceptance invariant (#920: Agent OS roadmap: make ooo run trustworthy with a fat harness execution path; #961: Meta SSOT: AgentOS roadmap sequencing (#920–#960)).
Does not start #978 (Design spine: AgentOS evidence-gated delivery via TraceGuard) P5.
Keeps legacy self-report fallback untouched.
Complements the current-run file-proof hardening in #1010 (fix(harness): require current-run proof for atomic file evidence) by removing the remaining verifier false negative in test-output parsing.

Validation

uv run pytest -q tests/unit/orchestrator/test_parallel_executor.py -k 'zero_failure_summaries or fat_harness or observe_only or typed_evidence' → 24 passed
uv run pytest -q tests/unit/orchestrator/test_parallel_executor.py tests/unit/orchestrator/test_verifier.py tests/unit/orchestrator/test_runner.py tests/unit/cli/test_run_qa.py → 245 passed, 1 skipped
uv run ruff check src/ouroboros/orchestrator/parallel_executor.py tests/unit/orchestrator/test_parallel_executor.py → passed
uv run mypy src/ouroboros/orchestrator/verifier.py src/ouroboros/orchestrator/parallel_executor.py src/ouroboros/orchestrator/parallel_executor_models.py → passed

Refs #961, #920, #1006, #1010.

Fat-harness verifier output can include explicit zero-failure phrases such as 0 failed, 0 errors, or no errors. Treat those as successful summaries while still rejecting non-zero failures and errors so atomic acceptance avoids avoidable false negatives.

Constraint: Q00#1010 closed current-run file proof hardening, but verifier test-output parsing still needs the remaining zero-failure false-negative fix before the observation window is clean.
Rejected: Blanket allow any output containing passed | would re-accept 1 failed, 3 passed mixed results.
Confidence: high
Scope-risk: narrow
Directive: Keep verifier success parsing conservative for non-zero failures while allowing explicit zero-failure summaries.
Tested: uv run pytest -q tests/unit/orchestrator/test_parallel_executor.py -k 'zero_failure_summaries or fat_harness or observe_only or typed_evidence'; uv run pytest -q tests/unit/orchestrator/test_parallel_executor.py tests/unit/orchestrator/test_verifier.py tests/unit/orchestrator/test_runner.py tests/unit/cli/test_run_qa.py; uv run ruff check src/ouroboros/orchestrator/parallel_executor.py tests/unit/orchestrator/test_parallel_executor.py; uv run mypy src/ouroboros/orchestrator/verifier.py src/ouroboros/orchestrator/parallel_executor.py src/ouroboros/orchestrator/parallel_executor_models.py
Not-tested: live ooo run execution.
Co-authored-by: OmX <omx@oh-my-codex.dev>

shaun0927 · 2026-05-14T05:39:39Z

@ouroboros-agent please review this narrow follow-up after #1006/#1010. It only fixes verifier false-negatives for explicit zero-failure test summaries while preserving rejection of non-zero failures/errors.

ouroboros-agent

Review — ouroboros-agent[bot]

Verdict: REQUEST_CHANGES

Reviewing commit 289bdca for PR #1018

Review record: 4164f1d3-896d-4141-bee1-1508970a27f6

Blocking Findings

| # | File:Line | Severity | Finding |
|---|-----------|----------|---------|
| 1 | src/ouroboros/orchestrator/parallel_executor.py:288 | BLOCKING | The new failure regex no longer rejects a standalone failed token. The old guard caught cases like FAILED tests/test_x.py::test_y or tests failed unless they were explicitly 0 failed / no tests failed; after this patch, those strings survive zero_failure_pattern but do not match `\b(failure |

Non-blocking Suggestions

None.

Design Notes

The change is appropriately narrow, but the verifier logic is regex-fragile: expanding the zero-failure allowlist also removed an important negative signal. This path needs tests for both summary-style failures and raw pytest FAILED ... lines.
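To make the lost negative signal concrete, the sketch below contrasts a guard that rejects only counted failures with one that also rejects any failure token left after zero-failure phrases are removed. All names and regexes here are hypothetical stand-ins, not the verifier's actual patterns.

```python
import re

# Hypothetical stand-ins for the verifier's patterns; not the project's regexes.
ZERO_FAILURE = re.compile(
    r"\b(?:0\s+failed|0\s+errors?|no\s+errors|no\s+tests\s+failed)\b",
    re.IGNORECASE,
)
COUNTED_FAILURE = re.compile(r"\b[1-9]\d*\s+(?:failed|errors?)\b", re.IGNORECASE)
FAILURE_TOKEN = re.compile(r"\b(?:failed|failures?|errors?)\b", re.IGNORECASE)


def rejects_counted_only(output: str) -> bool:
    """Regressed shape: only a non-zero count is treated as a failure."""
    return bool(COUNTED_FAILURE.search(output))


def rejects_with_token_guard(output: str) -> bool:
    """Guarded shape: any failure word outside a zero-failure phrase rejects."""
    remainder = ZERO_FAILURE.sub("", output)
    return bool(FAILURE_TOKEN.search(remainder))


# (verifier output, whether it should be rejected)
CASES = [
    ("0 failed, 3 passed", False),
    ("no tests failed, 3 passed", False),
    ("1 failed, 3 passed", True),
    ("FAILED tests/test_x.py::test_y", True),
    ("tests failed", True),
]

if __name__ == "__main__":
    for text, should_reject in CASES:
        print(
            f"{text!r}: counted_only={rejects_counted_only(text)} "
            f"guarded={rejects_with_token_guard(text)} expected={should_reject}"
        )
```

In this sketch the counted-only guard lets the raw `FAILED ...` line and `tests failed` through, while the token guard rejects both and still accepts the zero-failure summaries. A regression suite along these lines would cover summary-style failures and raw pytest lines together.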

Reviewed by ouroboros-agent[bot] via Codex deep analysis

Restore the negative signal lost in #1018 while preserving explicit zero-failure summary handling.

Constraint: #1018 intentionally accepts zero-failure summaries, but the verifier must still reject raw pytest FAILED lines and generic tests-failed output before #978 P5 readiness.
Rejected: Reverting #1018 | zero-failure summaries are valid success evidence and only the standalone failed token regression needed correction.
Confidence: high
Scope-risk: narrow
Directive: Keep zero-failure allowlisting paired with explicit failed-token rejection tests whenever changing verifier output parsing.
Tested: uv run pytest -q tests/unit/orchestrator/test_parallel_executor.py -k 'zero_failure_summaries or fat_harness or observe_only or typed_evidence'; uv run pytest -q tests/unit/orchestrator/test_parallel_executor.py tests/unit/orchestrator/test_verifier.py tests/unit/orchestrator/test_runner.py tests/unit/cli/test_run_qa.py; uv run ruff check src/ouroboros/orchestrator/parallel_executor.py tests/unit/orchestrator/test_parallel_executor.py; uv run mypy src/ouroboros/orchestrator/verifier.py src/ouroboros/orchestrator/parallel_executor.py src/ouroboros/orchestrator/parallel_executor_models.py
Not-tested: live runtime transcript containing provider-specific pytest formatting

ouroboros-agent Bot requested changes May 14, 2026


shaun0927 merged commit a695192 into Q00:main May 14, 2026
8 checks passed

shaun0927 mentioned this pull request May 14, 2026

fix(harness): resolve #978 typed evidence blocker #1025

Merged

shaun0927 mentioned this pull request May 22, 2026

fix(orchestrator): credit transcript test commands for tests_passed claims #1166

Merged

shaun0927 merged 1 commit into Q00:main from shaun0927:fix/verifier-zero-error-summaries
