feat(nexus): TaskResult, preflight exit codes, ledger event, /why post-run, smoke evidence (WI-NEXUS-011..015)#73
Merged
Conversation
…t-run, smoke evidence (WI-NEXUS-011..015) - WI-NEXUS-011 (REQ-095): Captured live l1-nexus smoke evidence at .specsmith/runs/WI-NEXUS-011/logs.txt. - WI-NEXUS-012 (REQ-091): orchestrator.run_task returns a TaskResult dataclass (equilibrium, confidence, summary, files_changed, test_results); REPL executor consumes it directly. - WI-NEXUS-013 (REQ-094): Nexus REPL emits a [/why] post-run governance block when verbose_governance is on. - WI-NEXUS-014 (REQ-092): specsmith preflight exits 0 for accepted, 2 for needs_clarification, 3 for blocked/rejected; JSON still on stdout for non-zero exits. - WI-NEXUS-015 (REQ-093): Accepted preflight appends a preflight ledger event tagged with REQ-085 plus matched requirement_ids. Suite: 247 passing, 1 skipped (live). Co-Authored-By: Oz <oz-agent@warp.dev>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
feat(nexus): TaskResult, preflight exit codes, ledger event, /why post-run, smoke evidence
Tightens the Nexus ↔ Specsmith contract that landed in PR #72. Five
follow-up work items, all governed by Specsmith and verified by pytest.
Suite: 247 passing, 1 skipped (live l1-nexus integration test).
Work items in this PR
l1-nexussmoke evidence at.specsmith/runs/WI-NEXUS-011/logs.txt. The smoke script ran offline andreturned a structured
ok=falsetransport error; the log includes areproducible note describing how to re-run it against a live container.
orchestrator.run_tasknow returns aTaskResultdataclass (equilibrium,confidence,summary,files_changed,test_results). The Nexus REPL's bounded-retry harnessconsumes it directly instead of synthesizing equilibrium from
bool(summary). Adds a tolerant parser for the existing Nexus outputcontract (Plan/Commands/Files changed/Diff/Test results/Next action).
[/why]post-rungovernance block when
verbose_governanceis on, listing the assignedwork_item_id, matchedrequirement_ids/test_case_ids, post-runconfidence, and harnessequilibrium.specsmith preflightexits0foraccepted,2forneeds_clarification, and3forblocked/rejected. The JSON payload continues to print on stdout forevery exit code so CI pipelines can branch on intent without re-parsing.
specsmith preflightinvocationappends a
preflightledger event tagged withREQ-085plus the matchedrequirement_ids, recording the utterance, assignedwork_item_id, andconfidence_target. Non-accepted decisions never touch the ledger.Verification
py scripts/sync_governance_state.py→ 95 requirements / 95 test cases.py -m pytest -q→ 247 passed, 1 skipped (≈17s; the skip is theNEXUS_LIVE=1-gated integration test)..specsmith/runs/WI-NEXUS-011/logs.txt..specsmith/runs/WI-NEXUS-015/.Notes for reviewers
[/why]block is gated entirely behind the existing/whytoggle; default REPL behavior remains plain English with no governance
identifiers leaking to the user.
partial) is documented as a placeholder for a real verifier signal; the
retry harness already honors whatever value the executor returns.
the CLI from emitting its JSON or returning its exit code.
🤖 Generated with Warp — agent conversation:
link
Plan artifact: Warp Agent Implementation Plan
Co-Authored-By: Oz oz-agent@warp.dev