Accept core runtime health fallback by rohitg00 · Pull Request #4 · iii-experimental/iii-code

rohitg00 · 2026-05-13T11:37:02Z

Summary

- Accept the core worker function set when `harness::status` is unavailable
- Keep setup/doctor failing when both harness and core runtime probes fail
- Document that stopped workers are expected until the iii engine is running

Validation

- `cargo test`
- `cargo clippy -- -D warnings`
- `cargo fmt --check`
- `git diff --check`
- `target/debug/iii-code doctor` against live engine
- `target/debug/iii-code models` against live engine

Summary by CodeRabbit

Documentation
- Expanded setup, config example, and diagnostics: explicit local worker/engine/provider requirements, install/setup snippet, probe order and expectations, config.yaml guidance, and tightened doctor output/exit behavior.
New Features
- Added session management commands (status, tree, clone, export, compact, fork, repair, abort) and a --new option for chat sessions; export/compact output options documented.
Bug Fixes
- Health checks now accept core runtime when harness probe fails; probe failures reported precisely and exit nonzero.
Tests
- Added/updated tests for harness fallback, core-runtime acceptance, probe failures, and session command parsing.

coderabbitai · 2026-05-13T11:37:13Z

📝 Walkthrough

Adds harness-or-core fallback probing for doctor/health_probe, new chat shell session-tree commands and matching CLI args, session payload builders for clone/tree/export/compact, a detailed example config, and README/setup/test updates.
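The harness-then-core probe order described above can be sketched roughly as follows. This is a minimal illustration, not the code in `src/app.rs`: the entries listed in `CORE_RUNTIME_FUNCTIONS` here are placeholders (the review does not show the real set), and `evaluate_health` and `Health` are hypothetical names that take already-collected probe results instead of calling the engine.

```rust
use std::collections::HashSet;

/// Placeholder stand-in for the PR's `CORE_RUNTIME_FUNCTIONS` constant.
/// The real function ids are not shown in this review (assumption).
const CORE_RUNTIME_FUNCTIONS: &[&str] = &["state::get", "state::set", "engine::functions::list"];

#[derive(Debug, PartialEq)]
enum Health {
    /// The harness status probe answered.
    Harness,
    /// The harness probe failed, but every core function is registered.
    CoreRuntime,
}

/// `harness` is the outcome of the harness status probe; `listed` is the
/// outcome of listing registered function ids (e.g. via `engine::functions::list`).
fn evaluate_health(
    harness: Result<(), String>,
    listed: Result<Vec<String>, String>,
) -> Result<Health, String> {
    // 1. Prefer the harness probe when it succeeds.
    let harness_err = match harness {
        Ok(()) => return Ok(Health::Harness),
        Err(e) => e,
    };
    // 2. Fall back to the core function list; if that also fails, report both.
    let functions: HashSet<String> = listed
        .map_err(|e| format!("harness probe failed ({harness_err}); core probe failed ({e})"))?
        .into_iter()
        .collect();
    // 3. Accept the core runtime only when no expected function is missing.
    let missing: Vec<&str> = CORE_RUNTIME_FUNCTIONS
        .iter()
        .copied()
        .filter(|f| !functions.contains(*f))
        .collect();
    if missing.is_empty() {
        Ok(Health::CoreRuntime)
    } else {
        Err(format!(
            "harness probe failed ({harness_err}); core functions missing: {}",
            missing.join(", ")
        ))
    }
}

fn main() {
    let all: Vec<String> = CORE_RUNTIME_FUNCTIONS.iter().map(|s| s.to_string()).collect();
    let outcome = evaluate_health(Err("harness::status unavailable".into()), Ok(all));
    println!("{outcome:?}");
}
```

Returning `Err` only when both probes fail keeps the "exit nonzero" behaviour from the summary, and carrying the harness error into the message lets the report say which probe failed.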

Changes

Runtime probes, session-tree ops, payloads, and docs

| Layer / File(s) | Summary |
|---|---|
| README updates and usage docs `README.md` | Update setup verification (probe order), session slash commands, `--system-prompt` behavior, doctor probe output categories, and config guidance. |
| Example configuration `config.example.yaml` | Add `workers` example with state/stream/http/shell/security policies, provider defaults, credentials linkage, sandbox resource limits, and runtime params. |
| Core runtime functions constant `src/app.rs` | Add `CORE_RUNTIME_FUNCTIONS` constant and `core_function_list_json()` test helper to mock `engine::functions::list`. |
| CLI dispatch and chat session init `src/app.rs` | Extend chat command dispatch and refactor chat startup to derive initial session id/source (cwd resume/new). |
| Chat shell session-tree commands `src/app.rs` | Wire `/status`, `/tree`, `/clone`, `/export`, `/compact` into chat handler, add session-status/tree/clone/export/compact helpers, and update chat help. |
| Doctor & health_probe refactor and helpers `src/app.rs` | Refactor `doctor`/`health_probe` to probe harness first then core-stack fallback (`engine::functions::list`), add workspace fs and auth probing helpers, and replace hard-fail probe helper with nonfatal reporters. |
| Probe and health tests `src/app.rs` | Update/add tests to accept core-stack readiness when harness probe fails, and to fail when both harness and core validation fail; add workspace fs and provider-auth test cases. |
| CLI argument additions and tests `src/cli.rs` | Add `Tree/Clone/Export/Compact/Status` CLI variants, `ChatArgs::new` flag, associated arg structs, and extend parsing tests (including `--output`). |
| Session payload builders and system_prompt change `src/payload.rs` | Introduce `SessionCompactPayloadParams`, build_session_clone/tree/export/compact helpers, and make `build_run_payload` omit `system_prompt` when not provided; update unit tests accordingly. |

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Possibly related PRs

rohitg00/iii-code#3: Modifies chat/CLI session-tree handling and slash-command wiring; related session-command additions overlap with this PR.
rohitg00/iii-code#1: Introduced the initial CLI and probing logic that this PR extends (doctor/health probe and payload/cli areas).
rohitg00/iii-code#2: Contains sandbox/setup guidance overlapping with the README and config example changes here.

Poem

🐰 I checked the harness, then peeked at the core,
I hopped through sessions, adding commands galore,
Payloads compact, configs pinned tight,
Doctor reports shine in the soft morning light,
A tiny rabbit cheers the repo tonight.

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

| Check name | Status | Explanation | Resolution |
|---|---|---|---|
| Docstring Coverage | ⚠️ Warning | Docstring coverage is 23.33% which is insufficient. The required threshold is 80.00%. | Write docstrings for the functions missing them to satisfy the coverage threshold. |

✅ Passed checks (4 passed)

| Check name | Status | Explanation |
|---|---|---|
| Description Check | ✅ Passed | Check skipped - CodeRabbit’s high-level summary is enabled. |
| Title check | ✅ Passed | The title 'Accept core runtime health fallback' accurately summarizes the main change in the PR: implementing fallback behavior to accept core worker functions when harness::status is unavailable, as confirmed by the PR objectives and the significant logic changes in src/app.rs. |
| Linked Issues check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |
| Out of Scope Changes check | ✅ Passed | Check skipped because no linked issues were found for this pull request. |

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (3)

src/payload.rs (2)
421-444: 💤 Low value

Optional: cover the Some("") system_prompt branch.

The current tests verify None omits the field and Some("custom") includes it, but the new .filter(|s| !s.is_empty()) branch (empty string also omitted) is untested. A one-line assertion would lock in that behavior so a future refactor of the filter doesn't silently regress.
♻️ Suggested addition
     #[test]
     fn build_run_payload_preserves_system_prompt_override() {
@@
         assert_eq!(payload["system_prompt"], "custom");
     }
+
+    #[test]
+    fn build_run_payload_omits_empty_system_prompt() {
+        let payload = build_run_payload(&RunPayloadParams {
+            session_id: "s1".into(),
+            messages: vec![],
+            provider: "anthropic".into(),
+            model: "claude-sonnet-4-6".into(),
+            system_prompt: Some(String::new()),
+            approval_required: vec![],
+            image: "python".into(),
+            idle_timeout_secs: 120,
+            max_turns: 3,
+            cwd: "/tmp/project".into(),
+            cwd_hash: "abc".into(),
+        });
+        assert!(payload.get("system_prompt").is_none());
+    }
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/payload.rs` around lines 421 - 444, The test suite lacks coverage for the
new .filter(|s| !s.is_empty()) branch in build_run_payload; add a test (or
extend build_run_payload_preserves_system_prompt_override) that calls
build_run_payload with RunPayloadParams.system_prompt set to Some("".into()) and
assert that payload.get("system_prompt").is_none() to ensure empty-string system
prompts are omitted; reference the build_run_payload function and the
RunPayloadParams struct so the assertion targets the same payload map shape as
the existing tests.
105-122: 💤 Low value

Implementation is correct; verify downstream harness handles missing system_prompt key.

The tests confirm the intended behavior: system_prompt is omitted from the payload when None or empty. Any downstream consumer (harness, router, session persistence) that previously read this field must now treat its absence the same as the old default behavior. This is outside the scope of the current codebase but worth confirming before merge.

Additionally, consider documenting this behavior in the README or API docs—specifically that Some("") is now silently treated as None, allowing users to clear the override and fall back to the canonical prompt.
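To make the omission semantics concrete, here is a minimal sketch of both sides of the contract. It assumes a simplified string map in place of the real JSON payload; `build_payload` and `effective_prompt` are hypothetical helpers for illustration, not functions from `src/payload.rs` or the harness.

```rust
use std::collections::BTreeMap;

/// Simplified stand-in for the payload; the real `build_run_payload` takes a
/// `RunPayloadParams` struct and returns JSON (assumption: only two fields shown).
fn build_payload(session_id: &str, system_prompt: Option<String>) -> BTreeMap<String, String> {
    let mut payload = BTreeMap::new();
    payload.insert("session_id".to_string(), session_id.to_string());
    // `Some("")` is filtered out, so an empty override behaves exactly like `None`.
    if let Some(prompt) = system_prompt.filter(|s| !s.is_empty()) {
        payload.insert("system_prompt".to_string(), prompt);
    }
    payload
}

/// Consumer side: a missing key means "fall back to the canonical prompt".
fn effective_prompt<'a>(payload: &'a BTreeMap<String, String>, canonical: &'a str) -> &'a str {
    payload
        .get("system_prompt")
        .map(String::as_str)
        .unwrap_or(canonical)
}

fn main() {
    let cleared = build_payload("s1", Some(String::new()));
    let custom = build_payload("s1", Some("custom".to_string()));
    println!("{}", effective_prompt(&cleared, "canonical prompt"));
    println!("{}", effective_prompt(&custom, "canonical prompt"));
}
```

A consumer written this way treats `None`, `Some("")`, and a missing key identically, which is the behaviour the comment asks downstream code to confirm.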
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/payload.rs` around lines 105 - 122, build_run_payload now omits the
"system_prompt" key when RunPayloadParams.system_prompt is None or Some("")
which requires downstream code to treat a missing key as the same as None/empty;
update any consumers (harness, router, session persistence) that read
payload["system_prompt"] to handle absence by falling back to the canonical
prompt (i.e., treat missing key as None/empty) and add a short note to the
API/README that Some("") is treated as clearing the override so callers can rely
on omission semantics.
src/cli.rs (1)
90-95: 💤 Low value

Add conflicts_with = "session_id" to the --new flag definition.

Both flags can be supplied together (chat --session-id s1 --new) and clap silently accepts it, leaving precedence to the dispatch layer. Using clap's conflicts_with attribute (already used elsewhere in this file) makes the intent explicit and gives users a clear error message instead of silently ignoring one argument.
♻️ Suggested change
     #[arg(long)]
     pub session_id: Option<String>,

-    #[arg(long, help = "Start a fresh session instead of resuming this cwd")]
+    #[arg(
+        long,
+        conflicts_with = "session_id",
+        help = "Start a fresh session instead of resuming this cwd"
+    )]
     pub new: bool,
🤖 Prompt for AI Agents
Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/cli.rs` around lines 90 - 95, The --new flag currently lacks a clap
conflict declaration so users can pass both --session-id and --new; update the
#[arg(...)] attribute on the pub new: bool field to include conflicts_with =
"session_id" (i.e., add conflicts_with = "session_id" to the #[arg(...)] on the
new field) so clap will produce a clear error when both session_id and new are
supplied.

🤖 Prompt for all review comments with AI agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

Inline comments:
In `@src/app.rs`:
- Around line 763-776: status_session/print_session_status must handle the
state::get envelope the same way as load_last_cwd_session and
extract_session_messages: check if the returned JSON is an object with a "value"
key and, if so, set value = value["value"] before accessing fields; update
print_session_status (or the caller status_session) to defensively peek for and
unwrap the {"value": ...} wrapper so session_id, turn_count, max_turns,
pending_function_calls, last_assistant, and updated_at_ms are read from the
inner object when present.

ℹ️ Review info

⚙️ Run configuration

Configuration used: defaults

Review profile: CHILL

Plan: Pro

Run ID: afd25e3e-e69b-405a-9b1b-bd32b5e30f31

📥 Commits

Reviewing files that changed from the base of the PR and between 76b068b and c17301a.

📒 Files selected for processing (5)

README.md
config.example.yaml
src/app.rs
src/cli.rs
src/payload.rs

coderabbitai · 2026-05-13T12:18:05Z

+fn status_session<R: CommandRunner, W: Write>(
+    client: &IiiClient<R>,
+    args: StatusArgs,
+    out: &mut W,
+) -> Result<()> {
+    let value = client
+        .trigger(
+            "state::get",
+            build_state_get_payload("agent", &format!("session/{}/turn_state", args.session_id)),
+            5_000,
+        )
+        .context("load durable turn state")?;
+    print_session_status(&value, out)
+}


⚠️ Potential issue | 🟡 Minor | ⚡ Quick win

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# Locate state::get handler / contract to confirm response shape.
rg -nP -C5 '"state::get"|fn\s+state_get|state::get\b' --type=rust
fd -e md -e rst | xargs rg -nP -C3 'state::get' 2>/dev/null

Repository: rohitg00/iii-code

Length of output: 2679

🏁 Script executed:

# Find print_session_status implementation - it's defined somewhere in the file
rg -n "fn print_session_status" --type=rust
# Find extract_session_messages implementation
rg -n "fn extract_session_messages" --type=rust
# Look for test fixtures or test data showing state::get response format
rg -n "state::get|turn_state|session.*message" --type=rust -B2 -A2 | head -100

Repository: rohitg00/iii-code

Length of output: 4688

🏁 Script executed:

# Search for tests that show actual response structures
rg -n "#\[test\]|#\[tokio::test\]" --type=rust -A 20 | grep -A 20 "state::get\|turn_state\|session"

Repository: rohitg00/iii-code

Length of output: 14164

🏁 Script executed:

# View print_session_status implementation
sed -n '1326,1380p' src/app.rs
# View extract_session_messages implementation
sed -n '1404,1425p' src/app.rs

Repository: rohitg00/iii-code

Length of output: 2519

🏁 Script executed:

# Search for tests that exercise status_session or print_session_status
rg -n "status_session\|print_session_status\|turn_state" --type=rust -B3 -A5

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Check if there are integration tests or test fixtures showing state::get responses
rg -n "session.*turn_state\|turn_state.*json\|status.*test" --type=rust -B2 -A3

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Search for mock responses or test fixtures showing response structure
rg -n "\"value\"\s*:" --type=rust -B1 -A1 | head -50
# Look for any documentation about state::get
find . -type f \( -name "*.md" -o -name "*.txt" -o -name "*.rs" \) | xargs rg -l "state::get\|turn_state" 2>/dev/null | head -10

Repository: rohitg00/iii-code

Length of output: 434

🏁 Script executed:

# Check if status_session is actually tested or used anywhere
rg -n "Status|status.*session" --type=rust src/cli.rs src/app.rs | grep -i "status\|Status"

Repository: rohitg00/iii-code

Length of output: 639

🏁 Script executed:

# Check StatusArgs and command parsing
sed -n '304,315p' src/cli.rs
# Check if there are any tests for the status command behavior
rg -n "#\[test\]" --type=rust -A 30 src/app.rs | grep -A 30 "status"

Repository: rohitg00/iii-code

Length of output: 10320

🏁 Script executed:

# Look for documentation or comments about state::get response format
rg -n "state::get\|turn_state" --type=rust -B2 -A2 src/app.rs | head -80

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Look for any explicit test data or documentation showing response structure
rg -n "turn_state\|session.*status\|\"messages\"" --type=rust src/app.rs | head -40
# Search for where state::set is called to understand payload patterns
rg -n "state::set\|build_state_set" --type=rust -B2 -A2

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Look at the build_state_get_payload to understand contract
rg -n "build_state_get_payload" --type=rust src/payload.rs -B2 -A5

Repository: rohitg00/iii-code

Length of output: 740

🏁 Script executed:

# Check if there are any real examples or integration tests showing state::get response
find . -type f -name "*.rs" -o -name "*.json" -o -name "*.txt" | xargs grep -l "session.*turn_state\|turn_state.*json" 2>/dev/null
# Look for chat session status output or expected format
rg -n "session:" --type=rust -B3 -A3 src/app.rs | head -50

Repository: rohitg00/iii-code

Length of output: 1784

🏁 Script executed:

# Search for comments or docs about state get/set behavior
rg -n "state::get\|state::set" --type=rust -B5 -A1 src/app.rs | grep -E "//|fn build|trigger" | head -30

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Search for actual test mocking of state::get responses or real response examples
rg -n "MockRunner::ok.*session\|MockRunner::ok.*state" --type=rust -A1 | head -60
# Check if there's any documentation in comments about response envelope
rg -n "//" --type=rust src/app.rs | grep -i "state\|get\|value\|envelope" | head -20

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Look at how load_last_cwd_session is tested to understand response shape
sed -n '2100,2200p' src/app.rs

Repository: rohitg00/iii-code

Length of output: 3829

🏁 Script executed:

# Search for any tests of the status command or turn_state
rg -n "turn_state\|Status\(" --type=rust src/app.rs -B2 -A4
# Check if there are integration tests or fixtures
find . -type f \( -name "*test*" -o -name "*fixture*" \) | head -20

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Look at the chat session commands that use state::get to see if there's a pattern
sed -n '745,800p' src/app.rs

Repository: rohitg00/iii-code

Length of output: 1523

🏁 Script executed:

# Look for any code that sets turn_state to understand what value shape is stored
rg -n "turn_state\|session.*turn_state" --type=rust -B3 -A3
# Check if there are any references to the response envelope in comments
rg -n "envelope\|wrapped\|{\"value" --type=rust src/app.rs | head -20

Repository: rohitg00/iii-code

Length of output: 190

🏁 Script executed:

# Look at other state::get callers to see if there's a consistent pattern
rg -n "\.trigger.*state::get" --type=rust -A5 src/app.rs

Repository: rohitg00/iii-code

Length of output: 306

🏁 Script executed:

# Check the repair_session function which also uses state::get for messages
sed -n '900,920p' src/app.rs
# Also check load_session_messages which uses state::get
sed -n '1379,1402p' src/app.rs

Repository: rohitg00/iii-code

Length of output: 1345

🏁 Script executed:

# Look for any definition or schema documentation
find . -name "*.md" -o -name "*.txt" -o -name "*.toml" | xargs grep -l "state::get\|turn_state" 2>/dev/null

Repository: rohitg00/iii-code

Length of output: 73

🏁 Script executed:

# Check README for state::get documentation
grep -n "state::get\|turn_state" README.md -B2 -A2
# Also search the entire README
cat README.md | grep -i "state\|get\|response\|envelope"

Repository: rohitg00/iii-code

Length of output: 629

🏁 Script executed:

# Let me analyze the code pattern. Look at extract_session_messages more carefully
# and compare it to print_session_status
sed -n '1404,1420p' src/app.rs

Repository: rohitg00/iii-code

Length of output: 524

🏁 Script executed:

# Check if there are any tests for status_session or print_session_status
rg -n "status\|Status" --type=rust src/app.rs | grep -i test
# Look for chat command tests that might exercise status
sed -n '2100,2250p' src/app.rs | grep -A 20 test

Repository: rohitg00/iii-code

Length of output: 5263

🏁 Script executed:

# Key question: does extract_session_messages also need envelope unwrap?
# Let's trace what load_session_messages returns
rg -n "extract_session_messages\|load_session_messages" --type=rust -B2 -A2 src/app.rs | head -50

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Final check: look for any chat session status command tests
rg -n "chat.*status\|/status" --type=rust src/app.rs -B2 -A5

Repository: rohitg00/iii-code

Length of output: 43

🏁 Script executed:

# Check the chat shell command dispatcher for status handling
sed -n '500,600p' src/app.rs | grep -A 20 "status"

Repository: rohitg00/iii-code

Length of output: 756

print_session_status should defensively unwrap potential state::get envelope, consistent with other handlers in this file.

Other consumers of state::get responses defensively peek inside a {"value": ...} wrapper (see load_last_cwd_session at lines 448–452 and extract_session_messages at lines 1404–1415). However, print_session_status calls value.as_object() on the raw response and immediately reads session_id, turn_count, max_turns, pending_function_calls, last_assistant, and updated_at_ms at the top level. If state::get returns the stored turn state inside a value envelope, the function will match the outer object but silently render missing fields as "unknown"—a misleading status output.

Apply the same defensive pattern already used elsewhere in the file:

 fn print_session_status<W: Write>(value: &Value, out: &mut W) -> Result<()> {
-    let Some(object) = value.as_object() else {
+    let inner = value.get("value").unwrap_or(value);
+    let Some(object) = inner.as_object() else {
         return print_json(value, out);
     };
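The effect of the envelope on the rendered status can be illustrated with a small standalone sketch. It uses a hand-rolled `Json` enum as a stand-in for `serde_json::Value` so that it compiles without dependencies; `unwrap_envelope`, `field`, and `obj` are hypothetical helper names, and the "unknown" fallback mirrors the behaviour described in this comment rather than the exact code in `src/app.rs`.

```rust
use std::collections::BTreeMap;

/// Tiny stand-in for `serde_json::Value`, just enough for this sketch.
#[derive(Debug, PartialEq)]
enum Json {
    Num(i64),
    Str(String),
    Object(BTreeMap<String, Json>),
}

impl Json {
    /// Look up a key when this value is an object, like `Value::get`.
    fn get(&self, key: &str) -> Option<&Json> {
        match self {
            Json::Object(map) => map.get(key),
            _ => None,
        }
    }
}

/// Return the inner payload when the response is wrapped as {"value": ...},
/// otherwise return the response unchanged.
fn unwrap_envelope(response: &Json) -> &Json {
    response.get("value").unwrap_or(response)
}

/// Render one status field, falling back to "unknown" when it is absent.
fn field(state: &Json, key: &str) -> String {
    match state.get(key) {
        Some(Json::Str(s)) => s.clone(),
        Some(Json::Num(n)) => n.to_string(),
        _ => "unknown".to_string(),
    }
}

/// Helper to build an object from key/value pairs.
fn obj(entries: Vec<(&str, Json)>) -> Json {
    Json::Object(entries.into_iter().map(|(k, v)| (k.to_string(), v)).collect())
}

fn main() {
    let wrapped = obj(vec![(
        "value",
        obj(vec![("session_id", Json::Str("s1".into())), ("turn_count", Json::Num(2))]),
    )]);
    // Reading the raw response misses the fields; unwrapping first finds them.
    println!("raw: {}", field(&wrapped, "session_id"));
    println!("unwrapped: {}", field(unwrap_envelope(&wrapped), "session_id"));
}
```

Because `unwrap_envelope` returns the original value when no `value` key exists, the same call works for both wrapped and bare responses.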

🤖 Prompt for AI Agents

Verify each finding against current code. Fix only still-valid issues, skip the
rest with a brief reason, keep changes minimal, and validate.

In `@src/app.rs` around lines 763 - 776, status_session/print_session_status must
handle the state::get envelope the same way as load_last_cwd_session and
extract_session_messages: check if the returned JSON is an object with a "value"
key and, if so, set value = value["value"] before accessing fields; update
print_session_status (or the caller status_session) to defensively peek for and
unwrap the {"value": ...} wrapper so session_id, turn_count, max_turns,
pending_function_calls, last_assistant, and updated_at_ms are read from the
inner object when present.

Accept core runtime health fallback

667e495

rohitg00 added 3 commits May 13, 2026 12:43

Allow single provider auth health

8200a2e

Clarify harness health fallback output

76b068b

Use canonical harness prompt and session controls

c17301a

coderabbitai Bot reviewed May 13, 2026

rohitg00 merged commit 5c66b3d into main May 13, 2026
1 check passed

This was referenced May 13, 2026

Add coding-full worker profile #5

Merged

Add coding worker profile setup #7

Merged

Improve coding-full parity defaults #8

Open

rohitg00 merged 4 commits into main from feature/harness-terminal-shell
