feat(orchestrator): Phase B Units 10 + 11 — synthetic gate, expiry sweeper, shadow-summary, live-flip runbook (depends on #72068) by PeterPlatinum · Pull Request #72086 · openclaw/openclaw

PeterPlatinum · 2026-04-26T09:21:03Z

Summary

Closes the Phase B implementation arc. Three deliverables stacked on #72068:

R30 synthetic-task observability gate — `openclaw orchestrator synthetic-all` runs a 5-fixture deterministic harness end-to-end through routing + store + dispatch + trajectory. Exits non-zero if any fixture diverges from its expected agent / rule / terminal state. The gate is the operator-facing precondition for ever flipping mode away from synthetic.
Expiry sweeper service — `createExpirySweeper` registered via `api.registerService(...)` with the `stop` lifecycle hook on the service object (recon Q-3 settled — `OpenClawPluginService` is the right model for periodic gateway-resident work). Fires every 60 minutes by default; logs swept count.
Shadow-summary CLI verb + live-flip runbook — `openclaw orchestrator shadow-summary [--window 24]` reads the shadow archive, prints by-state counts + mean duration + window span, and exits non-zero if any spawn failure landed inside the window. The README's new "Live-flip runbook" makes the synthetic → shadow → live transition explicit.

Stacked PR — depends on #72068 → #72054 → #72039 → #72029. Land in that order; this PR's diff narrows after each merge.

What landed

Commit	Sub-unit	Files
`5de1c34002`	Units 10 + 11 main	`src/synthetic.ts` (deterministic harness + fixture loader + result formatter), `test/fixtures/synthetic-tasks.json` (5 fixtures: code-1, ops-1, research-1, writing-1, fallback-1), `src/expiry-sweeper.ts` (start/stop/runOnce + crash-tolerant logging), `src/shadow-summary.ts` (windowed shadow stats), three new CLI verbs in `src/cli.ts`, README runbook
follow-up fix	Unit 11 service shape	Aligns the registered service with `OpenClawPluginService.stop?` (object-level, not return-from-start), and hardens one spawn-watch test against TS narrowing weirdness around closure-captured `let`

CLI surface (cumulative across Units 7-11)

Verb	Purpose
`openclaw orchestrator init`	Generate bearer token (Unit 7).
`openclaw orchestrator rotate-token`	Rotate bearer token (Unit 7).
`openclaw orchestrator synthetic `	One synthetic fixture end-to-end.
`openclaw orchestrator synthetic-all`	Full synthetic harness (R30 gate).
`openclaw orchestrator shadow-summary [--window ]`	Shadow archive stats + live-flip gate.

Live-flip procedure (now documented in README)

```

openclaw orchestrator init # bearer token
openclaw orchestrator synthetic-all # data-plane gate
mode = "shadow" in ~/.openclaw/openclaw.json # 24h soak
openclaw orchestrator shadow-summary --window 24 # live-flip gate
mode = "live" # production
```

Rollback at any point is a config edit + restart; in-flight `awaiting_approval` tasks remain operator-actionable from the Approvals tab.

Boundaries respected

`synthetic.ts`, `expiry-sweeper.ts`, `shadow-summary.ts` import only Node built-ins, types from `./types/schema.ts`, and the previously-shipped `store.ts` / `routing.ts` / `dispatch.ts` / `trajectory.ts`. No `src/**` imports.
The expiry sweeper is the first time this extension uses `api.registerService`; the service shape matches `src/plugins/types.ts:1996-1999` with separate top-level `start` and `stop` hooks (not `stop` returned from `start`).

Test plan

`pnpm test extensions/orchestrator` — 172/172 pass (16 files; +30 new tests across synthetic, expiry-sweeper, shadow-summary)
`pnpm tsgo:all` — clean
Boundary contract still passing
`openclaw orchestrator synthetic-all` exits 0 when all fixtures pass; non-zero with structured reasons when any fail
`openclaw orchestrator shadow-summary` exits non-zero when any task in window is in `failed`
CI green
Live-flip dry run on Peter's machine after PR-set lands

Phase B is now feature-complete

This is the last openclaw-side unit in Plan 005. After this stack lands and the MC commits ship:

Synthetic mode produces visible task records with full trajectory in the Pipeline tab
Approvals tab handles approve/reject for synthetic awaiting_approval tasks
Shadow + live modes are gated behind the documented runbook
The expiry sweeper keeps the task archive bounded

🤖 Generated with Claude Code

…t 2)

…lity inference (Unit 3)

… lock CAS (Unit 4)

…through (Unit 5b)

…Unit 6a)

…Unit 6b)

…t 6c)

…(Unit 7a)

…t 7b)

…summary + live-flip runbook (Units 10 + 11)

…rvice stop signature

greptile-apps · 2026-04-26T09:24:05Z

Greptile Summary

This PR closes Phase B of the orchestrator extension by adding the R30 synthetic-task observability gate (synthetic.ts), an expiry sweeper service (expiry-sweeper.ts), and the shadow-summary CLI verb with live-flip runbook (shadow-summary.ts). The implementation is well-structured and the 30 new tests cover the happy paths and failure injection cases thoroughly.

Confidence Score: 4/5

Safe to merge with minor design concerns; no blocking bugs found.

All findings are P2: the production R30 gate reading from test/fixtures/, the non-idempotent init exit code, and the in_progress expiry gap. No P0/P1 issues were identified. The core state-machine logic, atomic IO, and service registration are sound.

extensions/orchestrator/src/synthetic.ts (fixture path), extensions/orchestrator/src/cli.ts (init exit code), extensions/orchestrator/src/store.ts (in_progress expiry gap)

Prompt To Fix All With AI

This is a comment left during a code review.
Path: extensions/orchestrator/src/synthetic.ts
Line: 59-65

Comment:
**Production gate reads from `test/fixtures/`**

`defaultFixturePath()` resolves the R30 fixture data through the `test/fixtures/` directory. The `openclaw orchestrator synthetic-all` command is the documented operator precondition for flipping out of synthetic mode, so `synthetic-tasks.json` is a production asset — not just a dev artifact. Binding it to the test directory makes the path brittle: any future build step, packaging pass, or directory restructuring that omits `test/` will silently break the gate at runtime. Consider moving `synthetic-tasks.json` to `src/fixtures/` (or a sibling `fixtures/` directory) and adjusting the path accordingly.

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: extensions/orchestrator/src/cli.ts
Line: 88-94

Comment:
**`init` exits non-zero when a token already exists**

When `init` finds a credential file and `--force` is not set, it prints the advisory and then sets `process.exitCode = 1`. Idiomatic CLI setup verbs (e.g. `git init`) treat an already-initialized state as a no-op success. Any automated script or CI step that runs `openclaw orchestrator init` as a setup guard will fail on every subsequent run after the first, forcing operators to either handle the exit code explicitly or always pass `--force`. Exiting 0 here and reserving the non-zero code for actual write failures would make the command safely composable.

How can I resolve this? If you propose a fix, please make it concise.

---

This is a comment left during a code review.
Path: extensions/orchestrator/src/store.ts
Line: 535-556

Comment:
**`in_progress` tasks are not eligible for expiry**

`STALE_ELIGIBLE` excludes `in_progress`, so `sweepExpired` silently skips any task in that state. In shadow/live mode, if a specialist session crashes without emitting a terminal event, the `spawn-watch` watcher has no timeout and the task stays `in_progress` indefinitely — the sweeper will never reclaim it. The `applyAction` guard also enforces this gap (`expire` throws on `in_progress`). This creates a category of tasks that can accumulate unboundedly and can never be expired or swept. If the intent is to avoid expiring truly active sessions, a documented stale-`in_progress` eviction path (e.g. a separate action type or a TTL-based fallback on the watcher side) would close the gap.

How can I resolve this? If you propose a fix, please make it concise.

_{Reviews (1): Last reviewed commit: "fix(orchestrator): align expiry-sweeper ..." | Re-trigger Greptile}

greptile-apps · 2026-04-26T09:24:09Z

+  // synthetic.ts lives at extensions/orchestrator/src/synthetic.ts. The
+  // canonical fixture file ships under test/fixtures/.
+  const here = dirname(fileURLToPath(import.meta.url));
+  return resolve(here, "..", "test", "fixtures", FIXTURE_FILE);
+}
+
+export function loadSyntheticFixtures(path?: string): SyntheticFixtureFile {


Production gate reads from test/fixtures/

defaultFixturePath() resolves the R30 fixture data through the test/fixtures/ directory. The openclaw orchestrator synthetic-all command is the documented operator precondition for flipping out of synthetic mode, so synthetic-tasks.json is a production asset — not just a dev artifact. Binding it to the test directory makes the path brittle: any future build step, packaging pass, or directory restructuring that omits test/ will silently break the gate at runtime. Consider moving synthetic-tasks.json to src/fixtures/ (or a sibling fixtures/ directory) and adjusting the path accordingly.

Prompt To Fix With AI

This is a comment left during a code review. Path: extensions/orchestrator/src/synthetic.ts Line: 59-65 Comment: **Production gate reads from `test/fixtures/`** `defaultFixturePath()` resolves the R30 fixture data through the `test/fixtures/` directory. The `openclaw orchestrator synthetic-all` command is the documented operator precondition for flipping out of synthetic mode, so `synthetic-tasks.json` is a production asset — not just a dev artifact. Binding it to the test directory makes the path brittle: any future build step, packaging pass, or directory restructuring that omits `test/` will silently break the gate at runtime. Consider moving `synthetic-tasks.json` to `src/fixtures/` (or a sibling `fixtures/` directory) and adjusting the path accordingly. How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-04-26T09:24:10Z

+        path,
+        token: generateToken(),
+        ...(deps.now != null ? { now: deps.now } : {}),
+      });
+      out.write(describe(credentials, "created"));
+    });
+


init exits non-zero when a token already exists

When init finds a credential file and --force is not set, it prints the advisory and then sets process.exitCode = 1. Idiomatic CLI setup verbs (e.g. git init) treat an already-initialized state as a no-op success. Any automated script or CI step that runs openclaw orchestrator init as a setup guard will fail on every subsequent run after the first, forcing operators to either handle the exit code explicitly or always pass --force. Exiting 0 here and reserving the non-zero code for actual write failures would make the command safely composable.

Prompt To Fix With AI

This is a comment left during a code review. Path: extensions/orchestrator/src/cli.ts Line: 88-94 Comment: **`init` exits non-zero when a token already exists** When `init` finds a credential file and `--force` is not set, it prints the advisory and then sets `process.exitCode = 1`. Idiomatic CLI setup verbs (e.g. `git init`) treat an already-initialized state as a no-op success. Any automated script or CI step that runs `openclaw orchestrator init` as a setup guard will fail on every subsequent run after the first, forcing operators to either handle the exit code explicitly or always pass `--force`. Exiting 0 here and reserving the non-zero code for actual write failures would make the command safely composable. How can I resolve this? If you propose a fix, please make it concise.

greptile-apps · 2026-04-26T09:24:11Z

+        if (TERMINAL.has(task.state)) {
+          continue;
+        }
+        if (!STALE_ELIGIBLE.has(task.state)) {
+          continue;
+        }
+        if (localNow() <= new Date(task.expiresAt).getTime()) {
+          continue;
+        }
+        try {
+          const expired = transition(task.id, { type: "expire" }, { kind, holderId: "sweeper" });
+          swept.push(expired);
+        } catch (err) {
+          if ((err as StoreError).code === "lock_held") {
+            continue;
+          }
+          throw err;
+        }
+      }
+    }
+    return swept;
+  }


in_progress tasks are not eligible for expiry

STALE_ELIGIBLE excludes in_progress, so sweepExpired silently skips any task in that state. In shadow/live mode, if a specialist session crashes without emitting a terminal event, the spawn-watch watcher has no timeout and the task stays in_progress indefinitely — the sweeper will never reclaim it. The applyAction guard also enforces this gap (expire throws on in_progress). This creates a category of tasks that can accumulate unboundedly and can never be expired or swept. If the intent is to avoid expiring truly active sessions, a documented stale-in_progress eviction path (e.g. a separate action type or a TTL-based fallback on the watcher side) would close the gap.

Prompt To Fix With AI

This is a comment left during a code review. Path: extensions/orchestrator/src/store.ts Line: 535-556 Comment: **`in_progress` tasks are not eligible for expiry** `STALE_ELIGIBLE` excludes `in_progress`, so `sweepExpired` silently skips any task in that state. In shadow/live mode, if a specialist session crashes without emitting a terminal event, the `spawn-watch` watcher has no timeout and the task stays `in_progress` indefinitely — the sweeper will never reclaim it. The `applyAction` guard also enforces this gap (`expire` throws on `in_progress`). This creates a category of tasks that can accumulate unboundedly and can never be expired or swept. If the intent is to avoid expiring truly active sessions, a documented stale-`in_progress` eviction path (e.g. a separate action type or a TTL-based fallback on the watcher side) would close the gap. How can I resolve this? If you propose a fix, please make it concise.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: b3ba1e66c7

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

chatgpt-codex-connector · 2026-04-26T09:27:10Z

+        requiredCapabilities: body.requiredCapabilities ?? [],
+        submittedBy: body.submittedBy ?? submittedByDefault,
+        kind: body.kind ?? "synthetic",


Enforce synthetic kind in synthetic-only submit route

POST /orchestrator/tasks is explicitly gated to synthetic mode, but the handler persists caller-controlled body.kind directly. A client can submit kind: "live" or "shadow" while mode === "synthetic", which writes tasks into the wrong namespace and bypasses the intended mode boundary. This can contaminate live/shadow task stores and undermine the synthetic/shadow gating flow; the route should derive kind from mode or reject non-synthetic kinds here.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-04-26T09:27:10Z

+          const reason = body.reason ?? "";
+          if (reason.trim() === "" || reason.length > 1024) {


Type-check reject reason before calling trim

The reject transition path assumes reason is a string and immediately calls reason.trim(). If the client sends a non-string JSON value (for example {"action":"reject","reason":123}), this throws at runtime and the request fails as an internal error instead of returning invalid_reason. Add a string type guard before trim/length validation so invalid payloads are handled as 400s.

Useful? React with 👍 / 👎.

…t idempotency, kind boundary, reason type guard - Move synthetic-tasks.json from test/fixtures/ to src/fixtures/. The fixture is a production asset (the live-flip runbook gates on synthetic-all), so it must ship under the package boundary. - init: drop process.exitCode = 1 when a token already exists. Idempotent re-runs in setup scripts now exit 0; nonzero is reserved for actual write failures. - POST /orchestrator/tasks: force kind='synthetic' since the route is mode-gated. Trusting body.kind would let a client write live/shadow tasks into the synthetic namespace. - POST /tasks/<id>/transition reject: type-guard reason before .trim(). A non-string reason now returns 400 invalid_reason instead of crashing to 500.

… (companion to 06924c4 fixture relocation) The previous commit added src/fixtures/synthetic-tasks.json but the staging step missed deleting the original at test/fixtures/. The runtime resolver already points at src/fixtures/, so the leftover was unreferenced — this just removes the dead copy so the move is complete. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

PeterPlatinum · 2026-04-26T09:50:12Z

Bot review followups landed

Pushed `06924c4985` + `799c312aa5` addressing four of the five bot findings:

Finding	Source	Fix
Production gate reads from `test/fixtures/`	Greptile	Moved `synthetic-tasks.json` → `src/fixtures/`. Resolver updated; new test verifies the relocated path.
`init` exits 1 when token already exists	Greptile	Removed `process.exitCode = 1` so re-runs are idempotent (`exit 0`). Added `init: idempotent re-run (exit 0)` test.
Synthetic-mode submit honors `body.kind`	Codex	Forced `kind: "synthetic"` since the route is already mode-gated. New test asserts `{kind: "live"}` payload still lands in the synthetic namespace.
`reason.trim()` crashes on non-string	Codex	Type-guarded with `typeof reason !== "string"` before `.trim()`. New test asserts `{reason: 123}` returns 400 `invalid_reason` instead of 500.

Deferred: the `in_progress` expiry gap (Greptile P2) is dormant in v0 (synthetic mode never produces real `in_progress` liveness), but matters before the shadow/live cutover. Tracked in #72095.

Local: `pnpm test extensions/orchestrator` → 175/175 ✅

CI is currently red on this branch due to upstream lockfile drift (`extensions/diagnostics-prometheus/package.json` added `@openclaw/plugin-sdk@workspace:*` without a corresponding `pnpm-lock.yaml` refresh in 0f2e7510cb). Once that lands, this branch should rebase green.

clawsweeper · 2026-04-26T10:48:41Z

Closing this as better suited for ClawHub/community plugin work after Codex automated review.

Close as ClawHub/plugin work. Current main does not contain the orchestrator plugin, and the PR adds an optional heavy orchestration layer using plugin-style CLI, HTTP route, and service surfaces that OpenClaw already exposes. The project vision directs optional capabilities and heavy orchestration layers away from core unless there is explicit maintainer product sponsorship.

Best possible solution:

Close this OpenClaw core PR and move the orchestrator work to an external ClawHub/npm plugin that uses the existing plugin CLI, HTTP route, and service APIs. If external implementation exposes a concrete missing SDK seam, open a narrow plugin API design issue or a maintainer-sponsored core proposal instead.

What I checked:

Project scope guardrail: VISION.md says optional capability should usually ship as plugins, plugin discovery/promotion belongs in ClawHub, and the bar for adding optional plugins to core is intentionally high. (VISION.md:52, 6d60b035b4e7)
Heavy orchestration guardrail: VISION.md lists agent-hierarchy frameworks and heavy orchestration layers as things OpenClaw will not merge by default for now. (VISION.md:106, 6d60b035b4e7)
External plugin path exists: Plugin docs state plugins extend OpenClaw with new capabilities and do not need to be added to the OpenClaw repository; they can be published to ClawHub or npm and installed by users. Public docs: docs/plugins/building-plugins.md. (docs/plugins/building-plugins.md:11, 6d60b035b4e7)
Needed plugin APIs already exist: Current plugin API includes registerHttpRoute, registerCli, and registerService, matching the PR's claimed implementation surfaces without showing a missing core SDK seam. (src/plugins/types.ts:2088, 6d60b035b4e7)
Not implemented on current main: Current main has no extensions/orchestrator tree and no orchestrator labeler/runtime command strings such as synthetic-all, shadow-summary, or orchestrator-bearer. (6d60b035b4e7)
PR discussion handled review followups: The PR discussion records useful bot-review fixes in 06924c4 and 799c312, while deferring the remaining in_progress expiry design to orchestrator store: in_progress tasks have no expiry path #72095. (799c312aa59e)

So I’m closing this as a scope-fit item for the plugin/community path rather than keeping it open as an OpenClaw core request.

Codex Review notes: model gpt-5.5, reasoning high; reviewed against 6d60b035b4e7.

Peter van Wyk added 14 commits April 26, 2026 08:58

docs(orchestrator): recon notes for Phase B implementation plan

5e16394

feat(orchestrator): scaffold extension package and labeler

ebd498c

feat(orchestrator): add wire schema types and hash contract test (Uni…

38a648b

…t 2)

feat(orchestrator): add deterministic routing engine and agent-capabi…

4075738

…lity inference (Unit 3)

feat(orchestrator): add file-backed task store with atomic writes and…

aa2b442

… lock CAS (Unit 4)

feat(orchestrator): install Fleet Orchestrator agent template (Unit 5a)

d021a65

feat(orchestrator): dispatch via inbox watch with synthetic-mode pass…

77e3cda

…through (Unit 5b)

feat(orchestrator): add task.* trajectory writer with sidecar JSONL (…

8503e5e

…Unit 6a)

feat(orchestrator): add spawn-watch poller for subagent_done events (…

eb7e749

…Unit 6b)

feat(orchestrator): emit task.* events from dispatch transitions (Uni…

33f3e12

…t 6c)

feat(orchestrator): bearer credentials + init/rotate-token CLI verbs …

27b3070

…(Unit 7a)

feat(orchestrator): HTTP route handlers and gateway registration (Uni…

75ea670

…t 7b)

feat(orchestrator): synthetic-harness gate + expiry sweeper + shadow-…

5de1c34

…summary + live-flip runbook (Units 10 + 11)

fix(orchestrator): align expiry-sweeper service with OpenClawPluginSe…

b3ba1e6

…rvice stop signature

openclaw-barnacle Bot added the size: XL label Apr 26, 2026

greptile-apps Bot reviewed Apr 26, 2026

View reviewed changes

chatgpt-codex-connector Bot reviewed Apr 26, 2026

View reviewed changes

PeterPlatinum mentioned this pull request Apr 26, 2026

orchestrator store: in_progress tasks have no expiry path #72095

Closed

clawsweeper Bot mentioned this pull request Apr 26, 2026

feat(orchestrator): Phase B Unit 5 — Fleet Orchestrator agent template + dispatch (depends on #72029) #72039

Closed

5 tasks

clawsweeper Bot closed this Apr 26, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(orchestrator): Phase B Units 10 + 11 — synthetic gate, expiry sweeper, shadow-summary, live-flip runbook (depends on #72068)#72086

feat(orchestrator): Phase B Units 10 + 11 — synthetic gate, expiry sweeper, shadow-summary, live-flip runbook (depends on #72068)#72086
PeterPlatinum wants to merge 16 commits into
openclaw:mainfrom
PeterPlatinum:feat/orchestrator-unit-10-11

PeterPlatinum commented Apr 26, 2026

Uh oh!

greptile-apps Bot commented Apr 26, 2026

Uh oh!

greptile-apps Bot Apr 26, 2026

Uh oh!

greptile-apps Bot Apr 26, 2026

Uh oh!

greptile-apps Bot Apr 26, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot Apr 26, 2026

Uh oh!

chatgpt-codex-connector Bot Apr 26, 2026

Uh oh!

PeterPlatinum commented Apr 26, 2026

Uh oh!

clawsweeper Bot commented Apr 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

		const reason = body.reason ?? "";
		if (reason.trim() === "" \|\| reason.length > 1024) {

Uh oh!

Conversation

PeterPlatinum commented Apr 26, 2026

Summary

What landed

CLI surface (cumulative across Units 7-11)

Live-flip procedure (now documented in README)

Boundaries respected

Test plan

Phase B is now feature-complete

Uh oh!

greptile-apps Bot commented Apr 26, 2026

Greptile Summary

Confidence Score: 4/5

Uh oh!

greptile-apps Bot Apr 26, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps Bot Apr 26, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps Bot Apr 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot Apr 26, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot Apr 26, 2026

Choose a reason for hiding this comment

Uh oh!

PeterPlatinum commented Apr 26, 2026

Bot review followups landed

Uh oh!

clawsweeper Bot commented Apr 26, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant