feat(dx): real-workerd DO test, README + docs/concepts, 3 examples by drewstone · Pull Request #30 · tangle-network/agent-runtime

drewstone · 2026-05-22T22:13:28Z

Summary

Three days of substrate changes (#28 model-resolution, #29 durable-run supervisor + DO) shipped with no docs and no real-workerd test. This catches the surface up — without over-scoping into a substrate lift.

Test it for real

@cloudflare/vitest-pool-workers + wrangler.workers.toml + a separate vitest.workers.config.ts — runs SessionSupervisorDO under real workerd with real DurableObjectState, real ReadableStream, real Response. The fake-state Node suite covers the supervisor logic exhaustively; this proves the same host works against the actual Cloudflare runtime.
Pinned to @cloudflare/vitest-pool-workers ^0.8 — v0.16+ requires vitest 4; pinned to the latest vitest-3-compatible pool.
The default pnpm test is unaffected: vitest.config.ts excludes tests/workers/**, which only pnpm test:workers runs.

Docs

README rewrite — the entry-point table now covers every current primitive: durable turn, supervisor + DO, model resolution, defineAgent, durable chat-turn engine, analyst loop, platform clients. New sections for the durable supervisor, DO host wiring (wrangler.toml block), model resolution, defineAgent.
docs/concepts.md — real mental-model doc: the five layers, the three durability levels (runAgentTask / runDurableTurn / runSupervisedTurn), the SandboxReconnectAdapter contract, model resolution, reading order for new consumers. Replaces the dead src/index.ts JSDoc link.

New examples (3)

All offline-runnable; each in its own dir with a short README.

examples/model-resolution/ — resolveChatModel (precedence), validateChatModelId (fail-closed admission + allowlist short-circuit), withConfiguredModels.
examples/durable-supervisor/ — the cross-worker resume keystone: w1 drains 2 of 5 events, lease lapses, w2 resumes from cursor → caller sees the complete sequence exactly once.
examples/agent-into-reviewer/ — pipe one runtime's stream into a reviewer agent (the "two-runtime" pattern). Reviewer is just an AgentAdapter; no separate framework.

Out of scope (deliberate — would be its own focused PR)

runAnalystEval lift (the ~1,200 LOC byte-identical analyst-loop.ts duplication across 4 agents). Real, but it's a substrate primitive + multi-repo refactor — should not muddy a docs/test PR.
parseAgentStream (the 4-5 hand-rolled NDJSON decoders).
runSimulatedConversation / retire AgentDriver. Different repo.

Test plan

pnpm typecheck — 0 errors
pnpm test — 251 passed (17 files)
pnpm test:workers — 2 passed under real workerd
pnpm exec biome check src tests examples — clean (1 pre-existing warning)
pnpm build — green
All 3 new examples runnable end-to-end via pnpm tsx examples/<dir>/<name>.ts

…ples Three days of substrate changes (#28 model-resolution, #29 durable-run supervisor + DO) shipped with no docs and no real-workerd test. This catches the surface up. - Real-workerd integration test for SessionSupervisorDO via @cloudflare/vitest-pool-workers — runs the DO under actual workerd with real DurableObjectState + real ReadableStream + real Response. Pinned to a vitest-3-compatible pool version; separate vitest.workers.config.ts so the main Node suite is unaffected. - README rewrite: entry-point table covers every current primitive — durable turn, supervisor + DO, model resolution, defineAgent, durable chat-turn engine, analyst loop, platform clients. - docs/concepts.md: real mental-model doc — the five layers, the three durability levels, the reconnect-adapter contract, model resolution, reading order for new consumers. Replaces the dead src/index.ts JSDoc link. - examples/model-resolution — resolveChatModel + validateChatModelId (fail-closed) + withConfiguredModels. Runs offline. - examples/durable-supervisor — cross-worker resume keystone: w1 drains 2 of 5, lease lapses, w2 resumes from cursor, full sequence exactly once. Runs offline. - examples/agent-into-reviewer — pipe one runtime's stream into a reviewer agent (the "two-runtime" pattern). Runs offline. Verified: typecheck 0, Node suite 251, workerd suite 2 (real DO), biome clean, build green. All three new examples runnable end-to-end.

tangletools

Verified. Adds @cloudflare/vitest-pool-workers (pinned to v0.8 for vitest-3 compat) + wrangler.workers.toml + a separate vitest.workers.config.ts so SessionSupervisorDO is exercised under real workerd (real DurableObjectState + ReadableStream + Response). README rewrite covers every current primitive in the entry-point table; docs/concepts.md is a real mental-model doc replacing the dead JSDoc link. Three new offline examples — model-resolution, durable-supervisor (cross-worker resume keystone), agent-into-reviewer (two-runtime pipe pattern) — all runnable. typecheck 0, Node suite 251 green, workerd suite 2 green, biome clean (1 pre-existing warning), build green. Substrate lifts (runAnalystEval, parseAgentStream) deliberately deferred to their own focused PRs.

tangletools approved these changes May 22, 2026

View reviewed changes

drewstone merged commit e7f1707 into main May 22, 2026
1 check passed

drewstone deleted the feat/dx-docs-examples-workerd-test branch May 22, 2026 22:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(dx): real-workerd DO test, README + docs/concepts, 3 examples#30

feat(dx): real-workerd DO test, README + docs/concepts, 3 examples#30
drewstone merged 1 commit into
mainfrom
feat/dx-docs-examples-workerd-test

drewstone commented May 22, 2026

Uh oh!

tangletools left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

drewstone commented May 22, 2026

Summary

Test it for real

Docs

New examples (3)

Out of scope (deliberate — would be its own focused PR)

Test plan

Uh oh!

tangletools left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants