Captain Claw v0.6.5 Release Notes
Release title: Teams that wait for each other — Vatra rendezvous, specialist tiers, and a tidier deck
Release date: 2026-06-29
A polish-and-plumbing release on top of 0.6.4's Frontier Horizon. The headline is a
wait rendezvous for Vatra: a specialist that genuinely depends on a teammate's
artifact now blocks until it's ready instead of improvising a guess. Around it: the
Library gains dedicated coding and vision tiers (so code and image work route to the
right model), Basna/Vatra/Plan runs get a sticky "what phase is running now" pill plus
wait events in the timeline, Plan mode's live agent cards finally show per-agent token
counts, and a new Agent Folders admin page lets you inspect and clean up fd-data
subfolders. Fully additive and backward compatible with 0.6.4 — nothing changes for existing
runs unless you reach for it.
What's new
Vatra wait — agents block on a teammate's artifact instead of guessing
Agents on a Vatra team often look for a piece a teammate is supposed to produce — a VFS file
or a board post — don't find it yet, and improvise a guess. A new blocking wait
action lets a specialist that truly depends on a specific artifact pause until it's ready.
- Two conditions. wait takes a path (vfs:<project>/<file>) or a query (board
  keywords). The /agent/wait endpoint long-polls the VFS file or the blackboard every
  second.
- Bounded, never hangs a run. Capped at 90s (tool timeout raised to 120s), which is
  below the dispatch timeout, so a wait can't stall the run. On timeout it returns a board
  digest plus a "proceed deliberately" nudge, so the agent does the right something else
  rather than erroring out. Sibling owners keep working while one awaits (they're concurrent
  coroutines).
- Visible in the timeline. A wait emits a wait stage event when it starts (⏳ … ≤Ns),
  when it resolves (✓ got <file> / board match), and on timeout (⌛ not ready), so a
  run shows exactly who is blocked on what instead of a waiting agent just looking slow.
- Steered, not forced. Worker prompts now point agents to wait when genuinely blocked,
  and no longer say "don't stall waiting" unconditionally: don't busy-poll, but wait once
  when you really depend on something. There's no forced depends_on ordering; the tool is
  the whole mechanism. Owner+project file resolution is sandbox-checked (resolve_under), so
  the multi-user Flight Deck process serves the right run's files and rejects escapes.
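The bounded wait described above can be sketched roughly as follows. This is a minimal illustration, not the shipped endpoint: the function name `wait_for_artifact`, the `is_ready` callback, and the shape of the returned dictionary are all assumptions made for the example. Only the one-second poll interval and the 90s cap come from the notes.

```python
import asyncio
import time
from typing import Awaitable, Callable

POLL_INTERVAL_S = 1.0  # the notes say the endpoint polls every second
MAX_WAIT_S = 90.0      # the notes cap a wait at 90s


async def wait_for_artifact(
    is_ready: Callable[[], Awaitable[bool]],  # hypothetical readiness check
    timeout_s: float = MAX_WAIT_S,
    poll_interval_s: float = POLL_INTERVAL_S,
) -> dict:
    """Poll is_ready() until it returns True or the capped deadline passes."""
    timeout_s = min(timeout_s, MAX_WAIT_S)  # a caller can never exceed the cap
    deadline = time.monotonic() + timeout_s
    while True:
        if await is_ready():
            return {"status": "ready"}
        remaining = deadline - time.monotonic()
        if remaining <= 0:
            # Assumed payload shape; per the notes the real response carries
            # a board digest plus a "proceed deliberately" nudge.
            return {"status": "timeout", "hint": "proceed deliberately"}
        # Sleeping yields to the event loop, so sibling coroutines keep running.
        await asyncio.sleep(min(poll_interval_s, remaining))
```

Because a timeout returns a normal value instead of raising, the caller always gets a result it can act on, which matches the "never hangs a run" behaviour described above.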
Coding & Vision tiers in the Library
Two new model tiers join the Library so work routes to a model suited for it:
- coding (default opus-4-8, 65 536 output context) for multi-file diffs and debugging.
- vision (default sonnet-4-6) for image / screenshot / document understanding.
Both are wired through every touchpoint — the archetypes.json tier table, the backend
_VALID_TIERS, Dubina's TIER_ORDER, consciousness's _TIER_RANK, and the frontend
TIER_ORDER. A backfillTierMap makes existing saved tier sets gain the new cards
(inheriting their provider/key) instead of rendering empty. Code-producing archetypes
(code-implementer, debugger, simplifier) are reassigned from balanced to coding,
and a new "Vision & Multimodal" family (visual-extractor, ui-reviewer,
brand-design-reviewer) plus vision routing lines in the project-coordinator and concierge
rosters round it out.
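The backfill idea can be illustrated with a short Python sketch. The real backfillTierMap is not shown in these notes, so everything here is an assumption: the function name `backfill_tier_map`, the `provider` / `api_key` / `model` field names, and the rule of borrowing credentials from the first saved tier are all hypothetical. Only the two tier names and their default models come from the notes.

```python
from copy import deepcopy

# Default models for the new tiers, as listed in the notes.
NEW_TIER_DEFAULTS = {
    "coding": {"model": "opus-4-8"},
    "vision": {"model": "sonnet-4-6"},
}


def backfill_tier_map(saved: dict) -> dict:
    """Return a copy of `saved` with any missing new tiers added."""
    result = deepcopy(saved)
    # Assumed rule: borrow provider/key from the first tier already saved.
    donor = next(iter(saved.values()), {})
    for tier, defaults in NEW_TIER_DEFAULTS.items():
        if tier in result:
            continue  # never overwrite a card the user already configured
        result[tier] = {
            **defaults,
            "provider": donor.get("provider"),  # hypothetical field name
            "api_key": donor.get("api_key"),    # hypothetical field name
        }
    return result
```

Returning a copy and skipping tiers that already exist keeps the operation idempotent, so running it on every Library load would be harmless.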
Sticky phase pill + per-agent tokens in Plan mode
Runs emitted plenty of detail (action / usage / narration) but no clear "what phase is
running now" signal — under a long event stream the stage markers just scrolled away.
- Explicit phase events are now emitted at every transition and surfaced stickily in
  the Progress header (with a spinner while running), and phase rows are accented in the log:
  - Vatra: Planning → Spawning → Intro → Main → Review → Verifying slices → Synthesizing →
    Verifying deliverable → Learning → Done.
  - Basna: Routing → Spawning → Generating → Merging → Verifying → Done.
  - Plan: Step x/y: <goal> per step, plus Planning / Synthesizing / Done.
- Per-agent token counts in Plan mode. The plan-step progress mirror previously dropped
  usage events, so Plan-mode agent cards showed activity but no running token count. Usage
events (prompt / completion / total) are now forwarded into the parent plan log, so the
cards render the per-agent token header like a normal Basna run.
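To make the two changes above concrete, here is a small Python sketch of a progress mirror that forwards phase and usage events into a parent log. The class name `PlanLogMirror`, the event dictionary keys (`kind`, `agent`, `label`), and the assumption that usage events carry per-call deltas instead of running totals are all hypothetical. The notes only state that prompt / completion / total usage is now forwarded.

```python
from collections import defaultdict

# Assumed set of event kinds; before the fix "usage" would have been missing.
FORWARDED_KINDS = {"action", "narration", "phase", "usage"}


class PlanLogMirror:
    """Mirrors child-step events into the parent plan log."""

    def __init__(self) -> None:
        self.parent_log: list = []
        self.current_phase = None  # what a sticky header would display
        self.tokens = defaultdict(
            lambda: {"prompt": 0, "completion": 0, "total": 0}
        )

    def forward(self, event: dict) -> None:
        kind = event.get("kind")
        if kind not in FORWARDED_KINDS:
            return  # unknown kinds are dropped
        self.parent_log.append(event)
        if kind == "phase":
            self.current_phase = event["label"]
        elif kind == "usage":
            # Assumes each usage event is a delta, not a cumulative total.
            totals = self.tokens[event["agent"]]
            for field in ("prompt", "completion", "total"):
                totals[field] += event.get(field, 0)
```

Keeping the latest phase as separate state, instead of scanning the log for it, is what lets a header stay pinned while the event stream scrolls.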
Agent Folders — inspect and clean up fd-data
A new admin-gated Flight Deck page to inspect and clean up agent subfolders under
fd-data.
- What it shows. Each folder with its on-disk size, file counts, last-modified,
  and desktop presence: running / on-desktop / orphaned, where orphaned means a
  folder whose slug is no longer in the process registry.
- Backend (/fd/agentfs, admin-gated): list folders, list and view/download workspace
  files, and delete a folder (refused if the agent is still running). Path-sandboxed to
  direct children of fd-data.
- Frontend: AgentFoldersPage + agentFsStore, wired into routing and the sidebar.
  Light-theme-friendly colors, filters (running / on-desktop / orphaned / name search), and a
  rich file viewer that renders HTML (sandboxed iframe) and Markdown with GFM tables.
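The "direct children of fd-data" restriction can be sketched in Python with pathlib. This is one plausible way to enforce such a rule, not the project's actual check: the helper name `resolve_child_folder` and its error type are assumptions made for illustration.

```python
from pathlib import Path


def resolve_child_folder(root: Path, slug: str) -> Path:
    """Resolve `slug` under `root`, accepting only direct children of root."""
    root = root.resolve()
    # resolve() collapses ".." segments and follows symlinks, so the
    # comparison below sees the real target path.
    candidate = (root / slug).resolve()
    if candidate.parent != root:
        # Rejects "../x", nested "a/b", absolute paths, "." and "".
        raise ValueError(f"folder {slug!r} is outside the sandbox")
    return candidate
```

Comparing the resolved parent against the resolved root is stricter than a prefix check: it rejects nested paths as well as escapes, which fits a page that only ever operates on top-level agent folders.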
Upgrade notes
- No schema changes and no new required configuration. Everything above is additive.
- Library tiers. Existing saved tier configs are auto-backfilled with the new
  coding / vision cards (they inherit the provider/key of your existing tiers) the next
  time the Library loads; review them if you want code/vision work pointed at different
  models.
- New endpoints: POST /fd/<run>/agent/wait (Vatra) and the /fd/agentfs/* router
  (admin-gated). The wait action is available to Vatra workers automatically.
- Frontend bundle rebuilt for the phase pill, Plan token cards, tier UI, and Agent
  Folders page.
Backward compatible with 0.6.4.
Full Changelog: v0.6.3...v0.6.5