ultraswarm v2.4.0 — Portable Host Runner

@fubak fubak released this 12 Jun 18:12
· 67 commits to main since this release

Portability release: ultraswarm now runs in two co-equal ways: as the Claude Code /ultraswarm skill, or as a standalone CLI hosted from Codex, Grok, or any shell (no Claude Code required). Both share the same orchestration core and behave identically; the standalone runner gives up only the live /workflows UI in exchange for portability.

Added

  • Standalone host runner (bin/ultraswarm.mjs + lib/). A host-supplied (or fallback-decomposed) plan JSON runs through dependency waves → implement → adaptive QA → merge → report. Shares a host-agnostic pure core with the skill (router.mjs reused; QA cascade/competition lifted from SKILL.md, proven byte-for-byte by a parity harness). Impl wrappers are plain subprocesses — only the brain roles call an LLM.
    • Flags: --plan-file <json> · --decompose "<task>" (fallback) · --yes · --resume <id> (journaled).
    • Plan contract rejects unknown CLIs, bad tiers, dependency cycles, and unsafe task ids.
    • hosts/codex/AGENTS.md + hosts/grok/ultraswarm.md launchers.
  • claude -p brain adapter — the runner's brain defaults to your local authenticated claude CLI: no ANTHROPIC_API_KEY, no separate API billing, reusing your Claude Code auth. Falls back to the raw Anthropic API when claude isn't on PATH. Override with ULTRASWARM_BRAIN=claude-cli | anthropic-api. Live-smoked against claude 2.1.175.
  • package.json + deps (@anthropic-ai/sdk, ajv); CI runs npm ci; validate.sh check [12] parses bin/+lib/.
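
To make the plan contract and the dependency-wave step concrete, here is a minimal sketch of how such checks could look. It is not the code in lib/: the field names (tasks, id, cli, tier, deps), the allowed CLI and tier values, and the id pattern are all assumptions chosen for illustration.

```javascript
// Illustrative only: field names, allowed CLIs, tier labels, and the id
// pattern are assumptions, not ultraswarm's actual plan schema.
const KNOWN_CLIS = new Set(["claude", "codex", "grok"]);
const KNOWN_TIERS = new Set(["small", "medium", "large"]);
const SAFE_ID = /^[a-z0-9][a-z0-9_-]{0,63}$/;

// Returns a list of human-readable problems; an empty list means the plan
// passed these checks. Cycles are detected separately by toWaves().
function validatePlan(plan) {
  const errors = [];
  const ids = new Set();
  for (const task of plan.tasks) {
    if (typeof task.id !== "string" || !SAFE_ID.test(task.id)) {
      errors.push(`unsafe task id: ${JSON.stringify(task.id)}`);
    }
    if (ids.has(task.id)) errors.push(`duplicate task id: ${task.id}`);
    ids.add(task.id);
    if (!KNOWN_CLIS.has(task.cli)) errors.push(`unknown cli: ${task.cli}`);
    if (!KNOWN_TIERS.has(task.tier)) errors.push(`bad tier: ${task.tier}`);
  }
  for (const task of plan.tasks) {
    for (const dep of task.deps ?? []) {
      if (!ids.has(dep)) errors.push(`${task.id} depends on unknown task ${dep}`);
    }
  }
  return errors;
}

// Groups task ids into waves: every task in a wave depends only on tasks
// from earlier waves. Throws if no task is ready, which means a cycle
// (or a dependency on a task that is not in the plan).
function toWaves(plan) {
  const remaining = new Map(
    plan.tasks.map((t) => [t.id, new Set(t.deps ?? [])])
  );
  const waves = [];
  while (remaining.size > 0) {
    const ready = [...remaining.keys()].filter(
      (id) => remaining.get(id).size === 0
    );
    if (ready.length === 0) {
      throw new Error(
        `dependency cycle among: ${[...remaining.keys()].join(", ")}`
      );
    }
    for (const id of ready) remaining.delete(id);
    for (const deps of remaining.values()) {
      for (const id of ready) deps.delete(id);
    }
    waves.push(ready);
  }
  return waves;
}
```

Under these assumptions, a plan would be validated first and split into waves only if validatePlan returns no errors; tasks within one wave have no dependencies on each other and could run concurrently.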

Fixed

  • Command-injection hardening (two security reviews): git plumbing on plan-derived values uses execFileSync + argv + --; task ids charset-validated at the boundary.
  • Brain tier→model-id resolution (caught by the final review): QA/judge/lens calls resolve tier labels to real model ids before hitting the brain.
  • README accuracy pass + concrete Codex/Grok/shell run instructions.
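
The command-injection fix above combines three ideas: validate ids at the boundary, pass arguments as an argv array (no shell), and put `--` before any plan-derived value so git cannot read it as an option. The sketch below shows that pattern; the id pattern, the swarm/ branch prefix, and the choice of `git show-ref` are assumptions for illustration, not taken from the ultraswarm source.

```javascript
// Hypothetical id pattern and branch naming. The release notes only state
// that ids are charset-validated and git is invoked with argv plus "--".
const SAFE_TASK_ID = /^[a-z0-9][a-z0-9_-]{0,63}$/;

// Reject anything outside the allowed charset before it reaches git.
function assertSafeTaskId(id) {
  if (typeof id !== "string" || !SAFE_TASK_ID.test(id)) {
    throw new Error(`unsafe task id: ${JSON.stringify(id)}`);
  }
  return id;
}

// Build the argv for `git show-ref`. The "--" ends option parsing, so the
// ref that follows is always treated as a ref name, never as a flag.
function showRefArgs(taskId) {
  const ref = `refs/heads/swarm/${assertSafeTaskId(taskId)}`;
  return ["show-ref", "--verify", "--quiet", "--", ref];
}

// execFileSync runs git directly with an argv array; no shell is involved,
// so metacharacters like ";" or "$()" are never interpreted.
// The dynamic import keeps this snippet loadable as ESM or CommonJS.
async function branchExists(taskId, cwd = process.cwd()) {
  const { execFileSync } = await import("node:child_process");
  try {
    execFileSync("git", showRefArgs(taskId), { cwd, stdio: "ignore" });
    return true;
  } catch {
    return false; // non-zero exit: ref missing, or git could not run
  }
}
```

Validation and `--` overlap on purpose in this sketch: the charset check already rejects ids that start with `-`, and `--` still protects the call if the pattern is ever loosened.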

Built with TDD through subagent-driven development (18 tasks, plus hardening and 2 review-caught fixes). 83 tests, validate.sh 12/12, proof-of-life verified end-to-end.