Skip to content

ultraswarm v2.2.0 — Behavioral CI + Machine-Readable Gates

Choose a tag to compare

@fubak fubak released this 11 Jun 12:15
· 70 commits to main since this release

ultraswarm v2.2.0

A small, sharp release: the orchestration logic is now behaviorally tested in CI, and the validator speaks JSON. Both additions were produced or hardened by the swarm itself.

What's new

🧪 Workflow behavior harness (CI check [11])

scripts/workflow-harness.test.mjs — 16 node:test cases that extract the actual Workflow JS from SKILL.md and run it with mocked agent primitives, covering model-tier routing, adaptive QA depths, quorum and critical-refutation rules, tier escalation, exhaustion/tombstones, task immutability, and the dependency-wave guard. The embedded orchestration logic is now behaviorally tested on every push, not just parse-checked — a QA-gate regression breaks CI before it can burn tokens in a live run.

📋 validate.sh --json

Emits per-check results as a JSON array of {check, name, pass, detail} for CI dashboards and tooling; default output and exit codes are unchanged. Built by the swarm (grok, 2 attempts): the routine-tier QA review rejected attempt 1 for unescaped node -e interpolation and newline-unsafe JSON escaping — both real bugs — and attempt 2 fixed them with JSON.stringify-based escaping.

📚 README rewritten for v2.1+ reality

Every claim now traces to something measured or exercised in the live validation: dependency waves, both config override forms (flat + tiered), adaptive QA with the quorum/critical rules, the verified model-tier table with the model-ID-drift warning, measured cost calibration (the unmeasured "40–70% savings" claim is gone), the analyze mode, and a new troubleshooting entry for the hangs-on-bad-model-ID failure mode.

Upgrade

/plugin marketplace update ultraswarm

Then /reload-plugins or a new session. Full details in CHANGELOG.md.