A deterministic, offline compiler that turns a natural-language description of a bioinformatics analysis into a self-contained, agent-executable RO-Crate package — with a full-lifecycle conversational shell wrapped around the executing package.
The compiler classifies the intake, selects an archetype, builds a task DAG, emits a package, and an execution harness drives an agent (Claude Code, a shell script, anything callable with a package path) against the emitted DAG. The emitted package is an ECAA (Evidence-Carrying Analysis Artifact) — a typed RO-Crate that carries, alongside the analysis itself, the claims it supports, the evidence backing each claim, and the decision record that produced them. Conformance is enforced by an embedded ECAA validator that gates emission on a machine-checkable contract over those subgraphs.
| Component | Crate / dir | Role |
|---|---|---|
| Compiler | crates/{core, cli} |
Classifier → DAG → emitter. Synchronous, no LLM dependency. |
| Conversation shim | crates/conversation |
Closed tool vocabulary wraps the compiler. LLM is a UX shim only. |
| Chat server | crates/server |
Axum HTTP + SSE backend at /api/chat/* and /api/git/*. |
| Execution harness | crates/harness |
Loops an agent subprocess against ready tasks. Local / Mock / AWS / SLURM executors. |
| ECAA validator | crates/{ecaa-conformance, ecaa-types} + docs/ecaa-spec/ |
Emits + validates the ECAA conformance contract. |
| Web UI | ui/ |
React 18 + Vite + TypeScript chat surface. |
Linux x86-64 is the primary supported target. macOS works for dev. Windows requires WSL2.
# 1. System tools
sudo apt-get install -y build-essential pkg-config libssl-dev mold git curl # Debian/Ubuntu
# or: sudo dnf install -y @development-tools openssl-devel mold git curl # Fedora/RHEL
# or: brew install mold openssl@3 pkg-config # macOS
# 2. Rust toolchain (auto-installs the pinned channel from rust-toolchain.toml)
curl --proto '=https' --tlsv1.2 -sSf https://sh.rustup.rs | sh -s -- -y
source "$HOME/.cargo/env"
# 3. Cargo dev tools
cargo install --locked cargo-nextest cargo-hakari
# 4. Node.js 20+ (for the UI + Playwright)
# install per your platform; verify: node --version # ≥ 20
# 5. Build everything
make build # Rust workspace (debug)
make install # binaries to ~/.cargo/bin
(cd ui && npm install)make doctor prints toolchain readiness; make help lists targets.
Two terminals:
# terminal A — chat server on :3000
make dev-server
# terminal B — Vite dev UI on :5173 (proxies /api/* to :3000)
make dev-uiThe chat surface boots in offline mode without an API key (the UI renders but assistant turns are mocked). For LLM-mediated chat:
export ECAA_ANTHROPIC_API_KEY=<your key>
make dev-serverSmoke-test the compiler against a bundled scenario:
ecaa-workflow intake \
--input testdata/scenarios/01-bulk-rnaseq-ibd/request.md \
--output /tmp/ibd-package
ecaa-workflow dag --package /tmp/ibd-packagemake test # cargo test --workspace
make test-ui # Vitest + axe a11y
make check # test + tsc --noEmit
make e2e-playwright # mocked Playwright tier- Compiler is synchronous.
tokiois allowed inserver,conversation, andcli(forserveonly). Never incoreorharness. Harness usesureq(sync). - Deterministic output. Emitted packages are byte-reproducible. Use
BTreeMap, notHashMap. Avoid timestamps and random IDs outsideuuid_short(). - LLM as UX shim. Closed tool vocabulary (
Tool::COUNTasserted at compile time). High-impact actions are gated by deterministic server state, not LLM inference. - Confirmation discipline.
emit_packagereturnsPreconditionFailureunlesssession.user_confirmed == true. The button click is a server-side action the LLM observes only viaget_session_state. - ECAA conformance. Every emitted package carries the eight ECAA subgraph sidecars (claims, evidence, decisions, equivalence, ...) which the validator checks against the JSON Schemas in
docs/ecaa-spec/subgraph-schemas/.
config/ is the source of truth for modalities, archetypes, atoms, compute profiles, gene panels, plot affordances, and downstream-policy contracts. config/archetypes/ and config/stage-atoms/ carry their own READMEs.
User guide: USERS.md. Contributor guide: CONTRIBUTING.md. ECAA spec: docs/ecaa-spec/.
Apache-2.0 — see LICENSE.