BioSymphony CryoCore

Cryo-EM maps to structural insight for scientific agents: map/model review, density support, figures, states, and local or cloud compute lanes.

CryoCore helps an agent inspect cryo-EM maps and models, summarize what the density supports, plan reproducible structural figures, compare deposited states, and prepare compute lanes for real cryo-EM tools. It works with the agent stack you already use: Symphony driving Codex workers, Linear tickets driving Claude workers, Claude Code or Codex CLI in your terminal, or your own orchestration setup.

The repo supplies skill instructions, prompt fixtures, JSON Schema contracts, tool-lane records, provider launch templates, and local validators your agents can read directly and act on. The same workflow shape runs on a laptop with public accessions or on RunPod, AWS Batch, SSH/HPC, neocloud VMs, and other providers you already use. The rigor stays in the contracts so researchers and agents can move faster without losing provenance, data boundaries, or claim discipline.

How You Use It

Point your agent at this repo and hand it a cryo-EM goal. The agent reads AGENTS.md, the relevant skill under skills/, and the schemas under modules/schemas/. It can fetch public-accession metadata when the workflow calls for it, inspect map/model inputs, draft figure plans, compare states, prepare provider lanes, and return a review package with methods, provenance, artifacts, caveats, and next steps.

You stay the principal. You set the goal, you open gates that need human authorization (paid GPU time, license acceptance, raw-data access, claim escalation), and you review what your agent produces. The commands throughout this README are what your agent runs on your behalf. You can run them yourself when you want to verify a step or explore the repo directly.
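
The gate model above can be pictured as a small check: an action lists the gates it needs and proceeds only once the operator has opened every one. The following is a minimal Python sketch, not CryoCore's implementation. The four gate names mirror the examples in the paragraph above; the `Gate` enum, the `missing_gates` helper, and the example action are hypothetical and do not come from the repo's schemas.

```python
from enum import Enum


class Gate(Enum):
    """Human-authorization gates named in this README (identifiers are illustrative)."""
    PAID_GPU_TIME = "paid_gpu_time"
    LICENSE_ACCEPTANCE = "license_acceptance"
    RAW_DATA_ACCESS = "raw_data_access"
    CLAIM_ESCALATION = "claim_escalation"


def missing_gates(required, opened):
    """Return the gates an action needs that the operator has not opened, sorted by value."""
    return sorted(set(required) - set(opened), key=lambda gate: gate.value)


# Hypothetical action: a GPU stage that needs paid compute and a tool license.
required = {Gate.PAID_GPU_TIME, Gate.LICENSE_ACCEPTANCE}
opened = {Gate.LICENSE_ACCEPTANCE}

blocked_on = missing_gates(required, opened)
if blocked_on:
    print("blocked; waiting on:", [gate.value for gate in blocked_on])
else:
    print("all gates open; agent may proceed")
```

Keeping the check as a set difference means the agent can report exactly which authorization is outstanding instead of a bare yes or no.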

What Your Agent Can Do

The pieces an agent reads and acts on while doing real cryo-EM work:

Map/model review starts from public EMDB/PDB accessions or operator-declared inputs and asks what the density, model, fit metrics, and caveats support.
Figure and state workflows route ChimeraX, Mol*, Coot, PyMOL, Blender, heterogeneity, and comparison work into reproducible figure or review outputs.
Cryo-EM lane modules describe real stages: raw movies, corrected micrographs, particles, maps, models, figures, and state-review artifacts. See modules/lane-modules/raw-to-map.v1.json, map-to-model.v1.json, and figure-dossier.v1.json.
Tool posture docs name which cryo-EM tools fit which lane and under what license terms (RELION, MotionCor3, Warp/M, Topaz, cryoDRGN, ModelAngelo, Coot, Phenix, ChimeraX, and more in references/software-registry.yaml). The agent picks tools from this catalog rather than guessing.
Provider profiles for RunPod, AWS Batch, AWS EC2, SSH/HPC Slurm, neocloud GPU pod (e.g. Lambda), generic cloud GPU VM, and local workstation give the agent a shape for launching those tools on real GPUs, with budget and cleanup gates baked in. Users with custom compute can author their own profile against the same schema. See Compute Backends.
Schemas, ledgers, and validators type the intermediate outputs and check artifacts, hashes, cost records, cleanup proof, and claim boundaries so a follow-on agent or reviewer can pick up the work without re-deriving context.

The review outputs, figure manifests, provider plans, and issue waves under Use Cases show how those pieces come together for real goals.

Core Capabilities

Capability	What it gives your agents
Map/model and density review	Hand an agent EMDB/PDB IDs or operator-declared inputs and get back summaries of maps, models, density support, fit metrics, caveats, and follow-up work.
Figure and state workflows	Prepare reproducible structural figures, renderer routes, comparison axes, and heterogeneity or conformational-state review plans.
Real cryo-EM tool routing	RELION, MotionCor3, Warp/M, Topaz, cryoDRGN, ModelAngelo, Coot, Phenix, ChimeraX, Mol*, PyMOL, and related tools stay mapped to lanes, licenses, and runtime boundaries.
Local or cloud, same shape	RunPod, AWS Batch, SSH/HPC, neocloud, generic cloud VM, or laptop CPU. Provider profiles, stage contracts, launch prep, and budget/cleanup gates stay uniform across them.
Works with the harness you already use	Symphony, Linear with Claude or Codex workers, Claude Code, Codex CLI, or your own orchestration. The skill pack, schemas, and validators are harness-agnostic.
Trust layer for agent work	Artifacts, hashes, validation outputs, cost records, cleanup proof, provenance, and claim boundaries are checked before a result is treated as reliable.

Every tool the agent picks routes through one of three license lanes: open, watch, or runtime-gated.

What You Can Do With It

Use case	What CryoCore gives you
Point an agent at a cryo-EM repo	Skills, prompts, docs, schemas, and validation commands that turn loose requests into executable structural review workflows.
Review EMDB/PDB accessions	Input audits, map/model summaries, density-support checks, figures, provenance, caveats, and bounded claims.
Prepare figure or state work	Renderer routes, figure manifests, methods/provenance text, heterogeneity comparison axes, and reproducibility notes.
Prepare cloud or HPC execution lanes	Provider profiles, stage contracts, launch-request prep, budget gates, cleanup requirements, and artifact expectations.
Run agent issue waves	Tracker-ready templates, labels, DAGs, worker outcome blocks, and reference checks.
Review a provider run	Confirms artifacts, hashes, cost records, cleanup proof, and allowed claims before a stage is treated as complete.
Track cryo-EM tool posture	Open/watch/gated registry records with license and image-packaging boundaries.

See Workflow Blueprints for how to choose a path, Goal Orchestration for /goal-style agent setup, and Use Cases for copyable prompts.

Choose Your Path

I am...	Start here	First command
New to CryoCore	Public Quickstart	`make demo-local`
Want a guided walk-through	Tour	`make demo-local`
Pointing an agent at the repo	Agent Quickstart	`make skill-check`
Turning a broad goal into work	Goal Orchestration	`make goal-brief-check`
Choosing a workflow	Workflow Blueprints	`make docs-link-check`
Reusing patterns elsewhere	Adoption Guide	`make docs-link-check`
Preparing cloud resources	Compute Backends	`make provider-check`
Planning Linear issue waves	Tracker Orchestration	`make issue-check`
Checking a provider run	Provider Run Review	`make provider-closeout-check`
Preparing a public switch	Public Release	`make release-check`

Workflow Chooser

The Claim ceiling column uses CryoCore's claim ladder. See Claim Levels for what candidate, processed, validated, and publishable mean.

Starting point	Goal	First command	Side effects	Expected output	Claim ceiling
New checkout	See the repo work	`make demo-local`	public RCSB/mmCIF fetch, writes `.runtime/`	HTML report, figures, manifest, claim boundaries	`processed` demo evidence
EMDB/PDB IDs	Plan or build a map/model review	Map/Model Dossier	public metadata/artifact fetch only when commanded	input audit, summaries, figures, provenance, caveats	`processed` or `candidate`
Cloud/HPC idea	Prepare provider work	`make provider-check`	no provider mutation	provider profile, gates, launch-request plan	`candidate` until artifacts are joined
Linear campaign	Split work for agents	`make issue-check`	no network or provider mutation	issue DAG, dependencies, labels	`candidate`
Fetched run artifacts	Decide if run really succeeded	`make provider-closeout-check`	local fixture check only	blockers, hashes, cost, cleanup, claim level	evidence-dependent
Public switch	Check publishability	`make release-check`	local checks and secret scan	release report and blockers	repo readiness only
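
One way to read the Claim ceiling column is as an upper bound on what a workflow's output may assert. The sketch below is illustrative Python under stated assumptions: the four level names come from this README, but the strict ordering candidate < processed < validated < publishable is assumed here, and `ClaimLevel` and `cap_claim` are hypothetical names. Claim Levels remains the authoritative definition.

```python
from enum import IntEnum


class ClaimLevel(IntEnum):
    """Claim ladder levels named in this README; the numeric order is an assumption."""
    CANDIDATE = 1
    PROCESSED = 2
    VALIDATED = 3
    PUBLISHABLE = 4


def cap_claim(requested, ceiling):
    """Return the requested level, lowered to the workflow's ceiling if it exceeds it."""
    return min(requested, ceiling)


# An agent asks to report a 'validated' result from a workflow whose ceiling is 'processed'.
allowed = cap_claim(ClaimLevel.VALIDATED, ClaimLevel.PROCESSED)
print(allowed.name.lower())
```

With these inputs the sketch prints `processed`: the request is lowered to the ceiling and is never raised above what the workflow supports.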

Five-Minute Start

These commands set up the toolkit and run the first demo. Your agent will execute them once it is pointed at the repo. You can also run them yourself to confirm the demo works on your machine before handing the keys to an agent.

python3 -m pip install -r requirements-dev.txt
make demo-local

This fetches public RCSB/mmCIF data and writes the output under ignored .runtime/. Raw movies, maps, half-maps, model weights, private data, license files, and gated tools stay outside the demo.

Then inspect:

.runtime/t2r14-open-dossier/artifacts/report.html
.runtime/t2r14-open-dossier/artifacts/claim_ledger.md
.runtime/t2r14-open-dossier/artifacts/dossier_manifest.json

What the first run gives you:

Artifact	Why it matters
`report.html`	A human-readable review page with public inputs, figures, and methods.
`claim_ledger.md`	Claim boundaries and caveats, so an agent cannot turn a summary into unsupported mechanism claims.
`dossier_manifest.json`	Machine-readable inputs, artifacts, provenance, and review state.
`runpod-execution.tar.gz`	Portable artifact bundle shape used later by provider review.

No run yet? Inspect the static sample shape in T2R14 Open Dossier Preview.

Expected success looks like:

{
  "ok": true,
  "run_id": "cryocore-demo-t2r14-open-dossier"
}

Run the local public release gate:

make release-check

Prefer explicit commands?

python3 scripts/cryocore/t2r14_open_dossier.py --out .runtime/t2r14-open-dossier --json
python3 -m json.tool .runtime/t2r14-open-dossier/status.json
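
If you would rather have a script confirm the demo result than read the JSON by eye, a short check along these lines may help. It is a sketch that assumes `status.json` carries the `ok` field shown under "Expected success looks like"; the `demo_succeeded` helper is hypothetical and is not shipped in the repo.

```python
import json
from pathlib import Path


def demo_succeeded(status_path):
    """Return True only when the status file exists and reports "ok": true."""
    path = Path(status_path)
    if not path.is_file():
        return False
    status = json.loads(path.read_text(encoding="utf-8"))
    return status.get("ok") is True


if __name__ == "__main__":
    # Path taken from the explicit commands above.
    status_file = Path(".runtime/t2r14-open-dossier/status.json")
    print("demo ok" if demo_succeeded(status_file) else "demo not confirmed")
```

A missing file is treated as "not confirmed" rather than an error, so the check is safe to run before the demo has been executed.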

Agent Prompt

Paste this into your coding agent from the repo root. This is the canonical prompt; docs/agent-quickstart.md uses the same one.

Use the CryoCore skill pack in this repo. Stay local. Read AGENTS.md,
README.md, docs/goal-orchestration.md, docs/workflows.md, docs/use-cases.md,
and the relevant skill under skills/. Build a useful cryo-EM map/model review,
figure workflow, state comparison, provider plan, or evidence package. Keep
private data, secrets, raw or heavy artifacts, provider logs, model weights,
and license files out of git and public outputs. Run the smallest relevant
validators first, then `make release-check` when the task is release-readiness.
Report exact artifacts, claim levels, validation results, and residual risks.

More agent patterns are in Agent Quickstart and Agent Task Prompts.

Copying this into another repo? Start with Adoption Guide.

Using CryoCore With Your Agents

CryoCore is built so any agent stack can drive it. A few patterns teams already use:

Pattern	How it runs	Where to start
Symphony with Codex workers	Symphony dispatches autonomous Codex (gpt-5.x) workers against Linear issues; each worker reads the relevant CryoCore skill and reports an outcome block.	templates/symphony-cryocore.WORKFLOW.md, docs/linear-orchestration.md
Linear with Claude workers	Linear holds the campaign DAG and labels; Claude Code or Claude API agents pick up issues, read the skill pack, and produce review-ready artifacts.	templates/linear-issue.md, docs/agent-quickstart.md
Claude Code or Codex CLI in your terminal	One agent reads `AGENTS.md`, the chosen skill, and the relevant validators, then drives a single mission end to end.	Agent Prompt, skills/
Your own orchestration	Every contract is plain JSON Schema or Markdown. Wire CryoCore into the orchestration you already run.	docs/agent-skill-guide.md, modules/schemas/

The same skill pack supports a single-agent session on a laptop, a multi-day campaign with dozens of issues running in parallel, and a cloud or HPC dispatch with provider-neutral launch contracts and artifact proof.

Local or cloud, the contracts stay the same:

Laptop or workstation: public-accession demos, CPU-only checks, claim-boundary drafts.
RunPod, AWS Batch, SSH/HPC, neocloud VMs: launch manifests, stage contracts, fetched-artifact reports, cost and cleanup proof.
Mixed: plan and validate locally, then hand the same campaign to cloud workers when GPU time is ready.

Install Model

Use CryoCore as a source checkout. Clone or copy it, install requirements-dev.txt, and run make targets or python3 scripts/cryocore/*.py commands from the repository root. A pip-installable package is on the roadmap.

Core Workflow

Every mission follows the same shape from starting goal to useful structural output. Your agent carries each step. You step in at the gates.

In detail, each mission:

Declares public accessions or operator-provided inputs.
Audits inputs and data boundaries before work starts.
Inspects maps, models, density support, states, or figure needs.
Routes tools through open, watch, or runtime-gated lanes.
Tracks stage progress, artifacts, hashes, cost, cleanup, and claim level.
Emits a review output with figures, methods, provenance, caveats, and next steps.

The same shape works at three scales:

local: small public demos and validators only
cloud: operator-gated RunPod, AWS, SSH/HPC, or compatible provider contracts
tracker: Linear-style issue waves with explicit dependencies and review gates

Use Workflow Blueprints to pick the right scale before dispatching work.

CryoCore separates experimental cryo-EM processing from AI design runtimes. The design side can consume CryoCore outputs while RELION, Warp/M, MotionCor3, ModelAngelo, ChimeraX, Coot, and validation tooling keep their own images and dependency surfaces apart from RFdiffusion, Boltz, Chai, ProteinMPNN, and screening stacks.

The Trust Layer

CryoCore keeps the evidence chain explicit underneath the scientific workflow so an agent can carry work across days or weeks and hand it to a human reviewer with nothing missing:

the public accession, operator dataset, or derived artifact used as input
the tool lane that was planned, gated, or executed
the stage that actually completed
the artifacts that were produced and hashed
the licenses or use-context approvals required
the claims or next steps supported by the evidence

A stage becomes trustworthy when the artifacts are joined to the declared inputs and the validation outputs, checksums, cost records, cleanup proof, and claim boundaries are all in place.
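
The checksum part of that chain can be illustrated with a short Python sketch. It assumes a hypothetical mapping of relative artifact paths to expected SHA-256 digests; the real manifest and ledger shapes are defined by the schemas under modules/schemas/ and may differ. `sha256_of` and `verify_artifacts` are illustrative names, not repo functions.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=1 << 20):
    """Stream a file through SHA-256 so large artifacts are never fully loaded into memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_artifacts(root, expected):
    """Compare files under root with expected {relative_path: sha256}; return a list of problems."""
    problems = []
    for rel_path, want in sorted(expected.items()):
        target = Path(root) / rel_path
        if not target.is_file():
            problems.append(f"missing: {rel_path}")
        elif sha256_of(target) != want:
            problems.append(f"hash mismatch: {rel_path}")
    return problems
```

An empty list means every declared artifact is present and matches its recorded digest. Anything else is a blocker a reviewer can act on directly.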

The shape of one mission, at a glance:

   public accessions               operator data
        |                                |
        +----------------+---------------+
                         |
                         v
   +-------------------------------------------+
   |  input audit and resource mode            |
   +---------------------+---------------------+
                         |
                         v
   +-------------------------------------------+
   |  tool lane: open, watch, runtime-gated    |
   +---------------------+---------------------+
                         |
                         v
   +-------------------------------------------+
   |  stage: prep mode or real mode            |
   +---------------------+---------------------+
                         |
                         v
   +-------------------------------------------+
   |  artifacts + hashes + cost records        |
   +---------------------+---------------------+
                         |
                         v
   +-------------------------------------------+
   |  validation: schemas, contract self-check,|
   |  wwPDB rollup                             |
   +---------------------+---------------------+
                         |
                         v
   +-------------------------------------------+
   |  review output: figures, caveats,         |
   |  claim boundaries, next steps             |
   +-------------------------------------------+

Repo Layout

Full file listing

campaigns/        CryoCore campaign contracts
containers/       Public image posture and runtime separation notes
demos/            Public cryo evidence demos and review readouts
docs/             Durable architecture, split, licensing, and data-policy docs
examples/         Tiny example manifests
modules/          Image, lane, and provider contracts
references/       Machine-readable tool registry
scripts/cryocore/ Validators and local utilities
skills/cryocore/  Repo-local skill instructions
templates/        Tracker issue and operator-gate templates
tests/            Lightweight validator tests

High-ROI Assets

modules/lane-modules/raw-to-map.v1.json, map-to-model.v1.json, and figure-dossier.v1.json: scientific lane shapes for processing, model review, and figures.
scripts/cryocore/t2r14_open_dossier.py, poltheta_map_model_dossier.py, and structure_jury_dossier.py: runnable public-accession review demos.
modules/schemas/: provider-run, workflow-run, claim-ledger, figure-manifest, map-model-fit, artifact-index, cost, cleanup, and accession metadata schemas.
modules/artifact-contracts/structure-dossier.v1.json: evidence maturity ladder and required artifacts.
runpod/stage-contracts/: stage contracts with progress-ledger requirements that close only when each stage is confirmed.
scripts/cryocore/provider_closeout_check.py: confirms artifacts, hashes, cost records, and cleanup proof are all in place before a provider run is treated as complete.
scripts/cryocore/contract_self_check.py: verifies that real provider results are backed by real evidence rather than mocks, fixtures, planned-only entries, or fallbacks.
scripts/cryocore/public_snapshot_check.py: scans a public snapshot for secrets, heavy cryo-EM artifacts, local paths, and private execution markers.
scripts/cryocore/runpod_scope_check.py: scans public bridge manifests, inline source bundles, public service scope, and prep-only gates.
scripts/cryocore/runpod_reference_check.py: confirms public entrypoints are present and resume commands are current.
docs/agent-skill-guide.md and docs/prompt-library.md: agent workflows and reusable prompt patterns.
docs/workflows.md: workflow selector for public accessions, agents, cloud resources, Linear issue waves, and provider review.
skills/cryocore-public-safety/SKILL.md: public-release review for privacy, secrets, provider risk, and claims.
docs/recipes/README.md: copyable workflows for release checks, metadata ledgers, demo runs, provider prep, and provider run review.
docs/validation-command-matrix.md and docs/failure-modes.md: command selection and post-run review of stage outcomes.
references/software-registry.yaml: machine-readable tool posture across open, watch, and runtime-gated cryo-EM tools.

Boundary With Structure Factory

CryoCore owns experimental evidence production and structural review. Structure Factory owns cross-lane orchestration, prediction/design, screening, and campaign synthesis.

The two repos intentionally duplicate a small set of shared posture records. ChimeraX is the clearest example: it belongs in CryoCore for map/model inspection and figure rendering, and it belongs in Structure Factory for design and atlas reports. Duplicating these records lets each repo keep a focused scientific runtime.

See Split Evaluation and Move/Duplicate Map.

Related Projects

Proteus: structural-biology skills for AI coding agents — PyMOL and ChimeraX automation, plus AlphaFold DB, RCSB PDB, UniProt, and Rosetta workflows. Pairs well with CryoCore when a mission needs hands-on molecular visualization or sequence/structure lookups alongside cryo-EM map/model review.

Public Demos

T2R14 Open Dossier: CPU-only public PDB/EMDB metadata and coordinate review.
Pol Theta Map/Model Dossier: public EMDB/PDB/wwPDB validation review shape for a map/model lane.
Structure Jury Dual Dossier: joins two public deposited-structure lanes into one claim-audited review.

Demo launch manifests are public scaffolds for the prep stage. Paid provider execution is operator-initiated, uses current credentials kept outside the repo, and is reviewed against fetched, hashed artifacts.

Quickstart

The fastest path is Public Quickstart. The agent-first path is Agent Quickstart.

Current Toolwatch

See Toolwatch 2026-05-27 for the source-backed shortlist refreshed on 2026-05-27: cryo-EM tools, repos, public data APIs, workflow/provenance helpers, and preprints. The prior Toolwatch 2026-05-15 remains as history. See Workflow Orchestration Provenance and Public Accession APIs for the recommended provenance and metadata-helper direction.

Local Commands

The menu your agent has available. Each command is read-only on local files unless noted in the linked docs. You can run any of them directly to verify what the agent is doing.

python3 scripts/cryocore/preflight.py --repo-root . --json
python3 scripts/cryocore/software_registry_check.py references/software-registry.yaml --json
python3 scripts/cryocore/fetch_public_accession_metadata.py --emdb EMD-43816 --pdb 9ASJ --out .runtime/public-accession-metadata.json
make module-check
make runpod-check
make runpod-scope-check
make issue-check
make contract-self-check
make release-check

All commands above run locally. They validate contracts, query public accession APIs when invoked, and write any output to ignored .runtime/. Provider dispatch, raw-data downloads, gated software installs, and GPU workloads are operator-initiated steps documented separately. See Data Policy.
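
If you want to chain several of these checks, one option is a small runner that stops at the first failing target. The sketch below is an assumption-laden illustration: the target names come from the list above, but the ordering, the `TARGETS` list, and the `run_targets` helper are hypothetical and not part of the repo.

```python
import subprocess

# Targets taken from the command list above; this particular order is an assumption.
TARGETS = ["module-check", "issue-check", "contract-self-check", "release-check"]


def run_targets(targets, run=subprocess.run):
    """Run `make <target>` for each target in order; return the first failing target or None."""
    for target in targets:
        result = run(["make", target], check=False)
        if result.returncode != 0:
            return target
    return None
```

Calling `run_targets(TARGETS)` from the repository root returns `None` when every target exits with status 0, and otherwise the name of the first target that failed. The `run` parameter exists so the loop can be exercised with a stand-in runner without invoking `make`.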

Run the public release gate:

make release-check

Status

Pre-alpha public release. CryoCore currently supports agent-guided map/model review on public accessions, figure and state workflow planning, provider preflight, contract validation, provider-run review templates and fixtures, Linear-style campaign planning, tool and license posture tracking, and claim-bounded structural evidence packets. The CPU-only T2R14 demo runs end to end on a laptop; paid provider lanes ship as prep-mode contracts that an operator executes outside the public repo. The rigor in the contracts is what lets agents move quickly on workflows that later touch expensive GPU compute, gated scientific tools, and heavy artifacts.

Documentation Map

Tour: fifteen-minute guided walk through the repo, with a paste-into-agent prompt at the end.
Public Quickstart: first commands and demo outputs.
Demos: three runnable public demos indexed by complexity and time.
Mission Catalog: menu of seed missions an agent can take on, sorted from smallest to largest.
Pol Theta Walkthrough: narrative end-to-end mission from broad goal to map/model review.
Agent Quickstart: copy-paste agent prompt and routing.
Workflow Blueprints: choose public-accession, agent, cloud, Linear, or run-review paths.
Use Cases: common workflows and copyable prompts.
Adoption Guide: how to reuse CryoCore patterns elsewhere.
Local Installation: source-checkout install model.
Agent Skill Guide: using this repo as a public skill pack.
Skill Installation: using or copying the skill pack locally.
Recipes: copyable workflow recipes.
Validation Command Matrix: validator selection and command side effects.
Failure Modes: triage guide for stage outcomes, privacy issues, and provider-risk situations.
Prompt Library: prompt patterns for agents and reviewers.
Demo Gallery: demo scope, artifacts, and claim boundaries.
Data Policy: data tiers and git boundaries.
Provider Execution Model: provider launch, evidence, and artifact-review model.
Compute Backends, Provider Readiness, and Tracker Orchestration: cloud-resource and Linear-style campaign workflow.
Claim Levels: claim ladder and downgrade triggers.
Schema Catalog and Module Catalog: contract inventory.
Privacy Threat Model: privacy and release-risk controls.
Troubleshooting: common validation failures.
Public Switch Checklist: local-to-public publishing checklist.
Glossary: public terms and internal orchestration vocabulary.
FAQ and Roadmap: community orientation and next milestones.
Governance and Maintainers: review and release ownership.
Agent Task Prompts: prompt fixtures for agents.
