Public index of mechanistic-interpretability publications submitted via openinterp-mcp.
Each entry references a HuggingFace dataset (artifacts), a Zenodo DOI (citation), and a methodology-check verdict from the Causality Protocol (paper-6).
registry/
├── index.json ← machine-readable index (this is what openinterp.org reads)
├── schema.json ← JSON schema for entries
├── atlas/<year>/<sha>.json ← general atlas entries (probe results, observations)
├── replications/<paper>/<sha>.json ← replications of published papers
├── probebench/<sha>.json ← submissions to ProbeBench leaderboard
├── contributors/<handle>.json ← auto-generated per-contributor index
├── schemas/ ← OpenInterp Eval Standard v0.1 (Six Diagnostics)
│   ├── probe_card.json ← documents a probe + mandatory baselines
│   ├── causal_report.json ← output of Six Diagnostics protocol
│   ├── intervention_trace.json ← record of a steering/patching experiment
│   └── interp_card.json ← per-feature interpretation + autointerp provenance
└── failed-replications/ ← Failed-Replication Registry — claims walked back
    ├── README.md ← entry template + failure-mode taxonomy
    └── <slug>.md ← one entry per walked-back claim
- Researcher runs `openinterp-mcp` against their Colab session, gets a result
- Researcher invokes the `/publish` skill (or calls `openinterp_mcp.publish.publish()`)
- The publish pipeline:
  - Creates a HF dataset under `openinterp-community/atlas-<slug>`
  - Optionally creates a Zenodo deposit (free DOI)
  - Opens a PR against this repo with a new entry under `atlas/<year>/<sha>.json`
- CI (when wired up) runs the Causality Protocol on the submitted artifacts via Claude-Code-as-judge
- If methodology check passes (verdict ∈ {causal, weak-causal, epiphenomenal-* (honest)}), PR auto-merges
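The auto-merge rule in the last step can be sketched as a small predicate. This is a minimal illustration, not the registry's actual CI code: the function name `passes_methodology_check` is hypothetical, and treating any verdict of the form `epiphenomenal-<something>` as an acceptable honest report is an assumption about how the `epiphenomenal-*` wildcard is meant to be read.

```python
# Hypothetical sketch of the auto-merge gate; names are illustrative only.
PASSING_VERDICTS = {"causal", "weak-causal"}
EPIPHENOMENAL_PREFIX = "epiphenomenal-"  # assumed reading of "epiphenomenal-*"


def passes_methodology_check(verdict: str) -> bool:
    """Return True if a verdict would let the PR auto-merge."""
    v = verdict.strip().lower()
    if v in PASSING_VERDICTS:
        return True
    # Require a non-empty suffix after the prefix, e.g. "epiphenomenal-<kind>".
    return v.startswith(EPIPHENOMENAL_PREFIX) and len(v) > len(EPIPHENOMENAL_PREFIX)
```

Any verdict outside this set would leave the PR open for manual review rather than auto-merging.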
This registry is licensed under Apache-2.0. Individual entries carry their own license declarations (apache-2.0 / mit / cc-by-4.0 / cc0).
For the bootstrap phase (before the publish pipeline is fully wired), entries can be added by hand:
- Fork this repo
- Add a JSON file under the appropriate path, matching `schema.json`
- Run `python -m openinterp_mcp.publish.manifest --validate path/to/entry.json` locally
- Append a one-line summary to `index.json`
- Open a PR
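Before opening the PR, it can help to confirm that the new file sits under one of the entry paths from the registry layout and parses as JSON. The sketch below uses only the standard library; it does not replace the `--validate` step and does not check the entry against `schema.json`. The helper names `classify_entry_path` and `is_parseable_json`, the assumption that `<year>` is four digits, and the assumption that `<sha>` is a lowercase hex string are all illustrative guesses, not part of the registry specification.

```python
import json
import re
from pathlib import PurePosixPath
from typing import Optional

# Path patterns taken from the registry layout; the <year> and <sha> formats
# are assumptions (four digits, lowercase hex) made for this illustration.
_SHA = r"[0-9a-f]+"
_PATTERNS = {
    "atlas": re.compile(rf"atlas/\d{{4}}/{_SHA}\.json"),
    "replications": re.compile(rf"replications/[^/]+/{_SHA}\.json"),
    "probebench": re.compile(rf"probebench/{_SHA}\.json"),
}


def classify_entry_path(path: str) -> Optional[str]:
    """Return the entry kind for a repo-relative path, or None if it matches none."""
    normalized = PurePosixPath(path).as_posix()
    for kind, pattern in _PATTERNS.items():
        if pattern.fullmatch(normalized):
            return kind
    return None


def is_parseable_json(text: str) -> bool:
    """Return True if the text is syntactically valid JSON."""
    try:
        json.loads(text)
    except json.JSONDecodeError:
        return False
    return True
```

A path that returns `None` from `classify_entry_path` is probably in the wrong directory or has a malformed file name under these assumptions.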
See index.json for the live list.
Part of the OpenInterpretability ecosystem.