interp-lab v1.0.0
Stable release candidate for interp-lab: criterion-driven feature discovery, SAE training, causal intervention testing, attribution/path graph workflows, cross-model validation, and the local Studio workflow for browser-based use.
Highlights
- Stable package metadata and public API/schema contract for agent-friendly integrations.
- Full real-model demo sweep command with manifest input preflight, command execution evidence, artifact hashing, and release-gate integration.
- Verified real-model demo suite covering DistilGPT-2, tiny-GPT-2 SAE path patching, and Gemma 4 tool-call workflows on Modal.
- Gemma 4 Modal wrappers now generate local HTML reports and use a broader checked-in tool-call training dataset.
- Browser Studio, report viewing, graph exports, intervention workflows, and release docs are included in the stable release gate.
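The artifact hashing mentioned in the demo sweep highlight can be pictured with a minimal sketch. This is not interp-lab's implementation: the function names `sha256_of_file` and `hash_artifacts`, the choice of SHA-256, and the output shape are all assumptions made for illustration. It only shows the general idea of recording a digest per output file so a later gate can detect changed artifacts.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA-256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        # Read fixed-size chunks so large model artifacts are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def hash_artifacts(root: Path) -> dict[str, str]:
    """Map each file under root (relative POSIX path) to its SHA-256 digest.

    Hypothetical helper: the real sweep may record different fields.
    """
    return {
        p.relative_to(root).as_posix(): sha256_of_file(p)
        for p in sorted(root.rglob("*"))
        if p.is_file()
    }
```

Calling `hash_artifacts(Path("reports"))` on a reports directory would return one digest per file, which could then be compared against a previously stored mapping.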
Verification
- interp-lab demo-sweep --run --allow-external --out reports/real-model-demo-sweep.json --strict: passed for all 3 demos.
- interp-lab release-check --strict --out reports/release-check.json: READY, 13 pass, 0 blockers.
- Local python -m pytest: 183 passed.
- Local python -m build: built interp_lab-1.0.0 wheel and sdist.
- Local python -m twine check dist\*: passed.
- GitHub Actions CI for commit a9eb6a1: passed on Ubuntu, macOS, Windows, Python 3.10, 3.11, and 3.12.
Publishing this draft release will trigger the PyPI trusted-publishing workflow for interp-lab==1.0.0.