This repo is for running parameter decomposition on neural networks.
VPD paper (April 2026)
- Paper: https://www.goodfire.ai/research/interpreting-lm-parameters
- Branch: main
- W&B run for the paper: https://wandb.ai/goodfire/spd/runs/s-55ea3f9b
- Comparison CLTs/PLTs: https://github.com/bartbussmann/nn_decompositions/tree/vpd_paper
SPD paper (June 2025)
- Paper: https://arxiv.org/abs/2506.20790
- Branch: spd-paper
- Wandb report: https://wandb.ai/goodfire/spd-tms/reports/SPD-paper-report--VmlldzoxMzE3NzU0MQ
This project ships a web app for visualising and interpreting decompositions. You can point it at
any decomposed run, including ones we've already trained and stored on W&B (e.g. the canonical
goodfire/spd/runs/s-55ea3f9b linked above). At present, viewing a run still requires running the
harvest and autointerp post-processing stages yourself — these produce the artifacts the
app reads.
```
make install-app  # Install frontend dependencies (one-time)
make app          # Launch backend + frontend dev servers
```

See the app's README and CLAUDE.md for details.
`nano_param_decomp/` is a self-contained, single-file implementation of the
whole method. For brevity, it deliberately omits alternative loss/CI/sigmoid types and various logging.
From the root of the repository, run one of:
```
make install-dev  # Install the package, dev requirements, pre-commit hooks
make install      # Install the package only (`pip install -e .`)
```

Run an experiment locally with `pd-local <name>`, or on SLURM with `pd-run --experiments <name>`
(adds a git snapshot and a W&B view; also supports `--dp N`, `--cpu`, and `--sweep --n_agents N`). The
two main language-model decompositions (example invocations below):

- `pile_llama_simple_mlp-4L` — 4-layer Llama (MLP-only) on the Pile; the VPD paper run
  goodfire/spd/runs/s-55ea3f9b (config).
- `ss_llama_simple_mlp-2L` — 2-layer Llama (MLP-only) on SimpleStories; smaller and faster (config).
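For example (experiment names and flags exactly as documented above; the SLURM commands assume a cluster is already configured):

```
# Quick local run of the smaller SimpleStories decomposition
pd-local ss_llama_simple_mlp-2L

# SLURM run of the Pile decomposition with 4-way data parallelism
pd-run --experiments pile_llama_simple_mlp-4L --dp 4

# SLURM sweep with 8 agents
pd-run --experiments pile_llama_simple_mlp-4L --sweep --n_agents 8
```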
Other registered experiments (TMS, ResidualMLP, induction heads, GPT-2 / TinyStories variants) are
listed in `param_decomp/registry.py`. The `lm` experiment can decompose
any HuggingFace-loadable model whose target modules are `nn.Linear`, `nn.Embedding`, or
`transformers.modeling_utils.Conv1D`.
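As a quick, repo-independent sanity check of whether a model has such target modules, a minimal sketch (GPT-2 is just an example checkpoint, and the `Conv1D` import location varies across transformers versions):

```python
import torch.nn as nn
from transformers import AutoModelForCausalLM

try:  # Conv1D moved from modeling_utils to pytorch_utils in newer transformers
    from transformers.modeling_utils import Conv1D
except ImportError:
    from transformers.pytorch_utils import Conv1D

# GPT-2 stands in for any HuggingFace-loadable model.
model = AutoModelForCausalLM.from_pretrained("gpt2")

# Print every module the `lm` experiment could target for decomposition.
for name, module in model.named_modules():
    if isinstance(module, (nn.Linear, nn.Embedding, Conv1D)):
        print(f"{name}: {type(module).__name__}")
```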
After a decomposition has finished training, post-processing produces the artifacts the app reads:
component statistics, autointerp labels, dataset attributions, and graph-context interpretations.
Each stage is a separate CLI; `pd-postprocess` runs them all under one SLURM dependency graph from
a single config:

```
pd-postprocess param_decomp/postprocess/pile.yaml
```

The individual stages, with links to their docs:

- Harvest (`pd-harvest`) — collect activation examples, correlations, and token statistics for each component.
- Autointerp (`pd-autointerp`) — generate LLM interpretations of components from harvested examples. Requires `OPENROUTER_API_KEY`.
- Dataset attributions (`pd-attributions`) — compute component-to-component attribution strengths over the training distribution.
- Graph interpretation (`pd-graph-interp`) — context-aware component labels that combine attributions and correlations.
- Clustering (`pd-clustering`) — ensemble clustering of components.
Default batch sizes (256 for harvest and attributions) work for models like
`pile_llama_simple_mlp-4L`; tune via `--batch_size` / `--n_gpus` per stage.
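A sketch of tuning a single stage, assuming each stage CLI takes a config path the way `pd-postprocess` does (check each stage's docs above for its actual required arguments; only `--batch_size` and `--n_gpus` are documented here):

```
# Hypothetical: re-run harvest alone with a smaller batch and two GPUs
pd-harvest param_decomp/postprocess/pile.yaml --batch_size 128 --n_gpus 2
```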
Suggested VSCode/Cursor settings live in `.vscode/`. Copy `.vscode/settings-example.json` to
`.vscode/settings.json` to use them. We are unlikely to be able to take on new feature requests, though
issue reports are greatly appreciated!
Useful make targets:
```
make check     # Run pre-commit on all files (basedpyright, ruff lint, ruff format)
make type      # basedpyright only
make format    # ruff lint + format
make test      # Tests not marked `slow`
make test-all  # All tests
```