Interpretability with jet expansions of residual networks and transformers.
Code for the paper Decomposing LLM Computation with Jets (ICLR 2026).
```shell
# Standard install
uv sync
# Development install (includes pytest)
uv sync --group dev
# Examples install (includes jupyter, matplotlib, seaborn)
uv sync --group examples
# All groups
uv sync --all-groups
# Activate the environment, or prefix commands with `uv run` (e.g. `uv run pytest`)
source .venv/bin/activate
```

Carving a two-block residual network into four explicit input→output paths (this implements Sec. 4.2 of the paper):
```python
import torch
import jex

lm = jex.toy_two_layer_rn(d=32)
x0 = lm.residual_stream(0)  # Enc(z)
x1 = lm.layer_gamma(0)      # γ₁(Enc(z))

# Step 1: expand blk2 at {x0, x1} → 4 sub-streams (γ₂ and identity terms per center)
inner = jex.expand_lm(lm, layer=2, centers=[x0, x1], order=1)

# Step 2: expand the decoder at the inner sub-streams → paths in logit space
# `expand_lm` is composable: expansions of the previous operation can be centers of a new one
outer = jex.expand_lm(lm, layer=lm.depth + 1, centers=inner.terms, order=1)

# Done: we have functionally "expanded" the model into 4 input-to-output paths
assert len(outer.terms) == 4

# Now we can compute these paths on any input
z = torch.randint(0, lm.vocab_size, (1, 8))  # (batch, seq_len)
paths, remainder = outer.expansions_and_remainder(z, with_unembedding=True)
```

Besides this toy expansion, here are example notebooks for some applications:
- Jet lenses — iterative and joint jet lenses on GPT-2/GPT-Neo
- Jet bigrams — token transition probabilities via embedding and MLP paths
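Under the hood, first-order jet terms reduce to Jacobian-vector products. As a minimal, self-contained sketch (the toy nonlinearity and sizes here are illustrative, not part of the `jex` API), `torch.func.jvp` yields the first-order expansion of a block around a chosen center:

```python
import torch
from torch.func import jvp

torch.manual_seed(0)
W = torch.randn(4, 4) * 0.1

# A toy smooth "block" acting on the residual stream.
def gamma(x):
    return torch.tanh(x @ W)

c = torch.randn(4)             # expansion center
x = c + 0.01 * torch.randn(4)  # input close to the center

# First-order jet of gamma at c, evaluated in direction (x - c):
# gamma(x) ≈ gamma(c) + J_gamma(c) @ (x - c)
val, tangent = jvp(gamma, (c,), (x - c,))
first_order = val + tangent

# The remainder is O(‖x - c‖²), so the approximation is tight near c.
assert torch.allclose(gamma(x), first_order, atol=1e-4)
```

The expansion is exact for affine maps; for nonlinear blocks, the gap between `gamma(x)` and `first_order` is the per-path remainder the library tracks.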
The package provides:

- a PyTorch implementation of the jet operator via `jvp` (Jacobian-vector product);
- a composable `jet_expand` algorithm. This comes in two versions:
  - a generic version for any `Tensor -> Tensor` callable;
  - a specialised version for residual nets/transformers and expansions around block non-linearities, closely aligned with Algorithm 1 from the paper;
- loaders and abstractions for some HF models (gpt2, gpt neo, llama, ...); extensible to other models (contributions welcome!);
- iterative and joint jet lenses;
- jet bigrams: `embedding_decoder` and `embedding_mlp_decoder` paths;
- example notebooks for jet lenses and jet bigrams.
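As a toy illustration of the bigram idea (the matrices below are random stand-ins, not weights loaded from a real model): the `embedding_decoder` path skips every block, so its logits are just the token embeddings pushed through the unembedding, and each row scores candidate next tokens:

```python
import torch

torch.manual_seed(0)
vocab, d = 10, 8
W_E = torch.randn(vocab, d)  # toy token-embedding matrix (stand-in)
W_U = torch.randn(d, vocab)  # toy unembedding/decoder matrix (stand-in)

# Embedding→decoder path: logit of token j following token i,
# ignoring every transformer block (the "bigram" view).
bigram_logits = W_E @ W_U                  # (vocab, vocab)
bigram_probs = bigram_logits.softmax(-1)   # row i: transition distribution from token i

assert bigram_probs.shape == (vocab, vocab)
assert torch.allclose(bigram_probs.sum(-1), torch.ones(vocab))
```

The `embedding_mlp_decoder` variant additionally routes the embeddings through MLP paths before decoding; see the jet-bigrams notebook for the real-model version.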
The package aims to provide the core algorithms and applications of the paper, favouring clarity and generality over model-specific optimization. It does not reproduce all experiments in the paper, nor does it include jet trigrams, since those were based on an older version of the code that included model-specific optimizations. If you're interested in those, please open an issue.
```bibtex
@inproceedings{chen2026decomposing,
  title     = {Decomposing {LLM} Computation with Jets},
  author    = {Chen, Yihong and Xu, Xiangxiang and Stenetorp, Pontus and Riedel, Sebastian and Franceschi, Luca},
  booktitle = {The Fourteenth International Conference on Learning Representations},
  year      = {2026},
  url       = {https://openreview.net/pdf?id=u6JLh0BO5h}
}
```
