Orthogonal Subspaces, Not Serial Stages: Mechanistic Interpretability of Emotion Processing in Transformers - Replication Repo
affective-computing sae transformer-architecture mechanistic-interpretability transformerlens ai-psychology representation-geometry
Updated Mar 29, 2026 - Python