This is the repository for the paper "What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes". Check out the blog post here!
Set up the environment with pip install -r requirements.txt, then run the evals.sh script to reproduce the key results from the paper. It runs all of the scripts in experiments, which are:
- all_curves.py is for the aesthetic Twitter plot of polysemanticity curves for multiple models
- sparsity.py is for the plot on the speed of sparsification
- fourth_norm.py is for the plot on the sparsity from noise
- collisions.py is for the plot on the number of polysemantic neurons
- correlations.py is for the plot on the correlation between start and end weights
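
If you prefer to drive the experiments from Python rather than from the shell script, the following is a minimal sketch of one way to do it. It is not the contents of evals.sh: it assumes each script can be run without arguments from the repository root, and the helper names `build_commands` and `run_all` are hypothetical, introduced only for this illustration. The script names and the experiments directory are taken from the list above.

```python
import subprocess
import sys
from pathlib import Path

# Script names as listed in this README.
SCRIPTS = [
    "all_curves.py",
    "sparsity.py",
    "fourth_norm.py",
    "collisions.py",
    "correlations.py",
]


def build_commands(experiments_dir="experiments", scripts=SCRIPTS):
    """Return one command (as an argument list) per experiment script.

    Assumption: the scripts need no command-line arguments.
    """
    root = Path(experiments_dir)
    return [[sys.executable, str(root / name)] for name in scripts]


def run_all(experiments_dir="experiments", scripts=SCRIPTS):
    """Run each script in order, stopping at the first one that fails."""
    for cmd in build_commands(experiments_dir, scripts):
        print("Running:", " ".join(cmd))
        # check=True raises CalledProcessError on a non-zero exit code.
        subprocess.run(cmd, check=True)
```

Calling `run_all()` from the repository root, with the requirements installed, would run the five scripts one after another using the current Python interpreter. If evals.sh passes flags or sets environment variables, this sketch does not reproduce them.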