tmychow/incidental-polysemanticity

Incidental Polysemanticity

This is the repository for the paper "What Causes Polysemanticity? An Alternative Origin Story of Mixed Selectivity from Incidental Causes". Check out the blog post here!

Set up the environment with pip install -r requirements.txt, then run the evals.sh script to reproduce the key results from the paper. It runs all of the scripts in the experiments directory, which are:

  • all_curves.py generates the aesthetic Twitter plot of polysemanticity curves for multiple models
  • sparsity.py generates the plot of the speed of sparsification
  • fourth_norm.py generates the plot of the sparsity from noise
  • collisions.py generates the plot of the number of polysemantic neurons
  • correlations.py generates the plot of the correlation between start and end weights
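
If you would rather drive the experiments from Python than from evals.sh, the list above can be sketched as a small runner. This is only an illustration, not the contents of evals.sh: it assumes each script takes no command-line arguments and is launched from the repository root, and the helper names build_commands and run_all are hypothetical.

```python
import subprocess
import sys
from pathlib import Path

# Script names as listed in the README; "experiments" is the directory it mentions.
SCRIPTS = [
    "all_curves.py",
    "sparsity.py",
    "fourth_norm.py",
    "collisions.py",
    "correlations.py",
]


def build_commands(experiments_dir="experiments", python=sys.executable):
    """Return one command (as an argument list) per experiment script.

    Assumption: the scripts need no command-line arguments.
    """
    return [[python, str(Path(experiments_dir) / name)] for name in SCRIPTS]


def run_all(experiments_dir="experiments", dry_run=True):
    """Print each command and, unless dry_run is set, execute it.

    check=True makes the runner stop at the first script that fails.
    """
    commands = build_commands(experiments_dir)
    for cmd in commands:
        print(" ".join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)
    return commands


if __name__ == "__main__":
    # Dry run by default: only shows what would be executed.
    run_all(dry_run=True)
```

Calling run_all(dry_run=False) from the repository root would execute the five scripts in order. To regenerate a single plot, run the corresponding script on its own.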

About

Understanding incidental polysemanticity in autoencoders
