AttentionLens

Read the paper.

Read the docs here: https://msakarvadia.github.io/AttentionLens/

Interpreting the latent space representations of attention head outputs for LLMs.

To train attention lenses, run the command python train.py.

Pytorch Lightning has been used to support distributed training, so you can also use torch.distributed.run <args> to distribute training across nodes. Better documentation coming soon.

Demos for how to use a lens to view the vocabulary latent space of a specific attention head can be found in the demos/ dir. Again, better docs coming soon. 😄

Installation

Requirements: python >=3.7,<3.11

git clone https://github.com/msakarvadia/AttentionLens.git
cd AttentionLens
conda create --name attnlens python==3.10
conda activate attnlens
pip install -r requirements.txt # or use requirements_cpu.txt if you only have CPU access
pip install .

Documentation

We used mkdocs to generate our documentation. To launch it locally, first run pip install mkdocs in your environment for AttentionLens. Then, run mkdocs serve. It will ask for you to install additional required packages based on the current configuration. Install those with pip until they're all resolved.

Development

git clone https://github.com/msakarvadia/AttentionLens.git
cd AttentionLens
conda create --name attnlens python==3.10
conda activate attnlens
pip install -r requirements.txt # or use requirements_cpu.txt if you only have CPU access
pip install -e . # editable installation

Name		Name	Last commit message	Last commit date
Latest commit History 141 Commits
attention_lens		attention_lens
demos		demos
docs		docs
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
load_args.py		load_args.py
mkdocs.yml		mkdocs.yml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
requirements_cpu.txt		requirements_cpu.txt
setup.py		setup.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

attention_lens

attention_lens

demos

demos

docs

docs

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

load_args.py

load_args.py

mkdocs.yml

mkdocs.yml

pyproject.toml

pyproject.toml

requirements.txt

requirements.txt

requirements_cpu.txt

requirements_cpu.txt

setup.py

setup.py

train.py

train.py

Repository files navigation

AttentionLens

Installation

Documentation

Development

About

Releases

Packages

Contributors 5

Languages

License

msakarvadia/AttentionLens

Folders and files

Latest commit

History

Repository files navigation

AttentionLens

Installation

Documentation

Development

About

Resources

License

Stars

Watchers

Forks

Languages