
Cleanup Nodes

Blogpost

We provide concrete evidence for memory management, or clean-up, in gelu-4l, a 4-layer transformer. We then examine the implications for Direct Logit Attribution (DLA), a rough method for measuring the relevance of attention heads and MLP layers to a specific task. We conclude that DLA is misleading because it does not account for clean-up.
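
To make DLA concrete, here is a minimal sketch using TransformerLens (the library that distributes gelu-4l). The prompt, the answer token, and the choice of get_full_resid_decomposition are our illustrative assumptions, not the repo's exact code:

from transformer_lens import HookedTransformer

# Load the 4-layer GELU model studied in this project
model = HookedTransformer.from_pretrained("gelu-4l")

# Hypothetical example prompt and expected next token
prompt = "The Eiffel Tower is in the city of"
answer_id = model.to_single_token(" Paris")

_, cache = model.run_with_cache(model.to_tokens(prompt))

# Decompose the final residual stream into per-head and per-MLP-layer
# contributions at the last position, with the final LayerNorm applied
resid_stack, labels = cache.get_full_resid_decomposition(
    expand_neurons=False, apply_ln=True, pos_slice=-1, return_labels=True
)

# DLA: project each component's contribution onto the answer token's
# unembedding direction; large positive scores suggest task relevance,
# but this ignores any later clean-up of those contributions
dla = resid_stack[:, 0] @ model.W_U[:, answer_id]
for label, score in sorted(zip(labels, dla.tolist()), key=lambda x: -x[1]):
    print(f"{label}: {score:+.3f}")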

James Dao, Yeu-Tong Lau, Can Rager, and Jett Janiak did this work as the final capstone project of ARENA in 2023. The Alignment Research Engineer Accelerator (ARENA) is a fellowship covering software engineering, natural language processing, reinforcement learning, and distributed computing.

Environment Setup

If you don't already have conda/mamba, install it with:

cd ~ && \
wget https://github.com/conda-forge/miniforge/releases/latest/download/Mambaforge-Linux-x86_64.sh && \
bash Mambaforge-Linux-x86_64.sh -b && \
mambaforge/bin/mamba init bash && \
exec bash

Clone the repo and cd into it:

git clone https://github.com/canrager/cleanup_nodes.git && cd cleanup_nodes

Install the conda/mamba environment and activate it:

mamba env create -f environment.yaml && mamba activate cleanup
