LLMs Process Lists With General Filter Heads

Project Website | Arxiv Preprint

This repository contains code, data, and visualizations for the paper "LLMs Process Lists With General Filter Heads".

How does an LLM perform filtering operation over a list of items? We find that a small set of specialized attention heads, which we call filter heads, are responsible for this across a range of different situations. The query states of these heads encode a compact representation of the filtering criterion (the predicate), which can be transported to a different context to trigger the execution of the same filtering operation on a different list of items, presented in a different format, language, even different tasks.

Checkout filter.baulab.info for more details.

Setup

All code is tested on Ubuntu 20.04 and 24.04 with A100 or A6000 GPUs. We've used python >= 3.11, torch >= 2.7, and transformers >= 4.55. We recommend duplicating our conda environment:

conda env create -f conda_env.yml

Some of the packages in conda_env.yml won't be strictly required for the code to run. We will clean this up in the future.
baukit may not get installed with conda. Please install it separately using pip install git+https://github.com/davidbau/baukit.
You will need to have a env.yml file (similar to the env_demo.yml) in the root directory of the project.

Code

Checkout the demo notebook for a quick overview of the main idea.
You can locate the filter heads using the scripts/locate_selection_heads.py script. Usage:

python -m scripts.locate_selection_heads --model="<model_name>"

Please refer to the script for additional arguments.

The data_save folder contains the items/entities used to generate the samples for different tasks. Please refer to src/selection/data.py for more details.

Citation

@article{sensharma2023filter,
    title={LLMs Process Lists With General Filter Heads}, 
    author={Arnab Sen Sharma and Giordano Rogers and Natalie Shapira and David Bau},
    year={2025},
    eprint={2510.26784},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}

Name		Name	Last commit message	Last commit date
Latest commit History 327 Commits
.remote_jobs		.remote_jobs
data_save		data_save
notebooks		notebooks
run_jobs		run_jobs
scripts		scripts
src		src
test_suite		test_suite
.gitignore		.gitignore
conda_env.yml		conda_env.yml
demo.ipynb		demo.ipynb
env_demo.yml		env_demo.yml
globals.yml		globals.yml
readme.md		readme.md
run_finetuning.py		run_finetuning.py
run_monitor.py		run_monitor.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

LLMs Process Lists With General Filter Heads

Project Website | Arxiv Preprint

Setup

Code

Citation

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

arnab-api/filter

Folders and files

Latest commit

History

Repository files navigation

LLMs Process Lists With General Filter Heads

Project Website | Arxiv Preprint

Setup

Code

Citation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages