lmexp

About

Simple starter code for experiments on open-source LLMs. Built for my SPAR project participants, but anyone is welcome to use it.

Setup

# Optional: create and activate a virtual environment
python3 -m venv venv
source venv/bin/activate
# Run from the root of the repo; this installs everything you need
pip install -e .

To download Llama models from Hugging Face or use the Claude API, add a `.env` file in the root of the repo with your API keys (see `.env.example`).
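
As a rough illustration of the `.env` format, here is a minimal standard-library sketch that reads `KEY=VALUE` lines from such a file. The function name `read_env_file` and the key `EXAMPLE_API_KEY` are hypothetical; the actual key names are the ones listed in `.env.example`, and the repo may load them differently.

```python
from pathlib import Path
from typing import Dict


def read_env_file(path: str = ".env") -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blank lines and # comments.

    Hypothetical helper for illustration; not part of lmexp.
    """
    values: Dict[str, str] = {}
    env_path = Path(path)
    if not env_path.exists():
        return values
    for raw in env_path.read_text().splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


if __name__ == "__main__":
    keys = read_env_file()
    # EXAMPLE_API_KEY is a placeholder name, not a key the repo defines.
    print("EXAMPLE_API_KEY set:", "EXAMPLE_API_KEY" in keys)
```

Keeping keys in `.env` rather than in code means they stay out of version control, provided `.env` is listed in `.gitignore`.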

Contents

`generic`

See `models/implementations/gpt2small.py` for an example of how to use this class. The idea is to write a single implementation of a technique and then apply it to any model we want. This is very similar to the TransformerLens paradigm, but pared down to just the functionality we're likely to use. Feel free to use TransformerLens if you want more features.
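
The "write once, apply to any model" idea can be sketched in plain Python. Everything below is a toy under stated assumptions: `ModelWrapper`, `DoublingModel`, `IncrementModel`, and `steer` are hypothetical names, not lmexp's actual classes, and the "models" are stand-ins that operate on lists of floats rather than real transformer activations.

```python
from abc import ABC, abstractmethod
from typing import Callable, Dict, List

Vector = List[float]
Hook = Callable[[Vector], Vector]


class ModelWrapper(ABC):
    """Hypothetical common interface that each model implementation fills in."""

    def __init__(self) -> None:
        self._hooks: Dict[int, List[Hook]] = {}

    @abstractmethod
    def n_layers(self) -> int:
        """Number of layers in the wrapped model."""

    @abstractmethod
    def run_layer(self, layer: int, x: Vector) -> Vector:
        """Model-specific computation for one layer."""

    def add_hook(self, layer: int, hook: Hook) -> None:
        self._hooks.setdefault(layer, []).append(hook)

    def clear_hooks(self) -> None:
        self._hooks.clear()

    def forward(self, x: Vector) -> Vector:
        # Run each layer, then apply any hooks registered on its output.
        for layer in range(self.n_layers()):
            x = self.run_layer(layer, x)
            for hook in self._hooks.get(layer, []):
                x = hook(x)
        return x


class DoublingModel(ModelWrapper):
    """Toy model: two layers, each doubling its input."""

    def n_layers(self) -> int:
        return 2

    def run_layer(self, layer: int, x: Vector) -> Vector:
        return [2 * v for v in x]


class IncrementModel(ModelWrapper):
    """Toy model: three layers, each adding one to its input."""

    def n_layers(self) -> int:
        return 3

    def run_layer(self, layer: int, x: Vector) -> Vector:
        return [v + 1 for v in x]


def steer(model: ModelWrapper, layer: int, direction: Vector, scale: float = 1.0) -> None:
    """Technique written once against the interface: add a scaled vector
    to the output of one layer, whichever model is behind the wrapper."""
    model.add_hook(layer, lambda x: [v + scale * d for v, d in zip(x, direction)])


if __name__ == "__main__":
    for model in (DoublingModel(), IncrementModel()):
        baseline = model.forward([1.0, 2.0])
        steer(model, layer=0, direction=[1.0, 0.0])
        print(type(model).__name__, baseline, model.forward([1.0, 2.0]))
```

Because `steer` only touches the shared interface, supporting a new model means writing one more subclass, not reimplementing the technique.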

`models`

Model implementations. Currently has:

Gemma 2
Llama 3.1
Qwen 1.5
GPT-2 (useful for testing locally)

`notebooks`

Jupyter notebooks demonstrating basic use-cases.

To do

Integrate with Gemma 2 SAEs / SAE feature steering
Port over all the experiments / plotting code from CAA repo
More contrast pair datasets


nrimsky/lmexp
