A Mechanistic Analysis of a Transformer Trained on Symbolic Multi-Step Reasoning Task

This is the official implementation of A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task.

Figure 1: Given an input prompt, the model concatenates edge tokens in a single token position (A), and copies the goal node into the final token position (B). Then, the next step is identified by applying an iterative algorithm that climbs the tree one level per layer (C).

Usage

1. Dependencies

To install dependencies:

conda env update --file environment.yml

2. Training and Evaluation Code

To train a model from scratch or continue the training, use training.py. We provide functions that have been used for aanay in src/utils.py.

3. Pre-trained Model

The model checkpoint we studied in our work is provided in model.pt.

4. Replication of Results

The notebook figures.ipynb replicates all figures we report in our paper.

Citation Information

BibTeX citation:

@misc{brinkmann2024mechanistic,
      title={A Mechanistic Analysis of a Transformer Trained on a Symbolic Multi-Step Reasoning Task}, 
      author={Jannik Brinkmann and Abhay Sheshadri and Victor Levoso and Paul Swoboda and Christian Bartelt},
      year={2024},
      eprint={2402.11917},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Name		Name	Last commit message	Last commit date
Latest commit History 86 Commits
src		src
.gitignore		.gitignore
README.md		README.md
dataset.txt		dataset.txt
environment.yml		environment.yml
figures.ipynb		figures.ipynb
model.pt		model.pt
training.py		training.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Mechanistic Analysis of a Transformer Trained on Symbolic Multi-Step Reasoning Task

Usage

1. Dependencies

2. Training and Evaluation Code

3. Pre-trained Model

4. Replication of Results

Citation Information

About

Releases

Packages

Languages

jannik-brinkmann/backward-chaining-circuits

Folders and files

Latest commit

History

Repository files navigation

A Mechanistic Analysis of a Transformer Trained on Symbolic Multi-Step Reasoning Task

Usage

1. Dependencies

2. Training and Evaluation Code

3. Pre-trained Model

4. Replication of Results

Citation Information

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages