This project investigates the internal reasoning mechanisms of Large Reasoning Models (LRMs), such as DeepSeek-R1, by tracing attention patterns from the final answer back to the intermediate Chain-of-Thought (CoT) steps. By employing both token-level and segment-level attention analysis, we construct hierarchical "Reasoning Trees" that reveal which specific thoughts most heavily influence the model's conclusion. Our analysis highlights a "bookending" effect, where models tend to prioritize early framing and final deductions over intermediate reasoning steps. This framework provides a method to visualize redundancy and localize errors within the model's thought process.
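To make the core idea concrete, here is a minimal sketch of the segment-level aggregation step, assuming a Hugging Face causal LM; the model id, prompt, and segment boundaries below are illustrative placeholders rather than the project's actual configuration.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B"  # assumption: any R1-style model
tokenizer = AutoTokenizer.from_pretrained(model_id)
# "eager" attention so that the attention weights are actually returned.
model = AutoModelForCausalLM.from_pretrained(model_id, attn_implementation="eager")

text = "Question: ... <think> step 1 ... step 2 ... </think> Answer: ..."
inputs = tokenizer(text, return_tensors="pt")

with torch.no_grad():
    out = model(**inputs, output_attentions=True)

# Stack per-layer attentions -> (layers, batch, heads, seq, seq), then average
# over layers and heads to obtain a single (seq, seq) matrix.
attn = torch.stack(out.attentions).mean(dim=(0, 2))[0]
seq_len = attn.size(0)

# Hypothetical split into two CoT segments and the answer span; a real analysis
# would locate the segment boundaries (e.g., the <think>/</think> tokens) instead.
segments = {"thought_0": (0, int(0.4 * seq_len)),
            "thought_1": (int(0.4 * seq_len), int(0.8 * seq_len))}
answer_span = (int(0.8 * seq_len), seq_len)

# How strongly do answer tokens attend back to each CoT segment?
answer_rows = attn[answer_span[0]:answer_span[1]]
scores = {name: answer_rows[:, s:e].sum(dim=1).mean().item()
          for name, (s, e) in segments.items()}
print(sorted(scores.items(), key=lambda kv: -kv[1]))
```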
---

Follow these steps to set up the environment from scratch.
Ensure you have Python 3.11+ and a CUDA-compatible GPU available (the code defaults to `cuda`).
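The scripts assume a GPU is present; a quick way to confirm that PyTorch can actually see one before you start is:

```python
import torch

# Should print True (and a GPU name) on a correctly configured CUDA machine.
print(torch.cuda.is_available())
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))
```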
Create a virtual environment and install the required dependencies.
```bash
# Create a virtual environment
python -m venv venv
source venv/bin/activate  # On Windows use: venv\Scripts\activate

# Install dependencies
pip install -r requirements.txt
```

To understand how the visualization works on a single prompt (e.g., "What is the color of the sky?"), run `example.py`. This script performs inference, aggregates attention scores, and generates reasoning trees and heatmaps for one sample.
```bash
python example.py
```

Outputs:
- Console: Prints the reasoning tree and top-K attended tokens.
- `images/`: Saves attention heatmaps (token-level and segment-level) for the single sample.
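For reference, a segment-level heatmap of the kind written to `images/` can be rendered from an aggregated attention matrix with a few lines of matplotlib; the array, labels, and file name below are illustrative assumptions, not the script's exact output.

```python
import os
import numpy as np
import matplotlib.pyplot as plt

seg_attn = np.random.rand(3, 6)   # placeholder: (answer segments x CoT segments)
cot_labels = [f"thought_{i}" for i in range(seg_attn.shape[1])]

os.makedirs("images", exist_ok=True)
fig, ax = plt.subplots(figsize=(6, 3))
im = ax.imshow(seg_attn, cmap="viridis", aspect="auto")
ax.set_xticks(range(len(cot_labels)), labels=cot_labels, rotation=45, ha="right")
ax.set_xlabel("Chain-of-Thought segment")
ax.set_ylabel("Answer segment")
fig.colorbar(im, ax=ax, label="mean attention")
fig.tight_layout()
fig.savefig("images/segment_level_heatmap_example.png", dpi=200)
```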
To perform the analysis on the ARC (AI2 Reasoning Challenge) dataset, run `main_reasoning_tree.py`. This script automatically downloads the data, iterates through the samples, and generates an aggregate visualization of attention patterns across all inputs.

```bash
python main_reasoning_tree.py
```

Dataset: The project uses the ARC split of the MechanisticProbe dataset.
- HuggingFace Link: Dataset
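`main_reasoning_tree.py` downloads the data automatically, but if you want to browse the samples yourself you can load them with the `datasets` library; the identifier below is a placeholder for the dataset behind the HuggingFace link above.

```python
from datasets import load_dataset

# Placeholder id -- substitute the dataset id from the HuggingFace link above.
ds = load_dataset("<mechanistic-probe-arc-dataset-id>", split="train")
print(ds)       # schema and number of rows
print(ds[0])    # first question/answer sample
```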
Outputs:
- `samples.txt`: A text file containing the parsed questions and answers.
- `IMAGES/`: Directory containing individual heatmaps for every sample processed.
- `IMAGES/Final_Average_Pooled_Attention_All_Samples.png`: A summary heatmap showing how the model attends to different reasoning segments across the entire dataset.
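The dataset-wide summary image is conceptually an average of the per-sample segment-attention matrices; a minimal sketch of that pooling step (variable names are assumptions) looks like this:

```python
import numpy as np

# One (answer_segments x cot_segments) matrix per processed sample,
# already padded or truncated to a common shape.
per_sample_attn = [np.random.rand(3, 6) for _ in range(25)]   # placeholder data

pooled = np.stack(per_sample_attn, axis=0).mean(axis=0)       # average over samples
print(pooled.shape)   # (3, 6): the matrix behind the summary heatmap
```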
