This repository contains all scripts for re-producing the results of the our paper Lost in Transmission: When and Why LLMs Fail to Reason Globally.
Reference:
@misc{schnabel2025bapo,
title={Lost in Transmission: When and Why LLMs Fail to Reason Globally},
author={Tobias Schnabel and Kiran Tomlinson and Adith Swaminathan and Jennifer Neville},
year={2025},
eprint={2505.08140},
url={https://arxiv.org/abs/2505.08140},
}We recommend using a new environment for the requirements. You can do this using venv or conda.
For conda:
conda env create -f environment.yml
conda activate runbapoFor venv:
python -m venv runbapo
source runbapo/bin/activate # On Windows use `runbapo\Scripts\activate`
pip install -r requirements.txtSet the OPENAI_API_KEY, ANTHROPIC_API_KEY, and GOOGLE_API_KEY environment variables, e.g.,
export OPENAI_API_KEY=<your_openai_api_key>
export ANTHROPIC_API_KEY=<your_anthropic_api_key>
export GOOGLE_API_KEY=<your_google_api_key>- Download the raw Space digest dataset from this link as well as the subset from the ZeroScrolls benchmark benchmark.
- Place the files in a new directory called
processed_data. - Run the preprocessing script:
python preprocess_space_digest.py
python bapo_experiments.pyCode structure:
- Each experiment is implemented as a subclass inheriting from the
Experimentbase class. generate_data()provides the main functionality for generating the data used in the experiment
python plot_results.pyIf you have any questions or issues regarding this code, please open an issue on the GitHub repository. For questions related to the paper, please contact the authors via email.
This project is licensed under the MIT License. See the LICENSE file for details.