Overview

This repository contains all scripts for re-producing the results of the our paper Lost in Transmission: When and Why LLMs Fail to Reason Globally.

Reference:

@misc{schnabel2025bapo,
      title={Lost in Transmission: When and Why LLMs Fail to Reason Globally}, 
      author={Tobias Schnabel and Kiran Tomlinson and Adith Swaminathan and Jennifer Neville},
      year={2025},
      eprint={2505.08140},
      url={https://arxiv.org/abs/2505.08140}, 
}

How To Run

Install requirements

We recommend using a new environment for the requirements. You can do this using venv or conda.

For conda:

conda env create -f environment.yml
conda activate runbapo

For venv:

python -m venv runbapo
source runbapo/bin/activate  # On Windows use `runbapo\Scripts\activate`
pip install -r requirements.txt

Set up API keys

Set the OPENAI_API_KEY, ANTHROPIC_API_KEY, and GOOGLE_API_KEY environment variables, e.g.,

 export OPENAI_API_KEY=<your_openai_api_key>
 export ANTHROPIC_API_KEY=<your_anthropic_api_key>
 export GOOGLE_API_KEY=<your_google_api_key>

Pre-process the Space Digest dataset

Download the raw Space digest dataset from this link as well as the subset from the ZeroScrolls benchmark benchmark.
Place the files in a new directory called processed_data.
Run the preprocessing script:
```
python preprocess_space_digest.py
```

Run Experiments

python bapo_experiments.py

Code structure:

Each experiment is implemented as a subclass inheriting from the Experiment base class.
generate_data() provides the main functionality for generating the data used in the experiment

Generate Plots

python plot_results.py

Questions and Issues

If you have any questions or issues regarding this code, please open an issue on the GitHub repository. For questions related to the paper, please contact the authors via email.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
exps		exps
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
SECURITY.md		SECURITY.md
bapo_experiments.py		bapo_experiments.py
environment.yml		environment.yml
plot_results.py		plot_results.py
preprocess_space_digest.py		preprocess_space_digest.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Overview

How To Run

Install requirements

Set up API keys

Pre-process the Space Digest dataset

Run Experiments

Generate Plots

Questions and Issues

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Overview

How To Run

Install requirements

Set up API keys

Pre-process the Space Digest dataset

Run Experiments

Generate Plots

Questions and Issues

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages