🚲 Thought Tracing

This is the official repository of our 2025 COLM paper:
"Thought Tracing: Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models"

Please cite our work if you found the resources in this repository useful:

@inproceedings{kim2025tracing,
    title={Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models},
    author={Hyunwoo Kim and Melanie Sclar and Tan Zhi-Xuan and Lance Ying and Sydney Levine and Yang Liu and Joshua B. Tenenbaum and Yejin Choi},
    booktitle={COLM},
    year=2025
}

Environment setup

conda env create -f environment.yml; conda activate thought-tracing
pip install flash-attn==2.5.1.post1
python -m ipykernel install --user --name thought-tracing --display-name "thought-tracing"
huggingface-cli login

Adding your own agent

All you need to do is create an agent class with the method interact() or batch_interact().

Running Evaluation

ToMi

cd revised_tomi
python test_tomi.py --model gpt-4o-2024-11-20 --use-tracing --tracing-model gpt-4o-2024-11-20 --print --run-id tracer-first-run --dataset tomi --tracer-type tracer

FANToM

cd fantom
python test_fantom.py --model gpt-4o-2024-08-06 --use-tracing --tracing-model gpt-4o-2024-08-06 --print --dataset fantom --tracer-type multi-tracer --input-is-chat --run-id tracer-first-run

BigToM

cd bigtom
python test_bigtom.py --model Qwen/Qwen2.5-72B-Instruct-Turbo --use-tracing --tracing-model Qwen/Qwen2.5-72B-Instruct-Turbo --dataset bigtom --print True --tracer-type tracer --target-subset agree90 --use-helper-llm --run-id tracer-first-run

MMToM-QA

cd mmtom_qa
python test_mmtom.py --model gpt-4o --use-tracing --tracing-model gpt-4o --dataset mmtom --print --likelihood-estimate prompting --tracer-type tracer --run-id debugging_gpt-4o

💡 Note that all four evaluation scripts have been adapted from the original implementations of the benchmarks. I aimed to keep the changes minimal.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
agents		agents
bigtom		bigtom
data		data
fantom		fantom
mmtom_qa		mmtom_qa
prompt_templates		prompt_templates
revised_tomi		revised_tomi
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
hypothesis.py		hypothesis.py
tracer.py		tracer.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🚲 Thought Tracing

Environment setup

Adding your own agent

Running Evaluation

ToMi

FANToM

BigToM

MMToM-QA

About

Uh oh!

Releases

Packages

Languages

License

skywalker023/thought-tracing

Folders and files

Latest commit

History

Repository files navigation

🚲 Thought Tracing

Environment setup

Adding your own agent

Running Evaluation

ToMi

FANToM

BigToM

MMToM-QA

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages