Hi-ToM Dataset

Contains ToMh data consisting of story-question pairs and the corresponding answers. The names of subfolder branches have the following meanings:

Tell / No_Tell: whether or not the stories contain communications among agents.
MC / CoT: the prompting style. MC corresponds to Vanilla Prompting (VP) in the paper, while CoT stands for Chain-of-Thought Prompting (CoTP).
length_n: the story length, i.e. the number of chapters in a story. From 1 to 3.
sample_n: the numbering of different sample stories.
order_n: the ToM order of the question. From 0 to 4.

Contains prompt files that can be directly input to API. The data in it are almost the same as Hi-ToM_data, except that answers are eliminated.

Run the script generate_tomh.sh.

Name		Name	Last commit message	Last commit date
Latest commit History 61 Commits
Hi-ToM_data		Hi-ToM_data
Hi-ToM_prompts		Hi-ToM_prompts
media		media
.DS_Store		.DS_Store
.gitignore		.gitignore
.gitmodules		.gitmodules
Hi-ToM_data.json		Hi-ToM_data.json
LICENSE		LICENSE
README.md		README.md
actions.py		actions.py
clause.py		clause.py
create_world.py		create_world.py
dynamic_actions.py		dynamic_actions.py
generate_prompts.py		generate_prompts.py
generate_tasks.py		generate_tasks.py
generate_tomh.sh		generate_tomh.sh
oracle.py		oracle.py
stringify.py		stringify.py
tasks.py		tasks.py
toJSON.ipynb		toJSON.ipynb
utils.py		utils.py
world.py		world.py
world_large.txt		world_large.txt
world_small.txt		world_small.txt
world_tiny.txt		world_tiny.txt

ying-hui-he/Hi-ToM_dataset