What does it take to bake a cake? The RecipeRef corpus and anaphora resolution in procedural text

Introduction

This repository contains code and experiment results introduced in the following paper:

For the data and detailed annotation guideline of the RecipeRef corpus, please refer to RecipeRef dataset. We also provided data in jsonlines format in data but for the original data, please refer to RecipeRef dataset.

Install python (preference 3) requirement: pip install -r requirements.txt
Download GloVe embeddings and also another version glove_50_300_2.txt
Download RecipeRef dataset and put it into the data directory. Note that we separate full set and partition 80.
run setup_all.sh and then setup_training.sh
Install brat evalation tool
We use nltk to tokenzize the brat file for training and generating the jsonlines files. Our code can be found in convert_brat_into_training_format-clear.ipynb

Evaluation: python evaluate_folds.py <experiment>
Evaluation tool provides differnet settings, exact and relax mention matching. For this paper, we use exact mention matching

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
brateval		brateval
chemu_check_point		chemu_check_point
data		data
README.md		README.md
anaphora_model.py		anaphora_model.py
brat.py		brat.py
cache_elmo.py		cache_elmo.py
char_vocab.english.txt		char_vocab.english.txt
continuous_evaluate.py		continuous_evaluate.py
conversion.py		conversion.py
convert_brat_into_training_format-clear.ipynb		convert_brat_into_training_format-clear.ipynb
coref_kernels.cc		coref_kernels.cc
coref_ops.py		coref_ops.py
evaluate_folds.py		evaluate_folds.py
experiments.conf		experiments.conf
filter_embeddings.py		filter_embeddings.py
get_char_vocab.py		get_char_vocab.py
metrics.py		metrics.py
minimize.py		minimize.py
ps.py		ps.py
requirements.txt		requirements.txt
sentencesplit.py		sentencesplit.py
setup_all.sh		setup_all.sh
setup_training.sh		setup_training.sh
ssplit.py		ssplit.py
train_folds.py		train_folds.py
util.py		util.py
worker.py		worker.py