Learn What Is Possible, Then Choose What Is Best: Disentangling One-To-Many Relations in Language Through Text-based Games

The official code for the EMNLP-FINDINGS 2022 paper Learn What Is Possible, Then Choose What Is Best: Disentangling One-To-Many Relations in Language Through Text-based Games.

Overview

Pre-training language models on large self-supervised corpora, followed by task-specific fine-tuning, has become the dominant paradigm in NLP. These pre-training datasets often have a one-to-many structure: in dialogue, for example, there are many valid responses for a given context. However, only some of these responses will be desirable in our downstream task. This raises the question of how we should train the model so that it emulates the desirable behaviours but not the undesirable ones. Current approaches train in a one-to-one setup, where only a single target response is given for a single dialogue context, which leads models to learn only the average response while ignoring the full range of possible responses. Using text-based games as a testbed, our approach, PASA, uses discrete latent variables to capture the range of different behaviours represented in our larger pre-training dataset. We then use knowledge distillation to distil the posterior probability distribution into a student model. This probability distribution is far richer than the hard targets of the dataset alone, and thus allows the student model to benefit from the wider range of actions the teacher model has learned. Results show up to a 49% empirical improvement over the previous state-of-the-art model on the Jericho Walkthroughs dataset.
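To make the difference between hard targets and a distilled distribution concrete, here is a minimal, dependency-free sketch. It is not the code from this repo: the KL-divergence loss is an assumption (a common choice for knowledge distillation), and the candidate actions, teacher probabilities and student logits are made-up illustrative values.

```python
import math

def softmax(logits):
    """Turn raw scores into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def hard_target_loss(student_logits, target_index):
    """Cross-entropy against a single gold action (the one-to-one setup)."""
    return -math.log(softmax(student_logits)[target_index])

def distillation_loss(student_logits, teacher_probs):
    """KL(teacher || student): the student is pulled towards the teacher's
    whole distribution rather than towards a single gold action."""
    student_probs = softmax(student_logits)
    return sum(
        t * math.log(t / s)
        for t, s in zip(teacher_probs, student_probs)
        if t > 0
    )

# Hypothetical example: three candidate actions for one game state.
actions = ["open door", "go north", "take lamp"]
teacher_probs = [0.6, 0.3, 0.1]   # assumed teacher distribution over actions
student_logits = [2.0, 0.5, -1.0]  # assumed raw student scores

print("hard-target loss: ", hard_target_loss(student_logits, 0))
print("distillation loss:", distillation_loss(student_logits, teacher_probs))
```

The hard-target loss only looks at the probability of one action, whereas the distillation loss is zero only when the student matches the teacher on every action, including the plausible alternatives.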

Getting Started

First, you will want to download the ClubFloyd dataset from: https://github.com/princeton-nlp/calm-textgame/tree/master/calm. The paper also makes use of the Jericho Walkthroughs, the preprocessed versions of which are included in this repo. The original repo for the Jericho framework can be found at: https://github.com/microsoft/jericho.

The requirements for the code are:

transformers
pytorch
datasets
spacy
nltk
scikit-learn
scipy
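As a convenience, the following sketch checks whether the packages listed above can be imported in the current environment. It is not part of the repo. The mapping from requirement name to import name is an assumption based on how these packages are usually imported (for example, pytorch is imported as `torch` and scikit-learn as `sklearn`), and no versions are checked because none are stated here.

```python
from importlib.util import find_spec

# Requirement name from the list above -> module name used in `import`.
# The import names are the usual ones for these packages (an assumption).
REQUIREMENTS = {
    "transformers": "transformers",
    "pytorch": "torch",
    "datasets": "datasets",
    "spacy": "spacy",
    "nltk": "nltk",
    "scikit-learn": "sklearn",
    "scipy": "scipy",
}

def missing_requirements(requirements=REQUIREMENTS):
    """Return the requirement names whose module cannot be found."""
    return [
        name
        for name, module in requirements.items()
        if find_spec(module) is None
    ]

missing = missing_requirements()
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All listed requirements can be imported.")
```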

The entry point for training the model is train.py.

(1) Pre-train the teacher model on ClubFloyd:

python train.py --model_type latent \
  --task_type clubfloyd \
  --data_directory /path/to/clubfloyd \
  --intent_type regex \
  --model_path distilbert-base-uncased \
  --output_dir clubfloyd_regex \
  --epochs 3

(2) Pre-train the student model on ClubFloyd in the same way:

python train.py --model_type baseline \
  --task_type clubfloyd \
  --data_directory /path/to/clubfloyd \
  --model_path distilbert-base-uncased \
  --output_dir clubfloyd_baseline \
  --epochs 3

(3) Fine-tune the teacher model on Jericho Walkthroughs:

python train.py --model_type latent \
 --task_type jericho \
 --intent_type regex \
 --model_path clubfloyd_regex \
 --output_dir jericho_regex \
 --epochs 1

(4) Knowledge distillation on Jericho Walkthroughs:

python train.py --model_type distillation \
 --task_type jericho \
 --intent_type regex \
 --student_path clubfloyd_baseline \
 --teacher_path jericho_regex \
 --output_dir distillation \
 --epochs 1

(5) Obtain game-by-game evaluation results on the Jericho test data:

python eval.py --model_path distillation \
 --model_type baseline \
 --task_type jericho
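The five steps can also be chained from one script. The sketch below is not part of the repo: it only assembles the commands shown above, using the same flags, and prints them in order. `build_pipeline`, `run_pipeline` and the `/path/to/clubfloyd` placeholder are hypothetical names introduced here. It assumes the distilled student is written to `distillation`, since that is the directory step (5) evaluates.

```python
import shlex
import subprocess

def build_pipeline(clubfloyd_dir):
    """Return the five commands from the steps above as argument lists."""
    base = "distilbert-base-uncased"
    return [
        # (1) Pre-train the teacher on ClubFloyd.
        ["python", "train.py", "--model_type", "latent",
         "--task_type", "clubfloyd", "--data_directory", clubfloyd_dir,
         "--intent_type", "regex", "--model_path", base,
         "--output_dir", "clubfloyd_regex", "--epochs", "3"],
        # (2) Pre-train the student on ClubFloyd.
        ["python", "train.py", "--model_type", "baseline",
         "--task_type", "clubfloyd", "--data_directory", clubfloyd_dir,
         "--model_path", base,
         "--output_dir", "clubfloyd_baseline", "--epochs", "3"],
        # (3) Fine-tune the teacher on Jericho Walkthroughs.
        ["python", "train.py", "--model_type", "latent",
         "--task_type", "jericho", "--intent_type", "regex",
         "--model_path", "clubfloyd_regex",
         "--output_dir", "jericho_regex", "--epochs", "1"],
        # (4) Distil the teacher into the student (output dir is an assumption).
        ["python", "train.py", "--model_type", "distillation",
         "--task_type", "jericho", "--intent_type", "regex",
         "--student_path", "clubfloyd_baseline",
         "--teacher_path", "jericho_regex",
         "--output_dir", "distillation", "--epochs", "1"],
        # (5) Evaluate the distilled student game by game.
        ["python", "eval.py", "--model_path", "distillation",
         "--model_type", "baseline", "--task_type", "jericho"],
    ]

def run_pipeline(commands, dry_run=True):
    """Print each command; execute it too unless dry_run is set."""
    for cmd in commands:
        print(shlex.join(cmd))
        if not dry_run:
            subprocess.run(cmd, check=True)  # stop at the first failing step

commands = build_pipeline("/path/to/clubfloyd")  # placeholder path
run_pipeline(commands)  # dry run: prints the commands without running them
```

Calling `run_pipeline(commands, dry_run=False)` from the repo root would execute the steps in order and stop at the first failure.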

Citation

Please cite our paper if you find PASA useful in your work:

@inproceedings{towle-zhou-2022-learn,
   title = "Learn What Is Possible, Then Choose What Is Best: Disentangling One-To-Many Relations in Language Through Text-based Games",
   author = "Towle, Benjamin  and
     Zhou, Ke",
   booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2022",
   month = dec,
   year = "2022",
   address = "Abu Dhabi, United Arab Emirates",
   publisher = "Association for Computational Linguistics",
   url = "https://aclanthology.org/2022.findings-emnlp.364",
   pages = "4955--4965"
}
