CHAI: A CHatbot AI

arXiv: https://arxiv.org/abs/2204.08426 | License: MIT

This repository contains the code for CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning.

(Overview figure: chai.png)

Installation Steps

This repo requires Python 3.8 or newer. Install the package by running the following commands from the main directory:

pip install -r requirements.txt
export PYTHONPATH=$PWD:$PYTHONPATH
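
As a quick sanity check after installation, you can verify the Python version and that the core dependencies import correctly. This is an illustrative snippet, not part of the repo; it assumes torch and transformers are among the packages pinned in requirements.txt:

import sys

import torch          # assumed to be listed in requirements.txt
import transformers   # assumed to be listed in requirements.txt

# The repo requires Python 3.8+
assert sys.version_info >= (3, 8), "Python 3.8 or newer is required"

print("torch:", torch.__version__)
print("transformers:", transformers.__version__)
print("CUDA available:", torch.cuda.is_available())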

Reproducing our results

We list all steps required to reproduce our results and baselines below.

Download data

To download all data required for the experiments and baselines, navigate to the data folder and run `make`. To download only the data needed to train CHAI, run `make dataset`. To remove all downloaded files, run `make clean`.
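
Once the download finishes, a few lines of Python can confirm the files are readable. This sketch only assumes data/train.json is valid JSON; the actual schema is defined by the data scripts:

import json

# Peek at the downloaded training split.
with open("data/train.json") as f:
    train = json.load(f)

print(type(train))
print("top-level entries:", len(train))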

Finetuning the Language Model

We use GPT-2 as our language model. To finetune GPT-2 on the Craigslist dataset, determine the largest batch size your GPU can hold and run the following command from the scripts/transformers directory:

python finetune_gpt2.py \
	 --gpt2-type gpt2-medium \
	 --train-fp ../../data/train.json \
	 --val-fp ../../data/dev.json \
	 --batch-size <BATCH_SIZE> \
	 --output-dir ./logs/gpt2
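
After finetuning, you can sample from a checkpoint with the standard Transformers API to eyeball the language model before moving on. This is a sanity-check sketch; the checkpoint path and prompt format below are illustrative, and it assumes the tokenizer was saved alongside the model (otherwise load the gpt2-medium tokenizer instead):

from transformers import GPT2LMHeadModel, GPT2Tokenizer

ckpt = "./logs/gpt2/checkpoint-2000"   # example path; point this at your checkpoint
tokenizer = GPT2Tokenizer.from_pretrained(ckpt)
model = GPT2LMHeadModel.from_pretrained(ckpt)

# Illustrative prompt; the turn format used during finetuning may differ.
prompt = "Buyer: Hi, is the bike still available?\nSeller:"
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=40, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))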

We then use the finetuned GPT-2 to generate candidates and embeddings via the following commands:

python generate_sentences.py <CHECKPOINT_DIR>
python generate_embeddings.py <CHECKPOINT_DIR>

where <CHECKPOINT_DIR> is a checkpoint directory produced by the finetuning step above, for example logs/gpt2/checkpoint-2000/.

This should create sentences.pkl and embeddings.pkl files.
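
If you want to check the outputs, the pickles can be loaded directly. Their exact structure is defined by generate_sentences.py and generate_embeddings.py; this sketch assumes nothing beyond them being standard pickle files:

import pickle

with open("sentences.pkl", "rb") as f:
    sentences = pickle.load(f)
with open("embeddings.pkl", "rb") as f:
    embeddings = pickle.load(f)

# Inspect the types to see how candidate sentences map to embeddings.
print(type(sentences), type(embeddings))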

Training the RL agent

After generating the candidates and embeddings, we can train CHAI by using the scripts in the scripts/train directory. For example, to train CHAI with EMAQ, run

python chai_emaq.py \
	   --logdir ./logs/chai \
	   --filepath ./data/train.json \
	   --embeddings <PATH TO embeddings.pkl> \
	   --sentences <PATH TO sentences.pkl>
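
Conceptually, the agent trained here learns a Q-function that scores the precomputed GPT-2 candidate responses (via their embeddings) given the dialogue so far, and picks the highest-scoring candidate at each turn. The sketch below illustrates only that scoring-and-selection idea; the dimensions, module names, and architecture are assumptions, and the actual EMAQ training loop lives in chai_emaq.py:

import torch
import torch.nn as nn

class CandidateQNetwork(nn.Module):
    """Toy Q-network: scores candidate-response embeddings given a state embedding."""

    def __init__(self, state_dim=1024, cand_dim=1024, hidden=256):
        super().__init__()
        self.net = nn.Sequential(
            nn.Linear(state_dim + cand_dim, hidden),
            nn.ReLU(),
            nn.Linear(hidden, 1),
        )

    def forward(self, state, candidates):
        # state: (state_dim,), candidates: (num_candidates, cand_dim)
        state = state.unsqueeze(0).expand(candidates.size(0), -1)
        return self.net(torch.cat([state, candidates], dim=-1)).squeeze(-1)

q_net = CandidateQNetwork()
state = torch.randn(1024)            # embedding of the dialogue history
candidates = torch.randn(16, 1024)   # embeddings of 16 GPT-2 candidate replies
scores = q_net(state, candidates)
print("selected candidate index:", scores.argmax().item())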

Evaluate the model

To play around with CHAI, run the following command:

python eval.py \
	 --data-path <path to test.json> \
	 --gpt-dir <gpt checkpoint> \
	 --checkpoint-file <chai checkpoint> \
	 --buyer human \
	 --seller ours \
	 --num-rollouts 50 \
	 --output-path "results_human.json" \
	 --debug

The eval.py script also supports the automatic evaluations reported in the paper; these can be run by setting the buyer option to different values. Check out the script for more details.
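
After a run, the output file can be inspected with a few lines of Python. The structure of results_human.json is whatever eval.py writes; this sketch only assumes it is valid JSON:

import json

with open("results_human.json") as f:
    results = json.load(f)

# Adjust the keys once you know the actual structure written by eval.py.
print(type(results))
if isinstance(results, list):
    print("rollouts recorded:", len(results))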

Cite our work

If you found our work useful, please cite it as follows:

@inproceedings{verma-2022-chai,
    title = "{CHAI}: A {CH}atbot {AI} for Task-Oriented Dialogue with Offline Reinforcement Learning",
    author = "Verma, Siddharth AND Fu, Justin AND Yang, Sherry AND Levine, Sergey",
    booktitle = "Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies",
    month = jul,
    year = "2022",
    address = "Seattle, United States",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2022.naacl-main.332",
    pages = "4471--4491",
}

Feel free to contact me at vsiddharth@berkeley.edu for any questions or concerns!
