Implementation for "Prompter: Zero-shot Adaptive Prefixes for Dialogue State Tracking Domain Adaptation" paper.


Abstract:

A challenge in the Dialogue State Tracking (DST) field is adapting models to new domains without using any supervised data, i.e., zero-shot domain adaptation. Parameter-Efficient Transfer Learning (PETL) has the potential to address this problem due to its robustness. However, it has yet to be applied to zero-shot scenarios, as it is not clear how to apply it without supervision.

Our method, Prompter, uses descriptions of target-domain slots to generate dynamic prefixes that are concatenated to the keys and values of each layer's self-attention mechanism. This allows prefix-tuning to be used in the zero-shot setting. Prompter outperforms previous methods on both the MultiWOZ and SGD benchmarks. Our analyses find that, when generating prefixes, Prompter utilizes not only the semantics of slot descriptions but also the frequency with which slots appear together in conversation. Moreover, Prompter's gains come from its improved ability, compared to baselines, to distinguish 'none'-valued dialogue slots.

Method:

(a) Slot Prompt Generation: information from the slot description is fused with a learned global prompt to generate slot-specific prompts. (b) Prefix Generation: each slot prompt is fed through two linear layers and an activation function to generate per-layer key and value prefixes. (c) Finally, these prefixes are concatenated to the keys and values at every layer of the T5 encoder.
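
Below is a minimal PyTorch sketch of steps (a)-(c), for orientation only. The module and parameter names (PrefixGenerator, global_prompt, prefix_mlp, bottleneck), the additive fusion, the Tanh activation, and all tensor shapes are assumptions of this sketch, not the repository's actual code.

import torch
import torch.nn as nn

class PrefixGenerator(nn.Module):
    def __init__(self, d_model, n_layers, n_heads, prefix_len, bottleneck=256):
        super().__init__()
        self.n_layers, self.n_heads = n_layers, n_heads
        self.prefix_len, self.head_dim = prefix_len, d_model // n_heads
        # (a) a learned global prompt, fused with slot-description embeddings
        self.global_prompt = nn.Parameter(torch.randn(prefix_len, d_model))
        # (b) two linear layers with an activation in between, emitting
        # per-layer key and value prefixes (2 * n_layers * d_model features)
        self.prefix_mlp = nn.Sequential(
            nn.Linear(d_model, bottleneck),
            nn.Tanh(),
            nn.Linear(bottleneck, 2 * n_layers * d_model),
        )

    def forward(self, slot_desc_emb):
        # slot_desc_emb: (prefix_len, d_model), e.g. the encoded slot description
        slot_prompt = self.global_prompt + slot_desc_emb   # (a) fuse (assumed additive)
        kv = self.prefix_mlp(slot_prompt)                  # (b) project
        kv = kv.view(self.prefix_len, 2, self.n_layers, self.n_heads, self.head_dim)
        return kv.permute(2, 1, 3, 0, 4)  # (n_layers, 2, n_heads, prefix_len, head_dim)

# (c) inside each encoder self-attention layer, with prefix_k and prefix_v
# taken from the tensor above:
#   K = torch.cat([prefix_k, K], dim=-2)
#   V = torch.cat([prefix_v, V], dim=-2)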

Most of the data generation and training code is adapted from the T5DST repository: https://github.com/facebookresearch/Zero-Shot-DST/tree/main/T5DST

Setting up the environment

Create a new conda environment with Python >= 3.8.12

conda create -n prompter python==3.8.12

Change directory to the main folder and install the packages

cd Prompter
pip install -r requirements.txt
pip install -e .

Experiments

Dataset Generation

MultiWOZ

python create_mwoz.py

Use create_mwoz_2_1.py if you want to run with MultiWOZ 2.1.
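
For reference, T5DST-style zero-shot DST data pairs the dialogue history with a slot description and asks the model to generate the value (or "none"). A rough illustration follows; the template, separator token, and field values are made up and may differ from what these scripts actually emit.

# Hypothetical T5DST-style example (illustrative only)
dialogue = "user: I need a cheap hotel in the north. system: Any other requirements?"
slot_description = "hotel-pricerange: the price budget of the hotel"
source = f"{dialogue} [slot] {slot_description}"
target = "cheap"  # the model generates "none" when the slot is unmentioned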

SGD

./download_sgd.sh

Zero-shot cross-domain Experiments

MultiWOZ

./run_mwoz.sh "domain"
  • domain: the left-out domain; choose one from [hotel, train, attraction, restaurant, taxi]

Note: For MultiWOZ, 5 separate runs are needed to replicate the experiments in the paper, leaving out a different domain each time.
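
For example, a small Python driver can launch all five leave-one-out runs sequentially; this is just a convenience wrapper around run_mwoz.sh, not part of the repo.

import subprocess

# Run the MultiWOZ leave-one-out experiment once per domain.
for domain in ["hotel", "train", "attraction", "restaurant", "taxi"]:
    subprocess.run(["./run_mwoz.sh", domain], check=True)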

SGD

./run_sgd.sh

Evaluation

The training script automatically evaluates the results and places the relevant files under the results folder of the run (src/save/path_to_the_run/results/...)

For the SGD dataset, you can also use the provided official scripts. Set the PRED variable inside sgd_eval.sh to the run directory under 'src/save/...':

cd scripts
./sgd_eval.sh

This will generate a new file result.json in the run directory.
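
For orientation, the headline DST metric is joint goal accuracy (JGA): a turn counts as correct only if every predicted slot value matches the gold state. A minimal sketch is below; the repo's evaluation scripts and the result.json schema may organize this differently.

def joint_goal_accuracy(predictions, golds):
    # predictions, golds: lists of per-turn {slot: value} dicts
    hits = sum(pred == gold for pred, gold in zip(predictions, golds))
    return hits / len(golds)

# Example: the second turn misses one slot value, so JGA = 0.5
print(joint_goal_accuracy(
    [{"hotel-area": "north"}, {"hotel-area": "north", "hotel-pricerange": "cheap"}],
    [{"hotel-area": "north"}, {"hotel-area": "north", "hotel-pricerange": "moderate"}],
))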

Generating Heatmaps for models trained on MultiWOZ

cd scripts
./generate_heat_map.sh "domain" "ckpt_file"
  • domain: the left-out domain

  • ckpt_file: Path to the .ckpt file in the model directory

The resulting figure will be saved under figures/heatmap.
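
If you want to render a similar figure from your own slot-level scores, here is a minimal matplotlib sketch. It is illustrative only: generate_heat_map.sh drives the repo's own plotting code, and the slot names and score matrix below are placeholders.

import numpy as np
import matplotlib.pyplot as plt

slots = ["hotel-area", "hotel-pricerange", "train-day"]  # placeholder slot names
scores = np.random.rand(len(slots), len(slots))          # placeholder score matrix

fig, ax = plt.subplots()
im = ax.imshow(scores, cmap="viridis")
ax.set_xticks(range(len(slots)), labels=slots, rotation=45, ha="right")
ax.set_yticks(range(len(slots)), labels=slots)
fig.colorbar(im)
fig.tight_layout()
fig.savefig("heatmap_example.png")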
