Avoiding Inference Heuristics in Prompt-based Finetuning

Quick links

Overview
Requirements
Prepare the data
Run
- Quick start

Overview

This repository contains the implementation needed to reproduce the results reported in our submitted paper. The experiments include:

Standard non-prompt finetuning on MNLI, QQP, and SNLI.
Few-shot finetuning using prompts on MNLI, QQP, and SNLI. The model trained on each dataset is evaluated against HANS, PAWS, and the Scramble Test, respectively.
Few-shot finetuning with regularized objectives, such as an L2 loss between the updated and the pretrained weights, and with partial layer freezing.
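
The regularized objectives in the last item can be illustrated numerically. The following is a minimal, framework-free sketch, not the code used in this repository: it assumes parameters are stored as a dict mapping names to lists of floats, and the function names (l2_to_pretrained, regularized_loss, trainable_names), the weight reg_weight, and the layer-name prefixes are hypothetical choices made for illustration.

```python
def l2_to_pretrained(current, pretrained):
    """Sum of squared differences between current and pretrained weights."""
    total = 0.0
    for name, values in current.items():
        for w, w0 in zip(values, pretrained[name]):
            total += (w - w0) ** 2
    return total


def regularized_loss(task_loss, current, pretrained, reg_weight):
    """Task loss plus a weighted L2 penalty pulling weights toward their pretrained values."""
    return task_loss + reg_weight * l2_to_pretrained(current, pretrained)


def trainable_names(names, frozen_prefixes):
    """Partial freezing: keep only parameters whose name matches no frozen prefix."""
    return [n for n in names if not n.startswith(tuple(frozen_prefixes))]


# Toy example with two "layers" (the names are illustrative only).
pretrained = {"layer.0.weight": [0.5, -1.0], "layer.1.weight": [2.0]}
current = {"layer.0.weight": [0.5, -0.5], "layer.1.weight": [3.0]}

penalty = l2_to_pretrained(current, pretrained)  # 0.0 + 0.25 + 1.0
loss = regularized_loss(1.0, current, pretrained, reg_weight=0.1)
updated = trainable_names(current, frozen_prefixes=["layer.0."])
```

In this toy setting the penalty grows as the finetuned weights drift from their pretrained values, while freezing removes the parameters under the given prefixes from the set that gets updated.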

This implementation is based on LM-BFF by Gao et al. (2020).

Requirements

To run the code, please install all the dependencies with the following command:

pip install -r requirements.txt

Prepare the data

The original datasets (MNLI, SNLI, QQP) can be downloaded here. Download them and extract the files to ./data/original, or run the following commands:

cd data
bash download_dataset.sh

The challenge datasets are included in this repository under ./data.

Then use the following command (in the root directory) to generate the few-shot data we need:

python tools/generate_k_shot_data.py

See tools/generate_k_shot_data.py for more options. For the results in the paper, we use the default options: K=16 and five different seeds (13, 21, 42, 87, and 100). The few-shot data is written to data/k-shot. Each dataset's directory contains folders named $K-$SEED, one per sampled subset.
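
To make the sampling idea concrete, here is a minimal sketch of seeded K-shot sampling, assuming K examples are drawn per label. It is not the logic of tools/generate_k_shot_data.py, which remains the reference for the actual behavior; the function name sample_k_shot, the (text, label) example format, and the toy data are assumptions for illustration.

```python
import random
from collections import defaultdict


def sample_k_shot(examples, k, seed):
    """Pick k examples per label, reproducibly for a given seed."""
    by_label = defaultdict(list)
    for text, label in examples:
        by_label[label].append((text, label))

    rng = random.Random(seed)  # local RNG, so the global random state is untouched
    sampled = []
    for label in sorted(by_label):
        sampled.extend(rng.sample(by_label[label], k))
    return sampled


# Toy data: 40 examples for each of two labels.
toy = [(f"sentence {i}", i % 2) for i in range(80)]

K = 16
for seed in (13, 21, 42, 87, 100):
    subset = sample_k_shot(toy, K, seed)
    folder = f"{K}-{seed}"  # mirrors the $K-$SEED folder naming described above
    print(folder, len(subset))
```

Using a separate random.Random instance per seed keeps each subset reproducible regardless of what other code does with the global random state.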

NOTE: During training, the model generates and loads cache files in the data folder. If your data has changed, make sure to delete all the cache files (those whose names start with "cache").
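
Cleaning the cache can be scripted. The sketch below assumes that cache files live somewhere under the data folder and that their file names start with "cache", as the note says; the helper names find_cache_files and clean_cache and the dry-run default are illustrative choices, not part of this repository.

```python
from pathlib import Path


def find_cache_files(data_dir):
    """Return all files under data_dir whose file name starts with 'cache'."""
    root = Path(data_dir)
    return sorted(p for p in root.rglob("cache*") if p.is_file())


def clean_cache(data_dir, dry_run=True):
    """Delete cache files; with dry_run=True, only report what would be deleted."""
    files = find_cache_files(data_dir)
    for path in files:
        if dry_run:
            print(f"would delete {path}")
        else:
            path.unlink()
    return files
```

Calling clean_cache("data") from the root directory only lists the matching files; passing dry_run=False actually removes them, so it is worth checking the dry-run output first.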

Run

Quick start

The code is built on transformers. We did all our experiments with version 3.4.0.

Run the following bash file to reproduce the experiments:

bash run_experiment.sh

The script contains all the commands for configuring the experiments, e.g., the model (roberta-base or roberta-large), dataset, learning rate, and number of epochs.

Repository: UKPLab/emnlp2021-prompt-ft-heuristics
