Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following (TAPP)

This is the official github repository for Task-Agnostic Prefix Prompt.

Overview of Task-Agnostic Prefix Prompt (TAPP)

Dataset

For evaluation dataset, we used SuperNatural-Instructions, which can be assessed in the official repo. Simply clone the repo under this directory and change the directory name to data. We run our main experiments on the 119 tests tasks designated by this official repository. We also randomly selected 100 instances (at maximum) per each of these tasks.

Setting

The following command will clone the project:

git clone https://github.com/seonghyeonye/TAPP.git

Before experimenting, you can make a virtual environment for the project.

conda create -n zeroshotlm python=3.8
conda activate zeroshotlm
pip install -r requirements.txt

Run

You can run inference with various prompting schemes by running files under scripts/gpt3 or scripts/decoder. For instance, if you want to test our TAPP on GPT-3 (davinci) for 119 Test tasks from SuperNatural-Instructions, you can run scripts/gpt3/run_ICIL.sh

Result of Task-Agnostic Prefix Prompt (TAPP)

Irrelevant TAPP

For Irrelevant TAPP, we randomly corrupt input sentences from the prompts with sentences from cc_news. To access this file, please visit our google drive. we have downloaded the huggingface cc_news dataset from here and parsed using en_core_sm from spacy and made it into a .txt file comprised of its sentences.

But under demo directory, we have all the irr_ICIL demos already extracted, so instead, you can run run_ICIL.sh(instaed of run_irr_ICIL.sh) after you change the --demo_path option.

Experiments

To replicate our experiments in the paper, please refer to the following resources:

Figure 1,4
- run files under scripts/gpt3 for GPT3 API based models.
  - change the model type name by changing for engine in "davinci" code snippet to for engine in {your model}
- run files under scripts/decoder for models run with GPU.
  - change the model type name by changing --model_name_or_path facebook/opt-30b code snippet to --model_name_or_path {your model})
Table 1
- run scripts/gpt3/run_ICIL.sh with --demo_path:
  - demos/analysis/distribution/irr_inst.json for Random Instruction,
  - demos/irr_ICIL/irr_ICIL_seed1.json for Random Input,
  - demos/analysis/distribution/irr_output.json for Random Output.
Figure 5
- run scripts/gpt3/run_ICIL.sh and
  - For (a): use demos in demos/analysis/classification
  - For (b): use demos in demos/analysis/shot.
  - For (c): use demos in demos/random_ordering_ICIL.
Table 2
- run scripts/gpt3/run_ICIL.sh with --demo_path demos/analysis/answer_choice_overlap/eight_overlap_mixed.json. for Overlap Setting.
Figure 6
- For (a), run scripts/gpt3/run_ICIL.sh and use demo in demos/analysis/chatgpt/chatgpt.json for Generated.
- For (b), run files under scripts/ablation. Note that this may take some time before starting the inference process.

OpenAI API

For experiments with GPT-3 models (curie, davinci, text-curie, text-davinci), you should have a OpenAI API key, which can be applied here. After acquiring the key, insert your API key on export openai_key="" for each of the script file in scripts directory.

Acknowledgements

Our code repository is mainly based on Tk-Instruct. Special thanks to the contributors of the repository!

Citations

@article{ye2023context,
  title={In-Context Instruction Learning},
  author={Ye, Seonghyeon and Hwang, Hyeonbin and Yang, Sohee and Yun, Hyeongu and Kim, Yireun and Seo, Minjoon},
  journal={arXiv preprint arXiv:2302.14691},
  year={2023}
}

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
demos		demos
preprocess		preprocess
scripts		scripts
src		src
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
TAPP.png		TAPP.png
clf_task_list.txt		clf_task_list.txt
requirements.txt		requirements.txt
result.png		result.png

License

seonghyeonye/TAPP

Folders and files

Latest commit

History

Repository files navigation

Investigating the Effectiveness of Task-Agnostic Prefix Prompt for Instruction Following (TAPP)

Dataset

Setting

Run

Irrelevant TAPP

Experiments

OpenAI API

Acknowledgements

Citations

About

Topics

Resources

License

Stars

Watchers

Forks

Languages