This repository contains the code for the paper "Forget Demonstrations, Focus on Learning from Textual Instructions".
We use a pointer network to select several critical sentences from the task definition, and then apply an additional training objective (i.e., a ranking loss) to train the text-to-text language model.
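To make the objective concrete, here is a minimal sketch of a margin-based ranking loss combined with the usual cross-entropy term. The function and variable names, the margin, and the weighting factor are illustrative assumptions, not the exact implementation from the paper:

```python
import torch
import torch.nn.functional as F

def ranking_loss(pos_scores: torch.Tensor, neg_scores: torch.Tensor, margin: float = 0.1) -> torch.Tensor:
    """Hinge-style ranking loss: sentences the pointer network picks as
    critical (pos_scores) should score higher than the remaining ones."""
    # Pairwise hinge: max(0, margin - (pos - neg)), averaged over all pairs.
    diff = pos_scores.unsqueeze(1) - neg_scores.unsqueeze(0)
    return F.relu(margin - diff).mean()

# Illustrative combination with the standard seq2seq loss:
# total_loss = ce_loss + rank_weight * ranking_loss(pos_scores, neg_scores)
```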
The main system requirements:
- Python == 3.8.0
- Pytorch == 1.12.1
- Transformers == 4.18.0
- CUDA == 11.3
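A quick way to confirm your environment matches these versions (a small sanity-check snippet, not part of the repository):

```python
import sys
import torch
import transformers

print("Python      :", sys.version.split()[0])   # expect 3.8.x
print("PyTorch     :", torch.__version__)        # expect 1.12.1
print("Transformers:", transformers.__version__) # expect 4.18.0
print("CUDA        :", torch.version.cuda)       # expect 11.3
```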
Please run the following script to set up the conda environment:
```bash
sh setup_env.sh
```
You can then run `conda activate pick_rank` to activate the environment.
We use Super-NaturalInstructions for the experiments. Please download the dataset by running:
```bash
git clone git@github.com:allenai/natural-instructions.git data
```
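Each task in the cloned repository is a single JSON file. Assuming the standard Super-NaturalInstructions layout (the file name below is one example task, and the field names should be double-checked against your checkout), you can inspect a task like this:

```python
import json

# Any task file from the cloned repository will do; this name is an example.
with open("data/tasks/task001_quoref_question_generation.json") as f:
    task = json.load(f)

print(task["Definition"])              # the textual instruction (a list of strings)
print(len(task["Instances"]))          # number of instances in this task
print(task["Instances"][0]["input"])   # one example input
print(task["Instances"][0]["output"])  # its reference output(s), a list
```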
Since there is no official development set in Super-NaturalInstructions, we randomly select 100 tasks from the "excluded" set as the development set, with at most 100 instances per task, to tune the hyper-parameters. Please use the following script to process and split the data:
```bash
sh setup_data.sh
```
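The dev-split construction described above roughly corresponds to the following logic. This is a sketch only: the seed, the split file path, and the truncation strategy are assumptions, and the actual script may differ (e.g., by sampling instances randomly rather than taking the first 100):

```python
import json
import random
from pathlib import Path

random.seed(42)  # illustrative seed

# "Excluded" task list from the official splits (path assumed from the repo layout).
with open("data/splits/default/excluded_tasks.txt") as f:
    excluded = [line.strip() for line in f if line.strip()]

dev_tasks = random.sample(excluded, 100)          # 100 random dev tasks
for name in dev_tasks:
    task = json.loads((Path("data/tasks") / f"{name}.json").read_text())
    task["Instances"] = task["Instances"][:100]   # cap at 100 instances per task
```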
The data split information can be found in `data/splits/add_dev`, and the processed data in `data/tasks/def_segmentation`. You can use the following script to print the data statistics:
```bash
python data_statistics.py
```
We use Hugging Face T5-base for all our experiments and analysis. You can use the following script to train the model:
```bash
sh scripts/run.sh
```
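For reference, a single supervised step on T5-base with Hugging Face Transformers looks like the following. This is a generic fine-tuning sketch, not the repository's full training loop (which additionally runs the pointer network and the ranking loss):

```python
from transformers import T5ForConditionalGeneration, T5Tokenizer

tokenizer = T5Tokenizer.from_pretrained("t5-base")
model = T5ForConditionalGeneration.from_pretrained("t5-base")

# Instruction-tuning style input: task definition followed by the instance input.
inputs = tokenizer("Definition: ... Now complete: ...", return_tensors="pt")
labels = tokenizer("expected output", return_tensors="pt").input_ids

loss = model(**inputs, labels=labels).loss  # standard cross-entropy loss
loss.backward()                             # gradients for an optimizer step
```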
The results can be found in `output`, including the saved model files, the predictions, and all intermediate results. You can also use the following script to quickly read and print the evaluation scores on the test set:
```bash
python read_results.py
```
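If you prefer to inspect the scores yourself, the metrics are written as JSON under `output`. The exact file name below is an assumption; check the directory for the actual one:

```python
import json

# File name is illustrative; look inside output/ for the real metrics file.
with open("output/predict_results.json") as f:
    results = json.load(f)

for metric, score in sorted(results.items()):
    print(f"{metric}: {score}")
```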
Please cite the paper if you use any scores or scripts from this repository:
```bibtex
@article{lou2023forget,
  title={Forget demonstrations, focus on learning from textual instructions},
  author={Lou, Renze and Yin, Wenpeng},
  journal={arXiv preprint arXiv:2308.03795},
  year={2023}
}
```