AMOS (Fairseq)

This directory contains the Fairseq version of the scripts for fine-tuning AMOS pretrained models on the GLUE and SQuAD benchmarks. The scripts are based on the Fairseq library.

Paper: Pretraining Text Encoders with Adversarial Mixture of Training Signal Generators

Requirements

The scripts require Python 3.6+ and PyTorch 1.5.0+. In addition, you need to install the codebase by running:

bash install.sh

Pretrained Models

We release one AMOS pretrained model (note: please follow the link here to download it; do not use the Hugging Face versions of the pretrained models, as they are not compatible with Fairseq), which corresponds to the base++ model described in the paper. You need to extract the model from the downloaded archive.
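
For example, assuming the checkpoint is distributed as a .tar.gz archive (the filename below is a placeholder; use the name of the file you actually downloaded), it can be extracted with:

# placeholder archive name; replace with the file you downloaded
tar -xzvf amos_base_plus_plus.tar.gz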

GLUE Fine-tuning

The General Language Understanding Evaluation (GLUE) benchmark is a collection of sentence- or sentence-pair language understanding tasks for evaluating and analyzing natural language understanding systems.

Download GLUE Data: You can download the GLUE data by running this script and unpacking it to a directory of your choice.
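
If the linked script is the commonly used download_glue_data.py (an assumption; follow the actual link above for the exact script), the download might look like:

# hypothetical invocation; adjust the script name and flags to match the linked script
python download_glue_data.py --data_dir glue_data --tasks all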

Process GLUE Data: Since Fairseq training takes binary input files, you first need to preprocess the GLUE data into binary files by running the following:

cd preprocess/glue
bash process.sh <glue_data_folder> <task_name> <dict_dir> <output>

where <glue_data_folder> is the path of the raw GLUE data; <task_name> is one of the following: {ALL, QQP, MNLI, QNLI, MRPC, RTE, STS-B, SST-2, CoLA} (use ALL to preprocess all GLUE tasks); <dict_dir> is the directory containing the two dictionary files sp.model and dict.txt, which can be downloaded here; <output> is the output directory for the processed GLUE data.
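
For example, to preprocess only MNLI (the paths below are placeholders for your own directories):

bash process.sh /path/to/glue_data MNLI /path/to/dict_dir /path/to/processed/glue_data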

Fine-Tuning: After preprocessing the GLUE data, you can run the run_glue.sh script for fine-tuning on each GLUE task. An example of using the script to fine-tune on MNLI is shown below:

TASK=MNLI
PRETRAINED_MODEL_PATH=/path/to/amos/model.pt
GLUE_DATA_DIR=/path/to/processed/glue_data
OUT_PATH=./glue_finetune/amos
BSZ=32
LR=2e-5
EPOCH=2
WARMUP=16
SEED=1

export CUDA_VISIBLE_DEVICES=0
bash run_glue.sh $TASK $PRETRAINED_MODEL_PATH $GLUE_DATA_DIR $OUT_PATH $EPOCH $WARMUP $BSZ $LR $SEED

Note: The WARMUP argument is the reciprocal of the warmup ratio (e.g., WARMUP=16 means that we are using a 6.25% warmup ratio).

Optimal Hyperparameters: The fine-tuning hyperparameters that led to the best dev-set performance in our experiments are shown below; an example command using them follows the table (results and optimal hyperparameters may differ slightly across runs due to differences in computation environments):

  • AMOS base++
              MNLI-m/mm   QQP    QNLI   SST-2  CoLA   RTE    MRPC   STS-B
    BSZ       32/32       32     32     32     32     32     32     16
    LR        1e-5/1e-5   2e-5   2e-5   1e-5   2e-5   4e-5   4e-5   4e-5
    EPOCH     3/2         5      3      5      10     5      10     10
    WARMUP    16/16       16     16     16     10     10     16     16
    Result    90.5/90.4   92.3   94.5   95.6   71.8   87.4   91.4   92.0
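
For instance, to fine-tune on RTE with the settings listed above, you can reuse the command from the MNLI example and change only the task-specific variables (the path variables and SEED stay as in that example):

TASK=RTE
BSZ=32
LR=4e-5
EPOCH=5
WARMUP=10

export CUDA_VISIBLE_DEVICES=0
bash run_glue.sh $TASK $PRETRAINED_MODEL_PATH $GLUE_DATA_DIR $OUT_PATH $EPOCH $WARMUP $BSZ $LR $SEED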

SQuAD 2.0 Fine-tuning

Stanford Question Answering Dataset (SQuAD) is a reading comprehension dataset, consisting of questions posed by crowdworkers on a set of Wikipedia articles, where the answer to every question is a segment of text, or span, from the corresponding reading passage, or the question might be unanswerable.

Download & Process SQuAD Data: Since Fairseq training takes binary input files, you first need to preprocess the SQuAD data into pickle files by running the following (the raw SQuAD data will be downloaded automatically):

cd preprocess/squad
bash process.sh <squad_data_folder> <dict_dir>

where <squad_data_folder> is the path where the raw and processed SQuAD data will be stored; <dict_dir> is the directory containing the two dictionary files sp.model and dict.txt, which can be downloaded here.
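
For example (placeholder paths):

bash process.sh /path/to/squad2_data /path/to/dict_dir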

Fine-Tuning: After preprocessing the SQuAD data, you can run the run_squad.sh script for fine-tuning on SQuAD 2.0. An example of using the script is shown below:

PRETRAINED_MODEL_PATH=/path/to/amos/model.pt
DATA_DIR=/path/to/processed/squad2_data
OUT_PATH=./squad2_finetune/amos
BSZ=16
LR=2e-5
EPOCH=3
WARMUP=10
SEED=1

export CUDA_VISIBLE_DEVICES=0
bash run_squad.sh $PRETRAINED_MODEL_PATH $DATA_DIR $OUT_PATH $EPOCH $WARMUP $BSZ $LR $SEED

Note: The WARMUP argument is the reciprocal of the warmup ratio (e.g., WARMUP=16 means that we are using a 6.25% warmup ratio).

Optimal Hyperparameters: The fine-tuning hyperparameters that led to the best dev-set performance in our experiments are shown below; a note on applying them follows the table (results and optimal hyperparameters may differ slightly across runs due to differences in computation environments):

  • AMOS base++
              EM     F1
    BSZ       16     16
    LR        2e-5   2e-5
    EPOCH     3      3
    WARMUP    16     10
    Result    85.0   87.9
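
Note that the example command above already matches the F1-optimal setting; to target the EM-optimal setting instead, only the warmup changes (the other variables stay as in that example):

WARMUP=16
bash run_squad.sh $PRETRAINED_MODEL_PATH $DATA_DIR $OUT_PATH $EPOCH $WARMUP $BSZ $LR $SEED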