Rehearsal-free Continual Language Learning via Efficient Parameter Isolation

In this repository, we provide the code to reproduce the results presented in our paper, as well as the necessary steps to use it. If you wish to reference our paper, please cite:

Zhicheng Wang, Yufang Liu, Tao Ji, Xiaoling Wang, Yuanbin Wu, Congcong Jiang, Ye Chao, Zhencong Han, Ling Wang, Xu Shao and Wenqiu Zeng: Rehearsal-free Continual Language Learning via Efficient Parameter Isolation, ACL 2023

Download and pre-processing data

We used the same dataset as LAMOL and IDBR. You can download the raw dataset from this link.

After downloading, please put the data in ./data folder, and then run the following command:

cd ./data
tar -xvzf LAMOL.tar.gz
python ./preprocess.py

Regarding the other datasets mentioned in our paper, you can specify them in the command line arguments, and they will be automatically downloaded from Hugging Face.

Data structure after processing the data:

./data
├── ag
│   ├── test.csv
│   └── train.csv
├── amazon
│   ├── test.csv
│   └── train.csv
├── dbpedia
│   ├── test.csv
│   └── train.csv
├── preprocess.py
├── yahoo
│   ├── test.csv
│   └── train.csv
└── yelp
    ├── test.csv
    └── train.csv

Requirements

Here is the hardware and software that we used:

Ubuntu: 18.04
GPU: RTX 3090 24GB

python: 3.7.11
pytorch: 1.11.0
transformers: 4.23.1
datasets: 2.1.0

We recommend that you create a new conda or python environment and install the required packages manually. We also provide a list of all packages and their versions in ./package-list.txt for checking environment issues.

Fine-tuning

We provide a run script ./run.py that you can use by running the following command:

python run.py \
    --dataset 5ds \
    --method epi \
    --query_mode mahalanobis \
    --prompt_mode prefix \
    --pre_seq_len 16 \
    --n_per_class 2000 \
    --batch_size 32 \
    --lr 0.03 \
    --epochs 5 \
    --gpu_id 0 \
    --seed 42

Since prefix-tuning is affected by your environment and machine, we highly recommend that you run the hyperparameter search script in ./scripts for best performance:

bash ./scripts/5ds.sh

You can modify the corresponding parameters in the script according to your needs, such as base model, task order, training epoch.

For more information on the available arguments, please refer to ./utils/assets.py.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Rehearsal-free Continual Language Learning via Efficient Parameter Isolation

Download and pre-processing data

Requirements

Fine-tuning

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
data		data
model		model
scripts		scripts
trainers		trainers
utils		utils
LICENSE		LICENSE
README.md		README.md
package-list.txt		package-list.txt
run.py		run.py

License

Dicer-Zz/EPI

Folders and files

Latest commit

History

Repository files navigation

Rehearsal-free Continual Language Learning via Efficient Parameter Isolation

Download and pre-processing data

Requirements

Fine-tuning

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages