LLMCL

Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning

Overview

LLMCL is a repository built on the Hugging Face Transformers library for assessing the continual learning capability of large language models. With it, users can customize datasets, specify models, and experiment with existing classical continual learning methods.

Key Features

Continual Learning Methods: The repository includes several classical continual learning methods that users can reference and run.
Model Customization: You can specify the model you want to use, and the repository will automatically download the corresponding model.

Quick Start

1. Install dependencies

conda create -n llmcl python=3.10
conda activate llmcl
pip install -r requirements.txt

2. Start training

./scripts/train_seq.sh

3. Inference

./scripts/infer_seq.sh

4. Customize

You can easily customize scripts for your own use:

Ensure your dataset is organized in JSON format with prompt and answer as keys.
Save the dataset file to <DATA_PATH>/<DATASET_NAME>/<SPLIT>.json
For more details, refer to the get_dataset.py file.
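
As an illustration of the layout described above, here is a minimal sketch that checks each example for the prompt and answer keys and writes it to <DATA_PATH>/<DATASET_NAME>/<SPLIT>.json. The helper name write_split and the sample records are hypothetical, and the sketch assumes the file holds a single JSON array of examples; check get_dataset.py for the exact layout the loader expects.

```python
import json
from pathlib import Path

# Keys named in this README; everything else here is an illustrative assumption.
REQUIRED_KEYS = ("prompt", "answer")


def write_split(data_path, dataset_name, split, records):
    """Validate records and write them to <data_path>/<dataset_name>/<split>.json.

    Hypothetical helper, not part of LLMCL. Assumes the file is one JSON array.
    """
    records = list(records)
    for i, record in enumerate(records):
        missing = [key for key in REQUIRED_KEYS if key not in record]
        if missing:
            raise ValueError(f"record {i} is missing keys: {missing}")

    out_file = Path(data_path) / dataset_name / f"{split}.json"
    # Create <data_path>/<dataset_name>/ if it does not exist yet.
    out_file.parent.mkdir(parents=True, exist_ok=True)
    with out_file.open("w", encoding="utf-8") as f:
        json.dump(records, f, ensure_ascii=False, indent=2)
    return out_file


if __name__ == "__main__":
    # Made-up sample data and dataset name, for demonstration only.
    examples = [
        {"prompt": "Translate to French: cat", "answer": "chat"},
        {"prompt": "What is 2 + 2?", "answer": "4"},
    ]
    print(write_split("./data_files", "my_dataset", "train", examples))
```

Validating before writing means a malformed example fails early with a clear message rather than partway through a training run.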

Reproduce

To reproduce our results:
1. Request access to the Llama 2 model, then download the TRACE benchmark, MedMCQA, and JEC-QA to the ./data_files folder.

2. Customize the training scripts for your setup and run them.

Citation

If you find this repository helpful, please consider citing our work.

@misc{ren2024analyzing,
      title={Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning}, 
      author={Weijieying Ren and Xinlong Li and Lei Wang and Tianxiang Zhao and Wei Qin},
      year={2024},
      eprint={2402.18865},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
