Self-Synthesized Rehearsal (SSR)

🎉 Welcome to the repository for "Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal" (ACL 2024, 📃 arXiv:2403.01244).

Codebase Structure

This codebase is built on top of the LLaMA-Factory framework.

To get started with SSR, refer to the following directory tree of the codebase:

├── custom
│   ├── alpaca_eval
│   ├── icl_gen                     # Instance synthesis
│   └── niv2-c012                   # SuperNI data preprocessing
├── data                            # datasets
├── mmlu_test
├── saves
└── src
    ├── llmtuner
    └── scripts-ni-c012             # Examples of run scripts

Installation

pip install -r requirements.txt

Pipeline

Step 1: In-Context Learning Based Instance Synthesis

  1. Use custom/icl_gen/complete_param_nic010_cate.py for generation.
  2. Use custom/icl_gen/parser.py for postprocessing.
  3. Use custom/icl_gen/random_select.py or custom/icl_gen/kmeans_self.py for instance selection (recommended for efficiency); see the sketch after this list.
    • For KMeans-based instance selection, first use custom/niv2-c012/text2emb.py to obtain embeddings of the instance inputs.
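The scripts above implement this stage end to end. As a rough illustration of the underlying idea only (not the repository's exact code), the sketch below shows the two algorithmic pieces: packing a few real (instruction, output) pairs into a few-shot prompt so the base LLM synthesizes a new instance, and KMeans-based selection of a diverse subset over input embeddings. All names here (build_icl_prompt, kmeans_select, llm_generate) are hypothetical.

```python
# Illustrative sketch only; the authoritative logic lives in the scripts above.
import random

import numpy as np
from sklearn.cluster import KMeans


def build_icl_prompt(demonstrations, k=3):
    """Pack k real (instruction, output) pairs into a few-shot prompt; the
    trailing cue asks the base LLM to continue the pattern with a new pair."""
    demos = random.sample(demonstrations, k)
    shots = "\n\n".join(
        f"Instruction: {d['instruction']}\nOutput: {d['output']}" for d in demos
    )
    return shots + "\n\nInstruction:"


def kmeans_select(instances, embeddings, n_clusters=20):
    """Cluster instance-input embeddings (e.g. produced by text2emb.py) and
    keep the instance nearest each centroid, yielding a small diverse subset."""
    km = KMeans(n_clusters=n_clusters, random_state=0).fit(embeddings)
    selected = []
    for c in range(n_clusters):
        members = np.where(km.labels_ == c)[0]
        dists = np.linalg.norm(embeddings[members] - km.cluster_centers_[c], axis=1)
        selected.append(instances[members[np.argmin(dists)]])
    return selected
```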

Step 2: Synthetic Output Refinement

  1. Use custom/icl_gen/label_param.py to refine the synthetic outputs with the latest fine-tuned model (see the sketch below).
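In the paper, refinement means asking the latest fine-tuned LLM to regenerate the output for each synthetic input, so the rehearsal targets reflect abilities the current model has actually acquired. A minimal sketch under that reading (refine_outputs and llm_generate are hypothetical names; label_param.py is the authoritative implementation):

```python
def refine_outputs(llm_generate, synthetic_instances):
    """Re-label each synthetic input with the latest model's own prediction,
    keeping the input fixed and replacing the raw ICL-generated output."""
    return [
        {"instruction": inst["instruction"],
         "output": llm_generate(inst["instruction"])}
        for inst in synthetic_instances
    ]
```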

Step 3: Rehearsal with Selected Synthetic Instances

Run the script matching your setting (a sketch of the underlying data mixing follows this list):

  • Multi-task learning (MTL): src/scripts-ni-c012/lora/all/[model_name]/[model_name].lora.[all|all_5].3ep.bs32x1x1.bf16.sh
  • Single task (& Stage 1 in continual learning): src/scripts-ni-c012/lora/sing/[model_name]/[model_name].lora.single.3ep.bs32x1x1.bf16.sh
  • Non-rehearsal: src/scripts-ni-c012/lora/[cl|cl2|cl3]/[model_name]/[model_name].lora.[cl_queue|cl_queue2|cl_queue3].3ep.bs32x1x1.lr2e-04.bf16.sh
  • RandSel: src/scripts-ni-c012/lora/[cl|cl2|cl3]/[model_name]/[model_name].lora.[cl_queue|cl_queue2|cl_queue3]_rp.3ep.bs32x1x1.lr2e-04.bf16.sh
  • KMeansSel: src/scripts-ni-c012/lora/[cl|cl2|cl3]/[model_name]/[model_name].lora.[cl_queue|cl_queue2|cl_queue3]_km20_rp.3ep.bs32x1x1.lr2e-04.bf16.sh
  • SSR: src/scripts-ni-c012/lora/[cl|cl2|cl3]/[model_name]/[model_name].lora.[cl_queue|cl_queue2|cl_queue3]_iclgen_self.3ep.bs32x1x1.lr2e-04.bf16.sh
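These scripts all wrap the same recipe: at each continual-learning stage, the new task's training data is concatenated with the selected rehearsal instances (synthetic, for SSR) from earlier stages, and the model is fine-tuned with LoRA on the mixture. Below is a minimal sketch of just the data-mixing step, assuming instance lists stored as JSON files in the format LLaMA-Factory consumes; all file paths and the helper name are hypothetical.

```python
import json
from pathlib import Path


def build_rehearsal_mixture(new_task_file, rehearsal_files, out_file):
    """Concatenate the current stage's task data with rehearsal instances
    carried over from earlier stages into one training file."""
    mixture = json.loads(Path(new_task_file).read_text())
    for path in rehearsal_files:
        mixture.extend(json.loads(Path(path).read_text()))
    Path(out_file).write_text(json.dumps(mixture, indent=2, ensure_ascii=False))


# Example: stage 3 trains on its own data plus SSR instances for stages 1-2.
build_rehearsal_mixture(
    "data/task3_train.json",
    ["data/task1_ssr.json", "data/task2_ssr.json"],
    "data/stage3_mixture.json",
)
```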

Citation

If you find this useful in your research, please consider citing:

@misc{huang2024mitigating,
    title={Mitigating Catastrophic Forgetting in Large Language Models with Self-Synthesized Rehearsal}, 
    author={Jianheng Huang and Leyang Cui and Ante Wang and Chengyi Yang and Xinting Liao and Linfeng Song and Junfeng Yao and Jinsong Su},
    year={2024},
    eprint={2403.01244},
    archivePrefix={arXiv},
    primaryClass={cs.CL}
}
