Seyun Bae, Seokhan Lee, Eunho Yang
CURaTE is the first unlearning method for large language models that enables continual unlearning in real time while maintaining near-perfect preservation of existing knowledge.
Create a fresh conda environment and install the required packages from `requirements.txt`.
Run `download_model.py` to download the model into a local directory.
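The setup steps above might look like the following (the environment name `curate` and the Python version are assumptions; adjust to your system):

```shell
# Create and activate a fresh environment (name and version are placeholders).
conda create -n curate python=3.10 -y
conda activate curate

# Install the repository's dependencies.
pip install -r requirements.txt

# Download the model into a local directory.
python download_model.py
```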
To train the sentence embedding model used for unlearning, run `python train_sentemb.py`.
After training the unlearning sentence embedding model, use the scripts in `DB_files/` to prepare the forget-set embedding database for the continual-unlearning evaluation.
Given the trained unlearning sentence embedder, these scripts construct the cumulative forget set for each stage of the continual setting and encode the corresponding samples into the embedding space. They also precompute cosine-similarity mappings between the forget-set embeddings and samples from the retain set or other utility-evaluation datasets.
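The similarity precomputation can be sketched roughly as follows; the function name, the mapping format, and the random toy embeddings are illustrative, not the repository's actual API:

```python
import numpy as np

def cosine_similarity_map(forget_emb, retain_emb):
    """For each forget-set embedding, record its cosine similarity to every
    retain-set embedding (illustrative stand-in for the precomputed mapping)."""
    # L2-normalize each row so the dot product equals cosine similarity.
    f = forget_emb / np.linalg.norm(forget_emb, axis=1, keepdims=True)
    r = retain_emb / np.linalg.norm(retain_emb, axis=1, keepdims=True)
    return f @ r.T  # shape: (num_forget, num_retain)

# Toy example with random embeddings in place of sentence-embedder outputs.
rng = np.random.default_rng(0)
sims = cosine_similarity_map(rng.normal(size=(4, 8)), rng.normal(size=(6, 8)))
print(sims.shape)  # one similarity row per forget-set sample
```

In practice the rows would come from the trained sentence embedder, and the resulting matrix (or its top-k entries) would be stored in the database files.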
To evaluate CURaTE on the TOFU dataset:
- Update the path in `get_available_cache_dir()` inside `TOFU/evaluate_tofu_sentemb.py`.
- Run the evaluation: `python TOFU/evaluate_tofu_sentemb.py`
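The repository's `get_available_cache_dir()` presumably returns a local directory for cached models and datasets; a minimal sketch of what such a helper might look like (the candidate paths are placeholders to be edited for your machine):

```python
import os

def get_available_cache_dir():
    """Return the first existing, writable cache directory from a list of
    candidates (paths below are placeholders, not the repo's defaults)."""
    candidates = [
        "/data/cache",                          # e.g. a shared scratch disk
        os.path.expanduser("~/.cache/curate"),  # user-local fallback
    ]
    for path in candidates:
        if os.path.isdir(path) and os.access(path, os.W_OK):
            return path
    # If nothing exists yet, create the user-local directory.
    fallback = os.path.expanduser("~/.cache/curate")
    os.makedirs(fallback, exist_ok=True)
    return fallback

print(get_available_cache_dir())
```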
To evaluate CURaTE on the TruthfulQA benchmark:
- Update the path in `get_available_cache_dir()` inside `truthfulQA/truthfulQA_evaluation_sentemb.py`.
- Run the TruthfulQA evaluation: `python truthfulQA/truthfulQA_evaluation_sentemb.py`
- Run the CommonsenseQA evaluation to measure general knowledge preservation: `python commonsense/evaluation_commonsenseQA_sentemb.py`
To evaluate CURaTE on the RETURN dataset:
- Update the path in `get_available_cache_dir()` inside `RETURN/evaluate_return_sentemb.py`.
- Run the evaluation: `python RETURN/evaluate_return_sentemb.py`
To evaluate CURaTE on the ScienceQA dataset:
- Update the path in `get_available_cache_dir()` inside `ScienceQA/evaluate_return_sentemb.py`.
- Run the evaluation: `python ScienceQA/evaluate_return_sentemb.py`
To reproduce the ablation experiments:
- Train each baseline model on each ablation dataset: `python train_sentemb.py`
- Run the scripts in `DB_files/` to generate mapping files for each ablation setting.
- Run the evaluation scripts with `no_gen.py` to obtain Precision, Recall, and F1 scores for each ablation.
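The Precision, Recall, and F1 metrics reported for the ablations can be computed as in this standalone sketch (independent of `no_gen.py`'s actual implementation; the label convention is an assumption):

```python
def precision_recall_f1(preds, labels):
    """Compute Precision, Recall, and F1 for binary predictions
    (here 1 = sample flagged as forget-set, 0 = retained -- an assumed convention)."""
    tp = sum(p == 1 and y == 1 for p, y in zip(preds, labels))
    fp = sum(p == 1 and y == 0 for p, y in zip(preds, labels))
    fn = sum(p == 0 and y == 1 for p, y in zip(preds, labels))
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if precision + recall else 0.0)
    return precision, recall, f1

# Toy example: five samples, two correctly flagged, one false positive,
# one missed forget-set sample.
print(precision_recall_f1([1, 1, 0, 1, 0], [1, 0, 0, 1, 1]))
```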
