This code has been tested with Python 3.11.5 and PyTorch 2.1.2 with CUDA 12.1 on Ubuntu 22.04. The required packages are listed in `environment.yaml`.
To set up a conda environment, follow these steps:

```bash
conda env create -f environment.yaml -n lmeraser
conda activate lmeraser
```
The structure of the repository is as follows:

```
.
├── arguments.py
├── data_utils
│   ├── datasets
│   │   ├── cifar
│   │   │   └── dataset.py
│   │   ├── gtsrb
│   │   │   └── dataset.py
│   │   ├── __init__.py
│   │   └── svhn
│   │       └── dataset.py
│   ├── loader.py
│   └── transforms.py
├── environment.yaml
├── eraser
│   ├── eraser.py
│   └── main.py
├── launch.py
├── LICENSE
├── models
│   ├── backbones
│   │   ├── backbone_swin.py
│   │   ├── backbone_vit.py
│   │   └── __init__.py
│   ├── builder.py
│   ├── checkpoints
│   │   ├── swin_base_patch4_window7_224_22k.pth
│   │   └── vit_base_p16_224_in22k.pth
│   ├── model_zoo
│   │   ├── __init__.py
│   │   ├── swin.py
│   │   └── vit.py
│   └── prompters.py
├── README.md
├── scripts
│   ├── run_distributed_gpu.sh
│   ├── run_one_gpu.sh
│   └── run_sbatch.sh
└── utils
    ├── distributed.py
    ├── file_io.py
    ├── logging.py
    ├── lr.py
    └── seed.py
```
Datasets are sourced from torchvision and downloaded automatically. For more details, please refer to the torchvision datasets documentation. The dataset directory can be set with `--base_dir` when running the code.
The pre-trained vision models can be downloaded from the provided links and should be placed in `models/checkpoints/`.
| Backbone | Pre-trained Objective | Pre-trained Dataset | Download Link | md5sum |
|---|---|---|---|---|
| ViT-B/16 | Supervised | ImageNet-22k | Download | - |
| Swin-B | Supervised | ImageNet-22k | Download | bf9cc1 |
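To verify a downloaded checkpoint against the checksum in the table, a small md5 helper can be used (a sketch; only a checksum prefix is listed, and the checkpoint path follows the repository tree above):

```python
import hashlib

def md5sum(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the md5 hex digest of a file, reading it in chunks."""
    h = hashlib.md5()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest()

# Example check against the table's checksum prefix:
# assert md5sum("models/checkpoints/swin_base_patch4_window7_224_22k.pth").startswith("bf9cc1")
```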
Three scripts are provided for training on a single GPU, on multiple GPUs, and on a Slurm cluster, respectively. These scripts are located in `scripts/`.
Key arguments are listed in `arguments.py`. The default settings are configured for training on CIFAR-100 with a ViT-22k backbone.
- `--erasing_method`: Select the erasing method (e.g., `lmeraser`, `random_part_tuning`).
- `--base_dir`: Directory to store datasets.
- `--test_dataset`: Dataset for training.
- `--pretrained_model`: Pre-trained model to use.
- `--one_prompt`: Use one or multiple prompts (default: `False`).
- `--num_gpus`: Number of GPUs to use.
- `--batch_size`: Total batch size.
- `--num_epochs`: Number of training epochs.
- `--lr`: Learning rate.
- `--distance_threshold`: Distance threshold for clustering.
This repository is partially based on VP, VPT, and DAM-VP. We thank the authors for their impressive work!
This code is released under the MIT License (see LICENSE file for details).