IMPERATIVE

This repository contains the code of the paper "Towards Imperceptible Backdoor Attack in Self-supervised Learning", an imperceptible and effective backdoor attack against self-supervised models.

Required python packages

Our code is tested under the following environment: NVIDIA GeForce RTX 3090, Ubuntu 22.04, cuda 11.8, Python 3.8.5, torch 2.0.1, torchvision 0.15.2, numpy 1.23.4, pandas 2.0.3, pillow 10.1.0, and tqdm 4.65.0.

Pretraining image encoders

The file pretraining_encoder.py is used to pre-train an image encoder.

To pre-train an image encoder on CIFAR10 or STL10, you could first download the data from the following link data (put the data folder under IMPERATIVE). Then, you could run the following script to pre-train image encoders on CIFAR10 and STL10:

python3 scripts/run_pretraining_encoder.py

It may take up more than 10 hours and 15G to pretrain the encoder on a NVIDIA GeForce RTX 3090. You can also download pre-trained image encoders on CIFAR10 or STL10 from the following link data and put them under output folder.

Pretraining backdoor injectors

The file optimize_filter/run_pretrain.sh is a script used to pre-train a backdoor encoder.

You could run the following script in the optimize_filter directory to pre-train a backdoor encoder on CIFAR10 and STL10:

python3 scripts/run_pretrain.sh

It may take up more than 5 hours and 10G to pretrain the backdoor injectors on a NVIDIA GeForce RTX 3090.

The file imperative.py implements our IMPERATIVE.

You can use the following example script to optimize a imperative backdoor trigger and embed it to an image encoder, where the shadow dataset is CIFAR10 and the reference inputs are images of a truck, digit one, and priority traffic sign:

python3 scripts/run_imperative.py

It may take up more than 10 hours and 10G to pretrain the backdoor injectors on a NVIDIA GeForce RTX 3090.

Training downstream classifiers

The file training_downstream_classifier.py can be used to train a downstream classifier on a downstream task using an image encoder. Here are some example scripts:

python3 scripts/run_cifar10_training_downstream_classifier.py
python3 scripts/run_imagenet_training_downstream_classifier.py

It may take up more than 1 hours and 1G to pretrain the backdoor injectors on a NVIDIA GeForce RTX 3090.

Experimental results

This table shows the experimental results when the pre-training dataset is CIFAR10 and STL10.

Pre-training Dataset	Downstream Dataset	CA	WaNet BA↑	WaNet ASR↑	CTRL BA↑	CTRL ASR↑	Ins-kelvin BA↑	Ins-kelvin ASR↑	Ins-xpro2 BA↑	Ins-xpro2 ASR↑	Ours BA↑	Ours ASR↑
STL10	CIFAR10	86.77	84.43	10.28	87.19	8.72	86.75	18.63	86.85	16.83	87.11	99.58
STL10	GTSRB	76.12	74.45	5.23	77.57	8.17	76.49	72.95	76.71	14.02	75.82	97.97
STL10	SVHN	55.35	58.29	16.83	54.29	3.32	56.67	38.03	58.42	18.68	58.62	99.76
CIFAR10	STL10	76.14	72.73	9.78	75.73	16.85	74.89	1.16	74.11	5.91	74.48	95.00
CIFAR10	GTSRB	81.84	75.85	5.46	79.94	97.95	78.56	2.50	75.08	42.40	79.15	98.73
CIFAR10	SVHN	61.52	54.79	17.99	66.33	40.91	68.49	22.13	68.95	30.91	63.67	98.79

This table shows the results when applying IMPERATIVE to image encoder pre-trained on ImageNet:

Downstream Dataset	CA	ISSBA BA↑	ISSBA ASR↑	Ours BA↑	Ours ASR↑
STL10	95.68	92.58	9.97	93.48	100.00
GTSRB	80.32	66.29	5.10	82.84	96.00
SVHN	74.77	67.67	18.03	75.40	99.99

We refer to the following code in our implementation: https://github.com/google-research/simclr, https://github.com/jinyuan-jia/BadEncoder, https://github.com/leftthomas/SimCLR

Citation

If you use our code or data in this repo or find our work helpful, please consider giving a citation:

@misc{zhang2024imperceptible,
      title={Towards Imperceptible Backdoor Attack in Self-supervised Learning}, 
      author={Hanrong Zhang and Zhenting Wang and Tingxu Han and Mingyu Jin and Chenlu Zhan and Mengnan Du and Hongwei Wang and Shiqing Ma},
      year={2024},
      eprint={2405.14672},
      archivePrefix={arXiv},
      primaryClass={id='cs.CV' full_name='Computer Vision and Pattern Recognition' is_active=True alt_name=None in_archive='cs' is_general=False description='Covers image processing, computer vision, pattern recognition, and scene understanding. Roughly includes material in ACM Subject Classes I.2.10, I.4, and I.5.'}
}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
datasets		datasets
evaluation		evaluation
log		log
models		models
optimize_filter		optimize_filter
output/stl10		output/stl10
pytorch_ssim		pytorch_ssim
reference		reference
scripts		scripts
trigger		trigger
.gitignore		.gitignore
README.md		README.md
imperative.png		imperative.png
imperative.py		imperative.py
loss.py		loss.py
pretraining_encoder.py		pretraining_encoder.py
requirements.txt		requirements.txt
training_downstream_classifier.py		training_downstream_classifier.py
util.py		util.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

IMPERATIVE - Towards Imperceptible Backdoor Attack in Self-supervised Learning

Required python packages

Pretraining image encoders

Pretraining backdoor injectors

IMPERATIVE

Training downstream classifiers

Experimental results

Citation

About

Releases

Packages

Languages

Zhang-Henry/INACTIVE

Folders and files

Latest commit

History

Repository files navigation

IMPERATIVE - Towards Imperceptible Backdoor Attack in Self-supervised Learning

Required python packages

Pretraining image encoders

Pretraining backdoor injectors

IMPERATIVE

Training downstream classifiers

Experimental results

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages