Annealing Self-Distillation Rectification Improves Adversarial Training

This repository is the official implementation of "Annealing Self-Distillation Rectification Improves Adversarial Training" by Yu-Yu Wu, Hung-Jui Wang and Shang-Tse Chen (National Taiwan University).

Requirements

The code has been implemented and tested with Python 3.9.14 To install requirements:

pip install -r requirements.txt

Directory Layout

.
|__ src # Source files
|__ data # Directory to put data
|     |__ cifar10
|     |__ cifar100
|     |__ tiny-imagenet-200
|
|__ config # Directory to store experiment configs
|__ *_experiment # Directory to store experiment checkpoints

Data

Before start training, manully create data/ directory and downloaded the required data to cifar10/, cifar100/ (cifar10 and cifar100), and tiny-imagenet-200 directory (tiny-imagenet-200).

If you wish to use additional DDPM sysnthetic data for experiment, please refer to the original paper Rebuffi et al., 2021. The synthetic data is public available here.

Training

To train the model(s) in the paper, run this command:

python train.py
	--description <experiment name and other description>
	--gin_config <absolute path to experiment configs>
	--cuda <cuda id>
	--num_workers <how many workers used in data loader>
	--batch_size <batch size>
	--aux_batch_size <synthetic data batch size, specify when using addtional synthetic data>
	--ema <boolean optional, evaluate on the ema teacher is specified>

The parameters used in the original paper can be found in the config/ directory

Evaluation

To do evaluation on the trained model, run:

python robust_eval.py 
	--dataset <the dataset used to evaluate>
	--model_type <model type for the checkpoint, should be one of resnet18, preact-resnet18, wideresnet-34-10>
	--model_path <path to the checkpoint>
	--activation_name <activation for the checkpoint, should be one of relu or swish>
	--attack_type <type of attack to evaluate, should be one of fgsm, pgd, autoattack, square>
	--epsilon <epsilon budget (in 255 range) used for evaluation, 8 as default setting>
	--steps <number of steps to attack for pgd evaluation. This argument is not needed for other attacks.>
	--cuda <cuda id>
	--batch_size <batch size>
	--ema <boolean optional, evaluate on the ema teacher is specified>

Pre-trained Models

You can download pretrained models here:

ADR checkpoints.

Results

If you want to test the model with WA, you should specify --ema for robust_eval.py.

Our model achieves the following performance with on:

CIFAR-10

Architecture	Method	AutoAttack	Standard Accuracy	Training config
ResNet-18	ADR	50.39%	82.41%	Link
ResNet-18	ADR + WA	50.86%	82.59%	Link
ResNet-18	ADR + WA + AWP	51.24%	83.26%	Link
WRN-34-10	ADR	53.24%	84.67%	Link
WRN-34-10	ADR + WA	54.13%	82.93%	Link
WRN-34-10	ADR + WA + AWP	55.22%	86.11%	Link

CIFAR-100

Architecture	Method	AutoAttack	Standard Accuracy	Training config
ResNet-18	ADR	26.89%	56.10%	Link
ResNet-18	ADR + WA	27.54%	58.30%	Link
ResNet-18	ADR + WA + AWP	28.52%	57.36%	Link
WRN-34-10	ADR	29.36%	59.76%	Link
WRN-34-10	ADR + WA	30.46%	57.42%	Link
WRN-34-10	ADR + WA + AWP	31.63%	62.21%	Link

TinyImageNet-200

Architecture	Method	AutoAttack	Standard Accuracy	Training config
ResNet-18	ADR	19.47%	48.19%	Link
ResNet-18	ADR + WA	20.21%	48.55%	Link
ResNet-18	ADR + WA + AWP	20.10%	48.27%	Link
WRN-34-10	ADR	21.85%	51.52%	Link
WRN-34-10	ADR + WA	23.03%	51.03%	Link
WRN-34-10	ADR + WA + AWP	23.35%	51.44%	Link

Citing this work

@misc{wu2023annealing,
      title={Annealing Self-Distillation Rectification Improves Adversarial Training}, 
      author={Yu-Yu Wu and Hung-Jui Wang and Shang-Tse Chen},
      year={2023},
      eprint={2305.12118},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
config		config
figures		figures
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Annealing Self-Distillation Rectification Improves Adversarial Training

Requirements

Directory Layout

Data

Training

Evaluation

Pre-trained Models

Results

CIFAR-10

CIFAR-100

TinyImageNet-200

Citing this work

About

Releases

Packages

Languages

License

yuyuwu5/ADR

Folders and files

Latest commit

History

Repository files navigation

Annealing Self-Distillation Rectification Improves Adversarial Training

Requirements

Directory Layout

Data

Training

Evaluation

Pre-trained Models

Results

CIFAR-10

CIFAR-100

TinyImageNet-200

Citing this work

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages