HeatSmoothing

Adversarial Boot Camp: label free certified robustness in one epoch

The following code is associated with the paper submitted to ICLR 2021. We implement a variational method to deterministically average DNNs.

Randomized smoothing is a well-known stochastic method for computing a Gaussian average of an initial model. However, a Gaussian-averaged model can also be obtained by training with a regularized loss (see Figure 1). In this work, we present an iterative deterministic smoothing method for classification neural networks, in contrast to these stochastic methods. At each iteration, the model is retrained with a loss containing a gradient-norm-squared penalization term on the model output, which is estimated via the Johnson–Lindenstrauss lemma.
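For concreteness, here is a sketch of one smoothing step, with $v^{(0)} = f$ the initial model, $T$ the total smoothing variance, and $K$ the number of steps (the constants are indicative; see the paper for the exact weighting):

$$
v^{(k)} = \arg\min_{v}\ \mathbb{E}_{x}\Big[\tfrac{1}{2}\big\lVert v(x) - v^{(k-1)}(x)\big\rVert_2^2 + \tfrac{T}{2K}\big\lVert \nabla_x v(x)\big\rVert^2\Big], \qquad k = 1,\dots,K.
$$

Computing the gradient-norm penalty exactly is expensive for vector-valued outputs, so it is estimated with random projections. Below is a minimal PyTorch sketch of such a Johnson–Lindenstrauss estimator (illustrative only; `model`, `x`, and `n_proj` are hypothetical names, not this repository's API):

```python
import torch

def jl_grad_norm_sq(model, x, n_proj=2):
    """Estimate ||d model(x)/dx||_F^2 by random projections:
    for Gaussian w, E_w ||grad_x <w, model(x)>||_2^2 = ||J||_F^2."""
    x = x.clone().requires_grad_(True)
    out = model(x)  # shape (batch, n_classes)
    est = 0.0
    for _ in range(n_proj):
        w = torch.randn_like(out)  # Gaussian probe vector
        g, = torch.autograd.grad((out * w).sum(), x, create_graph=True)
        est = est + g.pow(2).flatten(1).sum(dim=1)
    return est / n_proj  # per-example estimate, differentiable for training
```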

In our experiments, we test our iterative method on the CIFAR-10 and ImageNet-1k datasets. We compare our models to the stochastically smoothed models of Cohen et al. and Salman et al.

The first experiment is to compute the L2 certified accuracies using the certification method of Cohen et al.; the resulting plots appear in Figure 3.

The corresponding plotting code is given in `figs/cert_plots.ipynb`.
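For reference, Cohen et al.'s certification computes a lower confidence bound on the probability of the top class under Gaussian noise and converts it to an L2 radius. A minimal sketch of that conversion (the standard formula, not necessarily what `certify.py` does internally):

```python
from scipy.stats import norm
from statsmodels.stats.proportion import proportion_confint

def certified_radius(n_top, n_total, sigma, alpha=0.001):
    """Cohen-style certified L2 radius: Clopper-Pearson lower bound
    on the top-class probability pA, then R = sigma * Phi^{-1}(pA).
    Returns 0.0 (abstain) when the bound does not exceed 1/2."""
    p_lower, _ = proportion_confint(n_top, n_total, alpha=2 * alpha, method="beta")
    if p_lower <= 0.5:
        return 0.0  # cannot certify this input
    return sigma * norm.ppf(p_lower)
```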

Next, we compute a lower bound on the adversarial distance using the Lipschitz constant of the averaged models. We also attack our models using the PGD and DDN attacks. Results for CIFAR-10 and ImageNet-1k appear in Figure 4.

Our attack curve plotting notebook is given in `figs/adv_plots.ipynb`.
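For orientation, a minimal L2 PGD sketch (illustrative only, not this repository's attack scripts; DDN differs by adaptively decoupling the step direction from the perturbation norm):

```python
import torch
import torch.nn.functional as F

def pgd_l2(model, x, y, eps=1.0, steps=20, step_size=0.25):
    """Minimal L2 PGD on NCHW image batches: ascend the cross-entropy
    loss with normalized gradient steps, projecting the perturbation
    back onto the L2 ball of radius eps after every step."""
    delta = torch.zeros_like(x, requires_grad=True)
    for _ in range(steps):
        loss = F.cross_entropy(model(x + delta), y)
        grad, = torch.autograd.grad(loss, delta)
        g_norm = grad.flatten(1).norm(dim=1).clamp_min(1e-12)
        delta = delta + step_size * grad / g_norm.view(-1, 1, 1, 1)
        d_norm = delta.flatten(1).norm(dim=1).clamp_min(1e-12)
        delta = delta * (eps / d_norm).clamp(max=1.0).view(-1, 1, 1, 1)
        delta = delta.detach().requires_grad_(True)
    return (x + delta).detach()
```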

A benefit of deterministic smoothing is faster inference. When performing classification, our models do not require the randomized smoothing procedure needed by the stochastic models of Cohen et al.
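The speedup is easy to see side by side: one forward pass for a deterministically averaged model versus on the order of `n_samples` passes for a stochastically smoothed one (a sketch assuming a standard PyTorch classifier; all names are illustrative):

```python
import torch
import torch.nn.functional as F

@torch.no_grad()
def predict_deterministic(model, x):
    """A deterministically averaged model: a single forward pass."""
    return model(x).argmax(dim=1)

@torch.no_grad()
def predict_smoothed(model, x, sigma=0.25, n_samples=100):
    """Cohen-style stochastic smoothing: many noisy forward passes,
    followed by a majority vote over the predicted classes."""
    n_classes = model(x).shape[1]
    counts = torch.zeros(x.shape[0], n_classes, device=x.device)
    for _ in range(n_samples):
        preds = model(x + sigma * torch.randn_like(x)).argmax(dim=1)
        counts += F.one_hot(preds, n_classes).float()
    return counts.argmax(dim=1)
```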

Experiments

The code is tested with python3 and PyTorch v1.5.0 (along with torchvision and CUDA toolkit version 10.2). See https://pytorch.org/get-started/locally/ for installation details.

Then clone this repository and run `cd HeatSmoothing`.

ImageNet-1k Experiments

Now, run `cd imagenet`.

Training

Here, we use code modified from Train ImageNet in 18 minutes. If you run locally, you may need to download the special ImageNet dataset yourself from here. The faster training is achieved by training on smaller, downsampled images for roughly the first 15 epochs.

To train a base model, simply execute

./run.sh

from the command line.

Using this initial model, train the deterministic averaged model by running

./run_ours.sh

Alternatively, you can download the pretrained versions of these two models, along with the pretrained Cohen and Salman models, here.

Certification

In addition to the certified L2 radius, our certification code also computes classification and certification times. Run `cd certify`. Certify the baseline model and our averaged model by running

python certify.py --datadir 'DIRECTORY WHERE IMAGENET VALIDATION DATASET IS STORED' --model-path 'MODEL PATH.pth.tar' --std 0.25 --rule 'top5'

For the pretrained Cohen and Salman models, the respective model loading code is slightly different. For the Cohen model, run

python certify-cohen.py --datadir 'DIRECTORY WHERE IMAGENET VALIDATION DATASET IS STORED' --model-path 'COHEN MODEL PATH.pth.tar' --std 0.25 --rule 'top5' --is-cohen

and for the Salman model, run

python certify-salman.py --datadir 'DIRECTORY WHERE IMAGENET VALIDATION DATASET IS STORED' --model-path 'SALMAN MODEL PATH.pth.tar' --std 0.25 --rule 'top5' --is-cohen

Using the resulting .pkl dataframes, make the certification plot (Figure 3(b)) using the code provided in the notebook `figs/cert_plots.ipynb`.

Attacking

First, compute the L-bound from the paper for the baseline model and our averaged model by running

python test_statistics.py --datadir 'DIRECTORY WHERE IMAGENET VALIDATION DATASET IS STORED' --model-path 'MODEL PATH.pth.tar' 

For the Cohen and Salman models, be sure to uncomment the appropriate lines in `test_statistics.py` and run

python test_statistics.py --datadir 'DIRECTORY WHERE IMAGENET VALIDATION DATASET IS STORED' --model-path 'COHEN or SALMAN MODEL PATH.pth.tar' --is-cohen
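As background on bounds of this kind: if the model $v$ is $L$-Lipschitz in its input, the distance to the nearest adversarial example is lower-bounded by the prediction margin. A standard form of such a bound (an indicative sketch; the paper's L-bound may differ in its constants) is

$$
\lVert \delta \rVert_2 \;\ge\; \frac{v_{c_1}(x) - v_{c_2}(x)}{\sqrt{2}\,L},
$$

where $c_1$ and $c_2$ are the top two classes predicted at $x$, and $\delta$ is any perturbation that changes the top-1 prediction.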

Run `cd attack`. Attack the baseline model and our averaged model by running

python run-attack.py --datadir 'DIRECTORY WHERE IMAGENET VALIDATION DATASET IS STORED' --model-path 'MODEL PATH.pth.tar' --attack 'DDN or PGD' --criterion 'top5'

For the Cohen model, run

python run-attack-cohen.py --datadir 'DIRECTORY WHERE IMAGENET VALIDATION DATASET IS STORED' --model-path 'COHEN MODEL PATH.pth.tar' --attack 'DDN or PGD' --criterion 'cohen'

For the Salman model, run

python run-attack-salman.py --datadir 'DIRECTORY WHERE IMAGENET VALIDATION DATASET IS STORED' --model-path 'SALMAN MODEL PATH.pth.tar' --attack 'DDN or PGD' --criterion 'cohen'

Using the resulting .npz adversarial distances, make the attack curves (Figures 4(b)(d)) using the code provided in the notebook `figs/adv_plots.ipynb`.

CIFAR-10 Experiments

Begin by running `cd cifar10`.

Training

Train the base model, Cohen model, and Salman model by running

./run.sh

from the command line with the correct training script selected on line 42 of `run.sh`.

To train our averaged model, run

python train_ours.py --data-dir 'PATH TO CIFAR-10 DATASET' --init-model-dir 'DIRECTORY OF THE TRAINED BASE MODEL' --pth-name 'best.pth.tar'

from the command line.

Alternatively, the four pretrained models can be downloaded here.

Certification

In addition to the certified L2 radius, our certification code also computes classification and certification times. To certify the baseline and our averaged models, `cd` into certify and run the following from the command line,

python certify.py --data-dir 'WHERE THE DATA IS STORED' --model-dir 'MODEL DIRECTORY' --pth-name 'MODEL PATH.pth.tar'

For the Cohen and Salman models, run

python certify.py --data-dir 'WHERE THE DATA IS STORED' --model-dir 'MODEL DIRECTORY' --pth-name 'MODEL PATH.pth.tar' --is-cohen

Using the resulting .pkl dataframes, make the certification plot (Figure 3(a)) using the code provided in the notebook `figs/cert_plots.ipynb`.

Attacking

First, compute the L-bound from the paper for the baseline model and our averaged model by running

python test_statistics.py --data-dir 'LOCATION OF DATA' --model-dir 'WHERE MODEL IS STORED' --pth-name 'PATH NAME.pth.tar'

For the Cohen and Salman models, run

python test_statistics.py --data-dir 'LOCATION OF DATA' --model-dir 'WHERE MODEL IS STORED' --pth-name 'PATH NAME.pth.tar' --is-cohen

Now it is time to perform the DDN and PGD attacks. First, uncomment lines 94-95 in `/cifar10/attack/salman_attacks.py`. When attacking our models, we want to terminate the attack as soon as the model is successfully adversarially perturbed (a sketch of this check appears after the attack commands below). To attack our baseline and our averaged models, `cd` into attack and run the following from the command line

python run-attack.py --data-dir 'WHERE THE DATA IS STORED' --model-dir 'MODEL DIRECTORY' --pth-name 'MODEL PATH.pth.tar' --criterion 'top1' --attack 'DDN or PGD'

To attack the Cohen and Salman models, run

python run-attack.py --data-dir 'WHERE THE DATA IS STORED' --model-dir 'MODEL DIRECTORY' --pth-name 'MODEL PATH.pth.tar' --criterion 'cohen' --attack 'DDN or PGD'
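The early-termination behaviour described above amounts to a check of the following form inside the attack loop (an illustrative sketch, not the actual code in `salman_attacks.py`):

```python
import torch

@torch.no_grad()
def attack_succeeded(model, x, delta, y):
    """Early-termination test: stop attacking once every example
    in the batch is misclassified."""
    pred = model(x + delta).argmax(dim=1)
    return (pred != y).all().item()
```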

Using the resulting .npz adversarial distances, make the attack curves (Figures 4(a)(c)) using the code provided in the notebook `figs/adv_plots.ipynb`.

Citing our Work

Please cite this work using

@misc{campbell2020adversarial,
      title={Adversarial Boot Camp: label free certified robustness in one epoch}, 
      author={Ryan Campbell and Chris Finlay and Adam M Oberman},
      year={2020},
      eprint={2010.02508},
      archivePrefix={arXiv},
      primaryClass={cs.LG}
}
