Asymmetrical Adversarial Training on CIFAR10

Base detector training

First download and extract model checkpoints. This will put the pretrained classifier in a designated location.

Use train.py to train base detectors. For example, the following command trains the k=0 eps=8.0 model.

$ python train.py --target_class 0 --epsilon 8.0 --norm Linf --num_steps 40 --step_size 0.5

Evaluation

First download and extract model checkpoints.

Robustness test. Use eval_base_detector.py to evaluate base detectors. As an example, the following tests the first eps8.0 base detector.

$ python eval_base_detector.py --epsilon 8.0 --norm Linf --steps 10 --step_size 2.0  --target_class 0 \
--prefixed models/cifar10_ovr_Linf_8.0_iter40_lr0.5_bs300/class0_ckpt_best/checkpoint-27000

Robustness test — Nattack based Black-box test

$ python eval_base_detector_Nattack.py --target_class 0 --prefixed \
models/cifar10_ovr_Linf_8.0_iter40_lr0.5_bs300/class0_ckpt_best/checkpoint-27000

Detection performance. Use eval_detection.py to test the detection performances of integrated detection and generated detection.

Robust classification performance. Use eval_generative_classifier.py and eval_integrated_classifier.py to test the classification performances of generative classification and integrated classification.

Minimum mean L2 distance. Use min_L2_perturb.py to reproduce the minimum mean L2 distance results.

Synthesize images.

$ # Generate ship images by attacking the class 8 base detector
$ python synthesis.py --target_class 8 --prefixed \
models/cifar10_ovr_Linf_8.0_iter40_lr0.5_bs300/class8_ckpt_best/checkpoint-16000

Images generated with eps16.0 constrained models

$ # Generate ship images by attacking the class 8 base detector (eps16.0 model)
$ python synthesis.py --epsilon 25500 --num_steps 200 --target_class 8 --prefixed \
models/cifar10_ovr_Linf_16.0_iter80_lr0.5_bs300/class8/checkpoint-10000

Gaussian noise attack (i.e., rubbish examples)

Image generated by attacking the generative classifier and discriminative robust classifier using (the same) Gaussian noise image. Image titles are the logit outputs of corresponding models. We used unconstrained L2 based PGD attack of step-size 0.5*255. The five columns corresponding to the perturbed images at step 0, 50, 100, 150, and 200. Notebook Gaussian_noise_attack.ipynb

Model checkpoints

Pretrained models include naturally trained classifiers, an adversarially trained classifier, and eps8.0 base detectors.

Download the extract pretrained models. This will create a new directory "models" and populate it with pretrained models.

$ wget https://asymmetrical-adversarial-training.s3.amazonaws.com/cifar10/checkpoints.tar.gz
$ tar zxvf checkpoints.tar.gz
$ # eps16 constrained detectors
$ wget https://asymmetrical-adversarial-training.s3.amazonaws.com/cifar10/checkpoints_eps16.tar.gz
$ tar zxvf checkpoints_eps16.tar.gz

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
cifar10_data		cifar10_data
synthesized		synthesized
Gaussian_noise_attack.ipynb		Gaussian_noise_attack.ipynb
README.md		README.md
cifar10_input.py		cifar10_input.py
class8_synthesis.png		class8_synthesis.png
eval_base_detector.py		eval_base_detector.py
eval_base_detector_Nattack.py		eval_base_detector_Nattack.py
eval_detection.py		eval_detection.py
eval_generative_classifier.py		eval_generative_classifier.py
eval_integrated_classifier.py		eval_integrated_classifier.py
eval_utils.py		eval_utils.py
min_L2_perturb.py		min_L2_perturb.py
model.py		model.py
nearest_neighbor_analysis.ipynb		nearest_neighbor_analysis.ipynb
noise_attack_p001.png		noise_attack_p001.png
pack_checkpoints.sh		pack_checkpoints.sh
pgd_attack.py		pgd_attack.py
requirements.txt		requirements.txt
speed_test.ipynb		speed_test.ipynb
synthesis.py		synthesis.py
train.py		train.py

yyht/AAT-CIFAR10

Folders and files

Latest commit

History

Repository files navigation

Asymmetrical Adversarial Training on CIFAR10

Base detector training

Evaluation

Gaussian noise attack (i.e., rubbish examples)

Model checkpoints

About

Resources

Stars

Watchers

Forks

Languages