Improving adversarial robustness via joint classification and multiple explicit detection classes

We improved the trade-off between natural accuracy and robust verifiable accuracy by introducing multiple abstain classes to the training and verification procedures of neural networks.

Run

To train the network with multiple abstain classes for the CIFAR-10 dataset with epsilon=12/255, run the following:

python train.py "training_params:method=robust_natural" "training_params:method_params:bound_type=interval" --config config/cifar_dm-shallow_12_255.json

In the config file you can assign parameters such as M (the number of abstain classes), gamma (the hyper-parameter for the regularization), is_regularized (whether you wan to regularize model to have balance between classes), and alpha (the step-size for the Bergman divergence).

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
config		config
3rd-party-licenses.txt		3rd-party-licenses.txt
LICENSE.txt		LICENSE.txt
README.md		README.md
adv.png		adv.png
argparser.py		argparser.py
attack.py		attack.py
bound_layers.py		bound_layers.py
bound_layers_joint.py		bound_layers_joint.py
clean.png		clean.png
config.py		config.py
converter.py		converter.py
datasets.py		datasets.py
defaults.json		defaults.json
eps_scheduler.py		eps_scheduler.py
eval.py		eval.py
model_defs.py		model_defs.py
model_defs_gowal.py		model_defs_gowal.py
pgd.py		pgd.py
requirements.txt		requirements.txt
train.py		train.py

License

sinaBaharlouei/MultipleAbstainDetection

Folders and files

Latest commit

History

Repository files navigation

Improving adversarial robustness via joint classification and multiple explicit detection classes

Run

About

Resources

License

Stars

Watchers

Forks

Languages