Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Security

This is the implementation for the IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Security."

Prerequisite

The code is implemented and tested in Keras (with the TensorFlow backend) and PyTorch. It runs on Python 3.6.9.

Keras Version

Keras 2.3.0
TensorFlow 1.14.0

PyTorch Version

PyTorch 1.7.0

Usage

The main functions are located in the src/main.py file.

Model Orthogonalization

To harden a model using MOTH, please use the following command:

python3 src/main.py --phase moth

The default dataset and model are CIFAR-10 and ResNet20. You can harden different model structures on other datasets by passing the arguments --dataset [dataset] and --network [model structure]. We have included four datasets (CIFAR-10, SVHN, LISA, and GTSRB) and four model structures (ResNet, VGG19, NiN, and CNN). (The datasets will be uploaded soon.)

To measure the pair-wise class distance, please run:

python3 src/main.py --phase validate --suffix [suffix of checkpoint] --seed [seed id]

Checkpoints of models hardened by MOTH have the suffix _moth added to the original checkpoint path. Pass the checkpoint suffix with the --suffix argument. Measure the distance with three different random seeds by running the command once for each seed id, passing 0, 1, and 2 to the --seed argument.
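The three seeded runs described above can be scripted. The following is a minimal sketch, not part of the repository: the helper names build_validate_commands and run_validation are hypothetical, and the value "_moth" for --suffix is an assumption based on the description above. It only uses the --phase, --suffix, and --seed arguments mentioned in this README.

```python
import subprocess


def build_validate_commands(suffix, seeds=(0, 1, 2)):
    """Build one 'validate' command line per seed id (hypothetical helper)."""
    return [
        ["python3", "src/main.py", "--phase", "validate",
         "--suffix", suffix, "--seed", str(seed)]
        for seed in seeds
    ]


def run_validation(suffix):
    """Run the validate phase once per seed, stopping on the first failure."""
    for cmd in build_validate_commands(suffix):
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # "_moth" is an assumed suffix value; adjust it to your checkpoint.
    for cmd in build_validate_commands("_moth"):
        print(" ".join(cmd))
```

Call run_validation from the repository root to execute the commands instead of printing them.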

The final pair-wise class distance of the evaluated model can be obtained through the following command:

python3 src/main.py --phase show --suffix [suffix of checkpoint]

It prints out a matrix of class distances for all class pairs. Each row denotes the source label and each column the target label. The average distance and relative enlargement are also presented at the end.
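To make the summary statistics concrete, here is a minimal sketch of how an average distance and a relative enlargement could be derived from such a matrix. It is an illustration under stated assumptions, not the repository's code: the function names are hypothetical, the diagonal (source equal to target) is assumed to be excluded, relative enlargement is assumed to mean the fractional increase of the average distance over the original model, and the matrices are made-up toy values.

```python
def average_pairwise_distance(matrix):
    """Mean of the off-diagonal entries; rows are source labels, columns targets."""
    n = len(matrix)
    values = [matrix[s][t] for s in range(n) for t in range(n) if s != t]
    return sum(values) / len(values)


def relative_enlargement(original, hardened):
    """Assumed definition: fractional increase of the average distance."""
    base = average_pairwise_distance(original)
    return (average_pairwise_distance(hardened) - base) / base


if __name__ == "__main__":
    # Toy 3-class matrices with made-up values, for illustration only.
    original = [[0, 10, 20], [30, 0, 40], [50, 60, 0]]
    hardened = [[0, 20, 40], [60, 0, 80], [100, 120, 0]]
    print(average_pairwise_distance(original))
    print(relative_enlargement(original, hardened))
```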

Model Functionality

To test the accuracy of a model, simply run:

python3 src/main.py --phase test --suffix [suffix of checkpoint]

The robustness of a given model can be evaluated using PGD with the following command:

python3 src/main.py --phase measure --suffix [suffix of checkpoint]
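For readers unfamiliar with PGD, the following is a minimal sketch of an L-infinity PGD attack on a toy logistic-regression classifier in pure Python. It is not the repository's PGD code: the linear model, the function names, and the values of eps, alpha, and steps are assumptions chosen for illustration, and the random start that PGD often uses is omitted.

```python
import math


def sign(v):
    """Return -1, 0, or 1 depending on the sign of v."""
    return (v > 0) - (v < 0)


def score(x, w, b):
    """Logit of the toy linear model."""
    return sum(wi * xi for wi, xi in zip(w, x)) + b


def loss_grad(x, y, w, b):
    """Gradient of the logistic loss with respect to the input x, label y in {0, 1}."""
    p = 1.0 / (1.0 + math.exp(-score(x, w, b)))
    return [(p - y) * wi for wi in w]


def pgd_linf(x0, y, w, b, eps=0.1, alpha=0.02, steps=10):
    """Iterated signed-gradient ascent, projected onto the eps-ball around x0."""
    x = list(x0)
    for _ in range(steps):
        grad = loss_grad(x, y, w, b)
        # Step in the direction that increases the loss.
        x = [xi + alpha * sign(gi) for xi, gi in zip(x, grad)]
        # Project back into the L-infinity ball of radius eps around x0.
        x = [min(max(xi, x0i - eps), x0i + eps) for xi, x0i in zip(x, x0)]
        # Keep every feature inside the valid input range [0, 1].
        x = [min(max(xi, 0.0), 1.0) for xi in x]
    return x


if __name__ == "__main__":
    w, b = [1.0, -1.0], 0.0
    x0, y = [0.6, 0.4], 1
    x_adv = pgd_linf(x0, y, w, b)
    print(score(x0, w, b), score(x_adv, w, b))
```

Robustness is then typically reported as the accuracy of the model on such perturbed inputs.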

Acknowledgement

The code of trigger inversion is inspired by Neural Cleanse.

The PGD code is adapted from cifar10_challenge.

Thanks to the authors for their amazing implementations.

Reference

Please cite the paper if you use this code for any purpose.

@inproceedings{tao2022model,
  title={Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Security},
  author={Tao, Guanhong and Liu, Yingqi and Shen, Guangyu and Xu, Qiuling and An, Shengwei and Zhang, Zhuo and Zhang, Xiangyu},
  booktitle={2022 IEEE Symposium on Security and Privacy (SP)},
  year={2022},
  organization={IEEE}
}
