CapsNet-Adversarial

I show that reconstruction error can be used to detect adversarial attacks against encoder-decoder network architectures. These attacks are carried out in a white-box scenario for the classification+encoder network (in this case a capsule network), and black-box for the decoder network. This method can detect ~70% of adversarial attacks at a 5% false positive rate. Check out Attack-CapsNet.ipynb for implementation details and results.

This project was done as part of my final project for CMPS 290C: Advanced Machine Learning at UC Santa Cruz. The associated talk can be found at Adversarial-Defenses-Talk

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
models		models
.gitignore		.gitignore
Attack-CapsNet.ipynb		Attack-CapsNet.ipynb
README.md		README.md
Train-Baselines.ipynb		Train-Baselines.ipynb
Train-CapsNet.ipynb		Train-CapsNet.ipynb
attacks.py		attacks.py
datasets.py		datasets.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CapsNet-Adversarial

About

Releases

Packages

Languages

KellerJordan/CapsNet-Adversarial

Folders and files

Latest commit

History

Repository files navigation

CapsNet-Adversarial

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages