Adversarial Example Attacks on Tabular Data

The official repository of Cost aware Feasible Attack (CaFA) on Tabular Data. It provides a modular, clean and accessible implementation of CaFA and its variants, complying with Adversarial Robustness Toolbox framework. Thus, it allows: transparency of technical details of our work, future extension of the work and utilizing the attack for practical means (e.g., evaluation of models' robustness).

What is CaFA?

CaFA is an Adversarial Example attack, suited for tabular data. That is, given a set of samples and a classification ML-model, CaFA crafts malicious inputs--based on the original ones--that are misclassified by the model.

CaFA is composed of 3 main logical components:

Mine: employing a constraints mining algorithm (we use FastADC and our ranking scheme) on a portion of the dataset; we focus on Denial Constraints.
Perturb: attacking the model with TabPGD (a PGD variation we propose to attack tabular data) and TabCWL0 (a variation of Carlini-Wagner's attack) to craft adversarial examples under structure constraints and cost limitations.
Project: The crafted samples are then projected onto the constrained space embodied by the constraints learned in the first step. For this end we use a SAT solver (Z3 Theorem Prover).

Setup

The project requires Python 3.8.5 and on, and Java 11 and on (to run FastADC). Additionally, the installation of pip install -r requirements.txt is required (preferably in an isolated venv).

Usage

To run the attack use:

python attack.py data=<dataset_name>

Where <dataset_name> is one of the datasets listed in the data/ dir (which can be enriched).

The attack's components can be enabled/disabled/modified through the Hydra's configuration dir (config/) or overriden through CLI. These components include:

data: the dataset to preprocess, train on, attack and mine constraints from.
ml_model: the ML model to load/train and target as part of the attack.
attack: the attack's (CaFA) parameters.
constraints: the specification of the utilized constraints, their mining process and whether to incorporate projection; in this these are Denial Constraints.

Datasets

We evaluate on three commonly used tabular datasets: Adult and Bank Marketing, and Phishing Websites.

Additional tabular datasets can be added following the same structure and format as the existing ones; that is, it is requried to provide the attack with the data itself, its structure and optionally the mined constraints (see: config/data/).

Citation

If you use this code in your research, please cite our paper:

@inproceedings{BenTov24CaFA,
  title={{CaFA}: {C}ost-aware, Feasible Attacks With Database Constraints Against Neural Tabular Classifiers},
  author={Ben-Tov, Matan and Deutch, Daniel and Frost, Nave and Sharif, Mahmood},
  booktitle={Proceedings of the 45th IEEE Symposium on Security and Privacy (S&P)},
  year={2024}
}

License

attack-tabular repository is licensed under the MIT License.

Name		Name	Last commit message	Last commit date
Latest commit History 66 Commits
config		config
data		data
docs		docs
src		src
trained-models		trained-models
LICENSE		LICENSE
README.md		README.md
attack.py		attack.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

config

config

data

data

docs

docs

src

src

trained-models

trained-models

LICENSE

LICENSE

README.md

README.md

attack.py

attack.py

requirements.txt

requirements.txt

Repository files navigation

Adversarial Example Attacks on Tabular Data

What is CaFA?

Setup

Usage

Datasets

Citation

License

About

Releases

Packages

Contributors 2

Languages

License

matanbt/attack-tabular

Folders and files

Latest commit

History

Repository files navigation

Adversarial Example Attacks on Tabular Data

What is CaFA?

Setup

Usage

Datasets

Citation

License

About

Resources

License

Stars

Watchers

Forks

Languages