FACET: Robust Counterfactual Explanation Analytics

This repository contains all code, results, and figures for the paper FACET: Robust Counterfactual Explanation Analytics, which was published at SIGMOD 2024. Please use the following BibTex citation when referencing our work.

@article{10.1145/3626729,
    author = {VanNostrand, Peter M. and Zhang, Huayi and Hofmann, Dennis M. and Rundensteiner, Elke A.},
    title = {FACET: Robust Counterfactual Explanation Analytics},
    year = {2023},
    issue_date = {December 2023},
    publisher = {Association for Computing Machinery},
    address = {New York, NY, USA},
    volume = {1},
    number = {4},
    url = {https://doi.org/10.1145/3626729},
    doi = {10.1145/3626729},
    journal = {Proc. ACM Manag. Data},
    month = {dec},
    articleno = {242},
    numpages = {27},
}

FACET Overview

FACET (Fast Actionable Counterfactuals for Ensembles of Trees) generates a novel type of explanation which we call counterfactual regions for decisions made by ensembles of trees. For an instance x a counterfactual region R defines a portions of the feature space where all points x' in R are guaranteed to be counterfactual to x, e.g. if y=f(x)=A then y=f(x')=B. We design FACET to be highly performant and support a wide variety of user parameters such that explanations can be interactively personalized to meet real users needs.

Running FACET

Requirements

The code in this repository was developed using Python 3.8.13, requirements.yml contains a list of required packages and is formatted for use with Anaconda and requirements.txt a list formatted for pip. To run experiments with OCEAN, a state-of-the-art method we compare to, you will need a license for the Gurobi optimizer. Free academic licenses are available here. Setup can be done as follow

# create the anaconda environment
conda config --add channels https://conda.anaconda.org/gurobi
conda create --name facet --file requirements.txt
conda activate facet
# install solver needed for SOTA comparison method MACE
pysmt-install --z3 --confirm-agreement
# activate gurobi for SOTA comparison method OCEAN
grbgetkey <your_acadmic_license_key>

Generating Explanations

For convenience main.py takes a variety of command line arguments

flag	purpose	allowed values
`--expr`	the experiment to run	`simple`, `ntrees`, `nrects`, `compare`, `k`, `m`, `nconstraints`, `perturb`, `robust`
`--values`	the experimental values to test	space separated list of values e.g. `10 50 100` or `0.1 0.2 0.3`
`--ds`	the dataset to explain	cancer, glass, magic, spambase, vertebral
`--method`	the XAI method to use	`FACETIndex`, `OCEAN`, `RFOCSE`, `AFT`, `MACE`
`--ntrees`	the ensemble size to test	integer value, overridden in for `--expr` `ntrees`
`--maxdepth`	the max depth of ensemble trees	integer value, `-1` for no max depth
`--it`	the iteration to run, used as random seed	space separated integer values
`--fmod`	a filename modifier append to append to results file	string value
`--model`	the underlying mode to explain	`rf` or `gbc`

Executing python main.py with no flags will perform a simple explanation of 20 instances on the vertebral dataset using FACET and an ensemble with T=10, Dmax=5. Parameters not involved in any given experiment are set to the default values provided in experiments.py

All results are output to ./results/<expr_name>.csv. Generated explanations, all parameters used in each iteration, and a summary of results can be found at ./results/<expr_name>.csv. Code for generating all figures from the paper are available in Jupyter Notebooks at ./figures/<expr_name>.ipynb and should be pointed to a matching results csv file of your choice.

Datasets

A listing of datasets which FACET has been applied on including the number of instance N, the number of features n and the number of features after one hot encoding. Datasets marked with * have results included in the paper with the remainder except loans presented here. All figures shown here and in the paper can be found in the ./figures/ directory.

Dataset Name	Abbreviated Name	N	n	n (one-hot)	Source
Adult*	`adult`	45222	11	41	OCEAN
Breast Cancer Wisconsin (Diagnostic) Data Set*	`cancer`	699	9	9	UCI
ProPublica COMPAS Recidivism Data Set	`compas`	5278	5	5	OCEAN
Credit Card Default*	`credit`	29623	14	14	OCEAN
Glass Identification Data Set	`glass`	214	9	9	UCI
Loan Predication (user study only)	`loans`	615	13	NA	Kaggle
MAGIC Gamma Telescope Data Set*	`magic`	19020	10	10	UCI
Spambase*	`spambase`	4600	57	57	UCI
Vertebral Column Data Set	`vertebral`	310	6	6	UCI

Additional Results

Our paper includes results for FACET on adult, cancer, credit, magic, and spambase. We include results for compas, glass, and vertebtral in additional_results.md due to space constraints.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
baselines		baselines
data		data
detectors		detectors
experiments		experiments
explainers		explainers
figures		figures
results		results
utilities		utilities
.gitignore		.gitignore
FACET_full_paper.pdf		FACET_full_paper.pdf
LICENSE		LICENSE
README.md		README.md
dataset.py		dataset.py
main.py		main.py
manager.py		manager.py
requirements.txt		requirements.txt
requirements.yml		requirements.yml

License

PeterVanNostrand/FACET

Folders and files

Latest commit

History

Repository files navigation

FACET: Robust Counterfactual Explanation Analytics

FACET Overview

Running FACET

Requirements

Generating Explanations

Datasets

Additional Results

About

Resources

License

Stars

Watchers

Forks

Languages