
Perturb-and-compare Approach for Detecting Out-of-distribution Samples in Constrained Access Environments

This repository contains the code for the paper Perturb-and-compare Approach for Detecting Out-of-distribution Samples in Constrained Access Environments (ECAI 2024).

Abstract

Accessing machine learning models through remote APIs has been gaining prevalence following the recent trend of scaling up model parameters for increased performance. Even though these models exhibit remarkable ability, detecting out-of-distribution (OOD) samples remains a crucial safety concern for end users as these samples may induce unreliable outputs from the model. In this work, we propose an OOD detection framework, MixDiff, that is applicable even when the model's parameters or its activations are not accessible to the end user. To bypass the access restriction, MixDiff applies an identical input-level perturbation to a given target sample and a similar in-distribution (ID) sample, then compares the relative difference in the model outputs of these two samples. MixDiff is model-agnostic and compatible with existing output-based OOD detection methods. We provide theoretical analysis to illustrate MixDiff's effectiveness in discerning OOD samples that induce overconfident outputs from the model and empirically demonstrate that MixDiff consistently enhances the OOD detection performance on various datasets in vision and text domains.
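As a rough illustration of the perturb-and-compare idea, the sketch below scores a target sample by mixing both it and a similar ID anchor with the same auxiliary samples and comparing how a base OOD score changes under the identical perturbation. This is a minimal, hypothetical sketch for intuition only; model_fn, base_score, mixup, and the mixing ratio lam are illustrative stand-ins rather than the repository's actual interfaces (see the mixup/ directory for the real implementation).

import numpy as np

def mixup(x, x_aux, lam=0.5):
    # Input-level perturbation: convex combination of two samples.
    return lam * x + (1.0 - lam) * x_aux

def base_score(logits):
    # Example base OOD score: negative maximum softmax probability (MSP).
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return -probs.max()

def mixdiff_score(x_target, x_anchor, aux_samples, model_fn, lam=0.5):
    # Apply the same perturbation to the target and to a similar ID anchor,
    # then compare the perturbation-induced change in their base OOD scores.
    diffs = []
    for x_aux in aux_samples:
        s_target = base_score(model_fn(mixup(x_target, x_aux, lam)))
        s_anchor = base_score(model_fn(mixup(x_anchor, x_aux, lam)))
        diffs.append(s_target - s_anchor)
    return float(np.mean(diffs))

Roughly speaking, this comparison term is used to augment the base OOD score of the target sample; the scripts below run the full pipeline.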

Setup

Install packages

conda env create -f environment.yml
conda activate mixdiff

Prepare datasets

Prepare the datasets as follows:

  • CIFAR10

    • The dataset is automatically downloaded when the code is run.
  • CIFAR100

    • The dataset is automatically downloaded when the code is run.
  • CIFAR10+

    • The dataset is automatically downloaded when the code is run.
  • CIFAR50+

    • The dataset is automatically downloaded when the code is run.
  • TinyImageNet

    • Run the tinyimagenet.sh script in the data directory.

Experiments where model outputs are logits

Evaluate MixDiff's performance when the model outputs are logits.

bash mixup/scripts/logit/mixdiff_logit.sh

Experiments where model outputs are prediction probabilities

Run the script below after replacing OOD_METHOD with one of the following names: entropy, msp.

bash mixup/scripts/prob/mixdiff_prob_{OOD_METHOD}.sh
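For example, to evaluate MixDiff with MSP as the base OOD score, the template above expands to:

bash mixup/scripts/prob/mixdiff_prob_msp.sh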

Evaluate the setup where oracle samples are used as auxiliary samples.

bash mixup/scripts/prob/mixdiff_entropy_orc.sh

Evaluate the setup where random ID samples are used as auxiliary samples.

bash mixup/scripts/prob/mixdiff_entropy_rnd_id.sh

Experiments where model outputs are prediction labels

Evaluate using oracle samples as auxiliary samples.

bash mixup/scripts/label/onehot_agmax_orc.sh

Evaluate using random ID samples as auxiliary samples.

bash mixup/scripts/label/onehot_agmax_rnd_id.sh

Experiments where model outputs are last layer activations

Evaluate MixDiff's performance when the model outputs are last layer activations.

bash mixup/scripts/embed/mixdiff_embedding.sh

Evaluate baselines

Evaluate output-based baseline OOD scoring functions (MSP, MLS, etc.) on benchmark datasets.

bash mixup/scripts/baselines/baselines.sh

Adversarial attack experiments

Evaluate the baselines under various adversarial attack scenarios.

bash mixup/scripts/adv_attack/baselines.sh

Evaluate MixDiff+Entropy under various adversarial attack scenarios.

bash mixup/scripts/adv_attack/entropy_orc_mixdiff.sh

Evaluate the MixDiff-only setup under various adversarial attack scenarios.

bash mixup/scripts/adv_attack/entropy_orc_mixdiff_only.sh

Evaluate the MixDiff-only setup on clean samples.

bash mixup/scripts/adv_attack/entropy_orc_mixdiff_only_clean.sh

Out-of-scope detection experiments

Prepare datasets

  • ACID

    • Download the dataset from the official repository.
    • Place the files as data/acid/customer_training.csv and data/acid/customer_testing.csv.
  • CLINIC150

  • TOP

    • Download the dataset from the link provided in the paper.
    • Unzip the file under the directory data/top.
  • Banking77

    • This dataset is automatically downloaded when the code is run.

Train intent classification models

Run the script below after replacing DATASET_NAME with one of the following names: clinic150, banking77, acid, top. This trains the intent classification models used for MixDiff hyperparameter search as well as for the final OOS detection evaluation.

bash text_classification/scripts/train_{DATASET_NAME}.sh
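For example, to train the intent classification models for CLINIC150, the template above expands to:

bash text_classification/scripts/train_clinic150.sh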

Hyperparameter search on validation splits

bash mixup_text/scripts/run_val.sh

Evaluation on test splits

Run evaluation on the test split by setting the appropriate values (e.g., the hyperparameters selected on the validation split) in the script below.

bash mixup_text/scripts/run_test.sh

Run baselines

Run the MLS, MSP, energy, and entropy OOS detection baselines using the script below.

bash mixup_text/scripts/run_baselines.sh

Citation

TODO

Acknowledgments

We built our experiment pipeline on the codebase of the ZOC repository. We thank the authors of "Zero-Shot Out-of-Distribution Detection Based on the Pre-trained Model CLIP" for sharing their code.
