whisper_attack

This repository contains code to fool Whisper ASR models with adversarial examples. It accompanies our paper.

We provide code to generate adversarial examples as we did in the paper, and to evaluate Whisper on these examples via Hugging Face transformers.

Requirements

Install robust_speech and whisper

To use the HF inference pipeline you'll need transformers>=4.23.0, datasets>=2.5.0 and evaluate>=0.2.2.
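
For reference, a typical installation might look like the following. The install commands for robust_speech and whisper are assumptions based on their public repositories; follow each project's own instructions if they differ.

# robust_speech and whisper install paths are assumptions; check their repositories
pip install git+https://github.com/openai/whisper.git
pip install git+https://github.com/RaphaelOlivier/robust_speech.git
pip install "transformers>=4.23.0" "datasets>=2.5.0" "evaluate>=0.2.2"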

Usage

Generate adversarial examples

The run_attack.py file runs the robust_speech attack evaluation script. Configuration files in attack_configs/ detail the attacks, datasets used and hyperparameters, and can be customized with command line arguments. Model configurations in model_configs/ detail the loading information for each Whisper model.
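
As a rough sketch of an invocation (the config file name and the hyperparameter override below are illustrative only, not exact paths from this repository; robust_speech configs follow SpeechBrain's hyperpyyaml convention and accept command-line overrides):

# illustrative config path and override; see the bash scripts below for the exact commands used in the paper
python run_attack.py attack_configs/pgd/whisper-tiny.yaml --snr=35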

For examples, please check our bash scripts, which reproduce the attacks we ran in the paper:

  • cw.sh runs a targeted attack on the ASR decoder
  • pgd.sh runs untargeted attacks on the ASR decoder (at 35 dB and 40 dB SNR)
  • smooth.sh runs an untargeted attack on the ASR decoder while using the Randomized Smoothing defense
  • lang.sh runs a targeted attack on the language detector, leading to a degradation of ASR performance. We run it with 7 source languages and 3 target languages.
  • rand.sh applies Gaussian noise for comparison

You will need to set up the datasets for robust_speech. For the language detection attack, these are CommonVoice datasets in the source languages; for all other attacks, it is the LibriSpeech test-clean set. If, like us, you generate attacks on a subset of the dataset, you should first generate the subset CSVs. Here is an example for LibriSpeech:

head -n 101 test-clean.csv > test-clean-100.csv

Use our precomputed adversarial examples

In the whisper_adversarial_examples folder, we provide all of our precomputed adversarial examples as a Hugging Face dataset. You can use this dataset directly with the inference.py script, for example:

python inference.py --model whisper-medium.en --config untargeted-35

More examples are provided in the inference.sh script.

The dataset is also available on the Hugging Face Hub, or you can simply download the archives.
