Create DirtyFlipping #2376

OrsonTyphanel93 · 2023-12-29T08:32:23Z

Target Label-Flipping Attack Using Dirty Label-Inversion: Speech Vulnerability!

The backdoor approach uses a dirty label-flipping attack to produce a poisoned dataset. The input consists of clean data samples and their clean labels; the output is a set of poisoned data and labels. If the target label is absent from the clean labels, the original data and labels are kept unchanged. Otherwise, the trigger function is applied to the input data and, with a given probability, the label of each poisoned sample is flipped to the selected dirty label. The attack aims to introduce a backdoor that can cause model misclassification by carefully crafting a trigger and injecting it into clean data samples of a specific target class. It is a "dirty label-on-label" backdoor attack.
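The procedure described above can be sketched roughly as follows. This is only a minimal illustration and not the code from this pull request: the function names `dirty_label_flip` and `add_tone_trigger`, the parameters `flip_prob` and `seed`, and the sine-tone trigger are all hypothetical. It also assumes integer class labels in a 1-D array, and that the trigger and the label flip are applied together to target-class samples selected with the given probability.

```python
import numpy as np


def add_tone_trigger(waveform, amplitude=0.01, cycles=5):
    """Hypothetical trigger: superimpose a faint sine tone on a 1-D waveform."""
    t = np.linspace(0.0, 1.0, waveform.shape[-1], endpoint=False)
    return waveform + amplitude * np.sin(2.0 * np.pi * cycles * t)


def dirty_label_flip(x_clean, y_clean, target_label, dirty_label,
                     trigger=add_tone_trigger, flip_prob=0.5, seed=0):
    """Return poisoned copies of (x_clean, y_clean); the inputs are not modified."""
    x_poison = np.array(x_clean, dtype=float)  # np.array copies by default
    y_poison = np.array(y_clean)

    # Keep the original data and labels if the target label is absent.
    target_idx = np.flatnonzero(y_poison == target_label)
    if target_idx.size == 0:
        return x_poison, y_poison

    # Assumption: each target-class sample is selected with probability flip_prob.
    rng = np.random.default_rng(seed)
    selected = target_idx[rng.random(target_idx.size) < flip_prob]

    for i in selected:
        x_poison[i] = trigger(x_poison[i])  # inject the trigger into the sample
        y_poison[i] = dirty_label           # flip the label to the dirty label
    return x_poison, y_poison
```

With `flip_prob=1.0` every sample of the target class is triggered and relabelled; with a target label that does not occur in `y_clean` the function returns unchanged copies.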

Testing

The full code

notebook Description

Hi guys, @beat-buesser! I just created the first dynamic backdoor attack based on dirty labels and label inversion. The attack is stealthy and undetectable; I tested it on the complex TIMIT and AudioMNIST datasets.

I also added speaker verification tests with models such as NeMo from NVIDIA. My attack was 100% deceptive: all of the linked Hugging Face speaker verification models failed to detect the deception.

Additional work applying 'DirtyFlipping' to Hugging Face models

notebook Hugging Face Backdoor link Hugging Face Backdoor attack

Test Configuration:

OS
Python version
ART version or commit number
TensorFlow / Keras / PyTorch / MXNet version
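The configuration fields above were left unfilled. One way to collect them is sketched below, using only the Python standard library. The helper name `collect_test_configuration` is hypothetical, and the package list is an assumption based on the PyPI distribution names of ART and the listed frameworks.

```python
import platform
from importlib import metadata

# Assumed PyPI distribution names for ART and the frameworks listed above.
PACKAGES = ["adversarial-robustness-toolbox", "tensorflow", "keras", "torch", "mxnet"]


def collect_test_configuration(packages=PACKAGES):
    """Return a dict with the OS, Python version, and installed package versions."""
    config = {
        "OS": platform.platform(),
        "Python": platform.python_version(),
    }
    for name in packages:
        try:
            config[name] = metadata.version(name)
        except metadata.PackageNotFoundError:
            config[name] = "not installed"
    return config


if __name__ == "__main__":
    for key, value in collect_test_configuration().items():
        print(f"{key}: {value}")
```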

Bumps [torch](https://github.com/pytorch/pytorch) from 1.13.1 to 2.1.1. - [Release notes](https://github.com/pytorch/pytorch/releases) - [Changelog](https://github.com/pytorch/pytorch/blob/main/RELEASE.md) - [Commits](pytorch/pytorch@v1.13.1...v2.1.1) --- updated-dependencies: - dependency-name: torch dependency-type: direct:production update-type: version-update:semver-major ... Signed-off-by: dependabot[bot] <support@github.com>

Target Label-Flipping Attack Using Dirty Label-Inversion: the attack injects a carefully crafted trigger into clean data samples of a specific target class, introducing a backdoor for potential model misclassification. This is a dirty label-on-label backdoor attack.

OrsonTyphanel93 · 2024-01-07T00:24:53Z

I extended the experiments to the SLU case; the backdoor is still 100% effective.

Thanks !
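The comment above does not say how "100% effective" was measured. A common metric for backdoor attacks is the attack success rate, sketched here under the assumption that it means the fraction of triggered inputs that the model classifies as the dirty label. The function name `attack_success_rate` is hypothetical and does not come from this pull request.

```python
import numpy as np


def attack_success_rate(predicted_labels, dirty_label):
    """Fraction of triggered samples whose predicted label equals dirty_label."""
    predicted_labels = np.asarray(predicted_labels)
    if predicted_labels.size == 0:
        raise ValueError("need at least one triggered sample")
    return float(np.mean(predicted_labels == dirty_label))
```

Here `predicted_labels` would be the model's class predictions on inputs that carry the trigger; a value of 1.0 corresponds to the 100% figure quoted above.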

beat-buesser · 2024-01-09T11:35:39Z

Hi @OrsonTyphanel93 Thank you very much for your pull request! It will be reviewed as soon as possible targeting ART 1.18.

codecov-commenter · 2024-01-09T12:12:21Z

Codecov Report

Attention: 171 lines in your changes are missing coverage. Please review.

Comparison is base (0400813) 85.60% compared to head (2f9d216) 78.07%.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@              Coverage Diff               @@
##           dev_1.18.0    #2376      +/-   ##
==============================================
- Coverage       85.60%   78.07%   -7.53%     
==============================================
  Files             324      327       +3     
  Lines           29326    30205     +879     
  Branches         5407     5589     +182     
==============================================
- Hits            25104    23584    -1520     
- Misses           2840     5215    +2375     
- Partials         1382     1406      +24
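The percentages in the diff above can be reproduced from the raw counts. The sketch below assumes that coverage is hits divided by total lines (hits + misses + partials) and that the value is truncated rather than rounded to two decimals. The truncation is inferred from these numbers only, not from Codecov documentation, and the function name `coverage_percent` is hypothetical.

```python
import math


def coverage_percent(hits, misses, partials):
    """Coverage as hits / total lines, truncated (not rounded) to two decimals."""
    lines = hits + misses + partials
    return math.floor(hits / lines * 10000) / 100


base = coverage_percent(25104, 2840, 1382)  # dev_1.18.0: 29326 lines
head = coverage_percent(23584, 5215, 1406)  # #2376: 30205 lines
delta = round(head - base, 2)
print(base, head, delta)  # 85.6 78.07 -7.53
```

Rounding the head value instead would give 78.08%, which is why truncation is assumed here.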

| Files | Coverage Δ | |
|---|---|---|
| art/__init__.py | `100.00% <100.00%> (ø)` | |
| art/attacks/evasion/__init__.py | `98.24% <100.00%> (+0.03%)` | ⬆️ |
| ...asion/adversarial_patch/adversarial_patch_numpy.py | `74.25% <ø> (ø)` | |
| art/attacks/evasion/dpatch.py | `91.25% <ø> (ø)` | |
| ...cks/evasion/imperceptible_asr/imperceptible_asr.py | `90.33% <100.00%> (ø)` | |
| art/attacks/extraction/knockoff_nets.py | `89.93% <ø> (ø)` | |
| ...ks/inference/membership_inference/shadow_models.py | `44.82% <ø> (-49.14%)` | ⬇️ |
| ...cks/poisoning/perturbations/audio_perturbations.py | `88.09% <100.00%> (+0.29%)` | ⬆️ |
| art/defences/detector/poison/activation_defence.py | `83.28% <100.00%> (+0.04%)` | ⬆️ |
| ...nces/detector/poison/spectral_signature_defense.py | `84.72% <100.00%> (+0.21%)` | ⬆️ |

... and 32 more

... and 29 files with indirect coverage changes

OrsonTyphanel93 · 2024-01-09T12:47:23Z

Hi guys, I'm working on it, but I don't have access to the 1.18 target! Could you change it directly yourselves?

twweeb and others added 30 commits September 15, 2023 04:25

Add CompositeAdversarialAttack

9808223

Signed-off-by: Lei Hsiung <leihsiung.ray@gmail.com>

Apply suggestions from code review

5d82170

Co-authored-by: Beat Buesser <49047826+beat-buesser@users.noreply.github.com> Signed-off-by: Lei Hsiung <leihsiung.ray@gmail.com>

Address code review comments

3a7e69e

Signed-off-by: Lei Hsiung <leihsiung.ray@gmail.com>

Merge branch 'dev_1.16.0' into composite-adversarial-attack

44cd354

Fix Coding Style

faaab20

Signed-off-by: Lei Hsiung <leihsiung.ray@gmail.com>

Merge branch 'composite-adversarial-attack' of github.com:twweeb/adve…

8fdd3c7

…rsarial-robustness-toolbox into composite-adversarial-attack

Support membership black box with no labels (fix Trusted-AI#2154)

9dd6284

Signed-off-by: abigailt <abigailt@il.ibm.com>

Fix tests

e55f388

Signed-off-by: abigailt <abigailt@il.ibm.com>

Fix assert

a5087db

Signed-off-by: abigailt <abigailt@il.ibm.com>

Fix Coding Style

57d8ed0

Signed-off-by: Lei Hsiung <leihsiung.ray@gmail.com>

move hook input to original model device

4f2a479

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

updating notebook to confirm fix

430af84

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

get device model is running on to move hook input onto

c2d333f

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

add re-executed notebook

49acd32

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

Fix Coding Style and Add Unit test

3528185

Signed-off-by: Lei Hsiung <leihsiung.ray@gmail.com>

Fix style check and unit test

03aaeb7

Signed-off-by: Lei Hsiung <leihsiung.ray@gmail.com>

Fix docstring style

55f3c72

Signed-off-by: Lei Hsiung <leihsiung.ray@gmail.com>

progress bar development

d8bab78

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

flatten activations for poisoning defenses

123af2c

Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

remove huggingface estimator activation hack

4db7626

Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

revert check on dim for fit-generator and move to a separate PR

7bed09d

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

update kwarg test to run with pb display

b3dec0f

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

run on CI pipeline

a32d798

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

remove CI to run on feature branch

043752d

Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>

Merge pull request Trusted-AI#2332 from Trusted-AI/dependabot/github_…

ab389e7

…actions/docker/build-push-action-5.1.0 Bump docker/build-push-action from 5.0.0 to 5.1.0

Merge branch 'dev_1.17.0' into simple_pb_inclusion

b71810a

Merge branch 'dev_1.17.0' into activation-defense-bug

b9f5a4d

Merge branch 'dev_1.17.0' into hf_model_wrapper_update

e896821

beat-buesser and others added 19 commits December 21, 2023 00:52

Merge pull request Trusted-AI#2327 from f4str/activation-defense-bug

95c778e

Fix `ActivationDefense` and `SpectralSignatures` expected flattened bug

Merge pull request Trusted-AI#2364 from Trusted-AI/dependabot/pip/num…

3c189ac

…py-gte-1.18.5-and-lt-1.27 Update numpy requirement from <1.25,>=1.18.5 to >=1.18.5,<1.27

Merge pull request Trusted-AI#2367 from Trusted-AI/dependabot/pip/pyt…

3de2078

…est-cov-approx-eq-4.1.0 Update pytest-cov requirement from ~=4.0.0 to ~=4.1.0

Merge pull request Trusted-AI#2369 from Trusted-AI/dependabot/pip/lib…

ea1fa92

…rosa-0.10.1 Bump librosa from 0.10.0.post2 to 0.10.1

flatten activations for poisoning defenses

4111de6

Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

remove huggingface estimator activation hack

47801a7

Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

Revert "remove huggingface estimator activation hack"

b8607cf

This reverts commit 4db7626. Signed-off-by: Farhan Ahmed <Farhan.Ahmed@ibm.com>

Update KerasClassifier for verbose argument

b62f866

Signed-off-by: Beat Buesser <beat.buesser@ibm.com>

Merge branch 'GiulioZizzo-simple_pb_inclusion' into dev_1.17.0

74be71f

Merge branch 'dev_1.17.0' into hf_notebook_dev

f29950a

Merge pull request Trusted-AI#2338 from GiulioZizzo/hf_notebook_dev

25f7ac0

Hugging Face Notebook Improvements

Merge branch 'main' into dev_1.17.0

089c929

Fix unit test

5549564

Signed-off-by: Beat Buesser <beat.buesser@ibm.com>

Merge pull request Trusted-AI#2373 from Trusted-AI/dev_1.17.0

bc8a15f

Update to ART 1.17.0

Update docs

501ad92

Signed-off-by: Beat Buesser <beat.buesser@ibm.com>

Bump version to ART 1.17.0

044f87e

Signed-off-by: Beat Buesser <beat.buesser@ibm.com>

beat-buesser self-requested a review January 9, 2024 11:35

beat-buesser self-assigned this Jan 9, 2024

beat-buesser added the enhancement New feature or request label Jan 9, 2024

beat-buesser added this to the ART 1.18.0 milestone Jan 9, 2024

beat-buesser changed the base branch from main to dev_1.18.0 January 9, 2024 12:09

beat-buesser requested review from f4str and GiulioZizzo January 9, 2024 12:09
