CW efficiency improvement and bug fix, add CW binary search version, early stop PGD version, support `L0` and `Linf` for CW and CWBS, rewrite FAB attack. #168

rikonaka · 2023-11-12T17:06:53Z

PR Type and Checklist

What kind of change does this PR introduce?

CW attack fix

There is an obscure bug in the original CW attack code F function.

In CW original code from Carlini, the real is calculate as

https://github.com/carlini/nn_robust_attacks/blob/c6b8f6a254e82a79a52cfbc673b632cad5ea1ab1/l2_attack.py#L96

It was a sum, but in torchattacks, it become max, I discovered this problem accidentally 😋.

adversarial-attacks-pytorch/torchattacks/attacks/cw.py

Line 136 in 936e86d

real = torch.max(one_hot_labels * outputs, dim=1)[0]

I also reduced the large number of tensor detech() operations and view() operations in the original code, instead used index to assign tensors, its more simple and efficiency.

At the same time, I also added the binary search version of CW (CWBS), issues #167 . Binary search can indeed significantly reduce the size of the perturbations. The red line is the value of best_L2.

I tested three cw attack algorithms L0, L2 and Linf and found that 100% attack success rate can be achieved on 50 test images.

And its pertubations is still invisible.

FAB attack fix

The original FAB code was too complicated and difficult to maintain, so I rewritten the FAB attack and split L1, L2 attacks into separate files, and I found that previous FAB code when the user specifies a target label, it does not work good with the target attack.

The old FAB code is rename as AFAB so that it could be used in autoattack.

In the FAB code forward() function

adversarial-attacks-pytorch/torchattacks/attacks/fab.py

Line 84 in 23620a6

def forward(self, images, labels):

There are no parameters for the target label, in contrast, the FAB target attack requires both labels, one for the original label and the other for the target label.

adversarial-attacks-pytorch/torchattacks/attacks/fab.py

Line 127 in 23620a6

def get_diff_logits_grads_batch_targeted(self, imgs, la, la_target):

But there is only one label entered in the entire code. If the user wants to specify the target label to be used for the attack, since there is only one label input, the computation of the code related to the target attack will actually be meaningless.

adversarial-attacks-pytorch/torchattacks/attacks/fab.py

Line 132 in 23620a6

diffy = -(y[u, la] - y[u, la_target])

For example, here la=la_target, then diffy here is meaningless.

I'll try to fix this, but don't have any clue at the moment because we need to enter two labels for the attack, which conflicts with the existing framework. So first submitted the FAB attack without the target attack version now.

FAB target attack has been completed.

…lation of the F function of the CW attack, and add CW attack binary search version

codecov-commenter · 2023-11-12T18:07:49Z

Codecov Report

Attention: Patch coverage is 81.61051% with 322 lines in your changes are missing coverage. Please review.

Project coverage is 76.69%. Comparing base (936e86d) to head (dbb4942).
Report is 1 commits behind head on master.

❗ Your organization needs to install the Codecov GitHub app to enable full functionality.

Additional details and impacted files

@@            Coverage Diff             @@
##           master     #168      +/-   ##
==========================================
+ Coverage   73.37%   76.69%   +3.32%     
==========================================
  Files          44       54      +10     
  Lines        3827     4931    +1104     
  Branches      578      604      +26     
==========================================
+ Hits         2808     3782     +974     
- Misses        862      979     +117     
- Partials      157      170      +13

Files	Coverage Δ
code_coverage/test_atks.py	`100.00% <100.00%> (+6.89%)`	⬆️
torchattacks/__init__.py	`100.00% <100.00%> (ø)`
torchattacks/attacks/autoattack.py	`80.64% <100.00%> (ø)`
torchattacks/attacks/cw.py	`100.00% <100.00%> (ø)`
torchattacks/attacks/cwbs.py	`100.00% <100.00%> (ø)`
torchattacks/attacks/cwl0.py	`100.00% <100.00%> (ø)`
torchattacks/attacks/mifgsm.py	`100.00% <100.00%> (ø)`
torchattacks/attacks/cwbsl0.py	`98.97% <98.97%> (ø)`
torchattacks/attacks/cwbslinf.py	`98.95% <98.95%> (ø)`
torchattacks/attacks/cwlinf.py	`98.52% <98.52%> (ø)`
... and 9 more

... and 5 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 936e86d...dbb4942. Read the comment docs.

ZaberKo · 2023-11-23T09:18:12Z

I think the calculation of other is still incorrect, which neglects that output logits could be negative numbers.
cwl2.py#L146

rikonaka · 2023-11-23T13:50:08Z

I think the calculation of other is still incorrect, which neglects that output logits could be negative numbers. cwl2.py#L146

Thank you very much for your advice, but this other calculation is actually translated from carlini's code (Tensorflow to Pytorch). You can check it out 😁.

https://github.com/carlini/nn_robust_attacks/blob/c6b8f6a254e82a79a52cfbc673b632cad5ea1ab1/l2_attack.py#L97

And you mentioned that logits may be negative, the original author's code also directly used the value before softmax.

https://github.com/carlini/nn_robust_attacks/blob/c6b8f6a254e82a79a52cfbc673b632cad5ea1ab1/l2_attack.py#L90C33-L90C33

So this should be correct 😉.

ZaberKo · 2023-11-23T14:50:04Z

I think the calculation of other is still incorrect, which neglects that output logits could be negative numbers. cwl2.py#L146

Thank you very much for your advice, but this other calculation is actually translated from carlini's code (Tensorflow to Pytorch). You can check it out 😁.

https://github.com/carlini/nn_robust_attacks/blob/c6b8f6a254e82a79a52cfbc673b632cad5ea1ab1/l2_attack.py#L97

And you mentioned that logits may be negative, the original author's code also directly used the value before softmax.

https://github.com/carlini/nn_robust_attacks/blob/c6b8f6a254e82a79a52cfbc673b632cad5ea1ab1/l2_attack.py#L90C33-L90C33

So this should be correct 😉.

Thanks for the quick response. I think you misunderstand the issue. A quick fix of cwl2.py#L146 would be like:

other = torch.max((1 - one_hot_labels) * outputs - one_hot_labels*10000., dim=1)[0]

rikonaka · 2023-11-23T15:14:28Z

Thanks for the quick response. I think you misunderstand the issue. A quick fix of cwl2.py#L146 would be like:
other = torch.max((1 - one_hot_labels) * outputs - one_hot_labels*10000., dim=1)[0]

Good question, well, in here we will pick the maximum value of the logits except true label, so if here we only have 1 images, the outputs will be

[
[x1, x2, x3, x4]
]

Then we used the one_hot_labels to mask one positon (suppose the x3), we will got

[
[x1, x2, 0, x4]
]

So the torch.max will caculater the max value of x1, x2, 0 and x3.

In Tensorflow, the original author subtracts that value (one_hot_labels*10000) to prevent the all logits are negative (I haven't used tensorflow for a long time 🤣), this is a point that can be improved. But In pytroch, and in here logits is greater than 0.

So the situation where all logits are negative that you are worried about will not happen 😉.

ZaberKo · 2023-11-24T02:33:59Z

Thanks for the quick response. I think you misunderstand the issue. A quick fix of cwl2.py#L146 would be like:
other = torch.max((1 - one_hot_labels) * outputs - one_hot_labels*10000., dim=1)[0]
Good question, well, in here we will pick the maximum value of the logits except true label, so if here we only have 1 images, the outputs will be
[
[x1, x2, x3, x4]
]
Then we used the one_hot_labels to mask one positon (suppose the x3), we will got
[
[x1, x2, 0, x4]
]
So the torch.max will caculater the max value of x1, x2, 0 and x3.

In Tensorflow, the original author subtracts that value (one_hot_labels*10000) to prevent the all logits are negative (I haven't used tensorflow for a long time 🤣), this is a point that can be improved. But In pytroch, and in here logits is greater than 0.

So the situation where all logits are negative that you are worried about will not happen 😉.

However, there is no such guarantee that the output logits must be non-negtive in pytorch, for arbitrary models under any training methods.

rikonaka · 2023-11-24T03:33:19Z

However, there is no such guarantee that the output logits must be non-negtive in pytorch, for arbitrary models under any training methods.

😵‍💫 The same, there is also no such guarantee that the output logits must be negative in pytorch, for arbitrary models under any training methods. If you can provide any evidence that the logits output of some model is all negative, it may be able to further support your argument.

ZaberKo · 2023-11-24T08:03:23Z

However, there is no such guarantee that the output logits must be non-negtive in pytorch, for arbitrary models under any training methods.

😵‍💫 The same, there is also no such guarantee that the output logits must be negative in pytorch, for arbitrary models under any training methods. If you can provide any evidence that the logits output of some model is all negative, it may be able to further support your argument.

That is not the point. The point here is that we need to cover all cases, even though some of them are rare. Here are some other implementations of CW f_func in pytorch for reference:

rikonaka · 2023-11-24T08:46:41Z

However, there is no such guarantee that the output logits must be non-negtive in pytorch, for arbitrary models under any training methods.

😵‍💫 The same, there is also no such guarantee that the output logits must be negative in pytorch, for arbitrary models under any training methods. If you can provide any evidence that the logits output of some model is all negative, it may be able to further support your argument.

That is not the point. The point here is that we need to cover all cases, even though some of them are rare. Here are some other implementations of CW f_func in pytorch for reference:
* [imrahulr/adversarial_robustness_pytorch](https://github.com/imrahulr/adversarial_robustness_pytorch/blob/6df6a8f0cd49cf6d18507a4b574c004ab6eedf49/core/attacks/utils.py#L212)

* [thu-ml/ares](https://github.com/thu-ml/ares/blob/306e35fe4309d791f9252bb6aab51198d2b9b511/ares/attack/cw.py#L133)

Thanks for your suggestion 👍, I will rewrite this f function quickly. Next time, please provide detailed information directly from the beginning, instead of wasting other people's time by making people guess and misunderstand of your short information.

…s of the graph are freed when you call .backward() or autograd.grad().`

Adversarian · 2023-11-29T09:01:24Z

Thanks for the effort you made to improve the implementation of CW in this library. I had one suggestion, and correct me if it is not feasible to implement, but wouldn't it be better if you aliased one of the variants of CW (e.g. CWL0 or CWLinf etc.) as CW so that this version doesn't introduce a breaking change for torchattacks.CW to preserve backward compatibility?

You could use the version of CW that was previously used (I believe CWL2 in the current implementation) as an alias to remediate this (as easily as something like CW = CWL2 for instance).

rikonaka · 2023-11-29T13:26:33Z

Thanks for the effort you made to improve the implementation of CW in this library. I had one suggestion, and correct me if it is not feasible to implement, but wouldn't it be better if you aliased one of the variants of CW (e.g. CWL0 or CWLinf etc.) as CW so that this version doesn't introduce a breaking change for torchattacks.CW to preserve backward compatibility?

You could use the version of CW that was previously used (I believe CWL2 in the current implementation) as an alias to remediate this (as easily as something like CW = CWL2 for instance).

Thank you very much for your suggestion. I will move CWL2 to CW now. 😉

… algorithm (they are essentially one and the same)

…respondence with the pseudo-code in the paper

rikonaka added 3 commits November 13, 2023 00:53

Improve the efficiency of the CW attack and fix an error in the calcu…

c407e6c

…lation of the F function of the CW attack, and add CW attack binary search version

Remove some note

64d9d86

Fix some mistakes

7a1ce4a

rikonaka changed the title ~~Improve the efficiency of the Original CW attack and fix an error in the calculation of the F function of the CW attack algorithm, and add CW binary search version.~~ CW efficiency improvement and bug fix, add CW binary search version. Nov 12, 2023

rikonaka added 3 commits November 13, 2023 12:41

Unify two CW algorithms and reduce unnecessary operations

67a338f

Change CW and CWBS abort_early default value from False to True

4bdae87

Add L0 and Linf for CW and CWBS

1e82289

rikonaka changed the title ~~CW efficiency improvement and bug fix, add CW binary search version.~~ CW efficiency improvement and bug fix, add CW binary search version, support L0 and Linf for CW and CWBS. Nov 19, 2023

rikonaka added 6 commits November 19, 2023 13:52

Split cw

ae89b03

Fix some class name

8aac46b

Fix some info

c955109

Add readme

6cb7015

Fix CW and CWBS L0 error

4adf68b

Change some name avoid misunderstood

628b829

Rename some var avoid misunderstood

f89bcda

rikonaka added 2 commits November 24, 2023 16:58

Fix other as ZaberKo suggest

d870493

Clone the attack result avoid pytorch error `Saved intermediate value…

a464809

…s of the graph are freed when you call .backward() or autograd.grad().`

rikonaka added 2 commits November 29, 2023 21:28

Move CWL2 to CW as Adversarian suggestion

6ba76d4

Add parameter type restrictions

9762cab

rikonaka added 16 commits January 31, 2024 21:31

Small fix

add20d4

Fix PGDES default parameter value

463b471

Fix PGDES name in info

b491f83

Add PyTroch 2.1 and 2.2 test

6381b5d

Remove duplicate imports

bb32c37

Auto-test small fix

d614f92

Rewrite the EAD algorithm to make the code logically closer to the CW…

df65dea

… algorithm (they are essentially one and the same)

Rewrite the EAD algorithm to make the code logically closer to the CW…

66de69b

… algorithm (they are essentially one and the same)

Fix EADEN name mistake

c08b63e

Remove one duplicate line in EAD attack

74ea149

Rename PGDES to ESPGD

b761501

Rename PGDES to ESPGD

784b4f4

Fix some name error

21ae294

Fix some name error

866d14a

Re-write FAB attack

be99cb3

Add info in readme and fix name mistake

66a2e13

rikonaka added 11 commits April 1, 2024 00:55

Fix type error

9701743

Fix type error

8dedd74

Fix cuda error

1ca9fe3

Fix autoattack FAB attack bug

5e9c07a

Fix target attack how labels input problems

59e5c34

Fix target attack how labels input problems

3c12bbc

Add target attack for FAB

c696144

Fix L1 some mistakes

1386710

Fix L1 some mistakes

53d35ec

The code on momentum in the original mi-fgsm is complex and lacks cor…

58d55d4

…respondence with the pseudo-code in the paper

The code on momentum in the original mi-fgsm is complex and lacks cor…

dbb4942

…respondence with the pseudo-code in the paper

rikonaka mentioned this pull request May 20, 2024

[BUG] Bugs in the f function of cw.py #184

Open

Try to fix JSMA huge GPU mem usage

411e41e

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

CW efficiency improvement and bug fix, add CW binary search version, early stop PGD version, support `L0` and `Linf` for CW and CWBS, rewrite FAB attack. #168

CW efficiency improvement and bug fix, add CW binary search version, early stop PGD version, support `L0` and `Linf` for CW and CWBS, rewrite FAB attack. #168

rikonaka commented Nov 12, 2023 •

edited

codecov-commenter commented Nov 12, 2023 •

edited

ZaberKo commented Nov 23, 2023 •

edited

rikonaka commented Nov 23, 2023

ZaberKo commented Nov 23, 2023

rikonaka commented Nov 23, 2023 •

edited

ZaberKo commented Nov 24, 2023 •

edited

rikonaka commented Nov 24, 2023 •

edited

ZaberKo commented Nov 24, 2023

rikonaka commented Nov 24, 2023 •

edited

Adversarian commented Nov 29, 2023 •

edited

rikonaka commented Nov 29, 2023

CW efficiency improvement and bug fix, add CW binary search version, early stop PGD version, support L0 and Linf for CW and CWBS, rewrite FAB attack. #168

Are you sure you want to change the base?

CW efficiency improvement and bug fix, add CW binary search version, early stop PGD version, support L0 and Linf for CW and CWBS, rewrite FAB attack. #168

Conversation

rikonaka commented Nov 12, 2023 • edited

PR Type and Checklist

CW attack fix

FAB attack fix

codecov-commenter commented Nov 12, 2023 • edited

Codecov Report

ZaberKo commented Nov 23, 2023 • edited

rikonaka commented Nov 23, 2023

ZaberKo commented Nov 23, 2023

rikonaka commented Nov 23, 2023 • edited

ZaberKo commented Nov 24, 2023 • edited

rikonaka commented Nov 24, 2023 • edited

ZaberKo commented Nov 24, 2023

rikonaka commented Nov 24, 2023 • edited

Adversarian commented Nov 29, 2023 • edited

rikonaka commented Nov 29, 2023

CW efficiency improvement and bug fix, add CW binary search version, early stop PGD version, support `L0` and `Linf` for CW and CWBS, rewrite FAB attack. #168

CW efficiency improvement and bug fix, add CW binary search version, early stop PGD version, support `L0` and `Linf` for CW and CWBS, rewrite FAB attack. #168

rikonaka commented Nov 12, 2023 •

edited

codecov-commenter commented Nov 12, 2023 •

edited

ZaberKo commented Nov 23, 2023 •

edited

rikonaka commented Nov 23, 2023 •

edited

ZaberKo commented Nov 24, 2023 •

edited

rikonaka commented Nov 24, 2023 •

edited

rikonaka commented Nov 24, 2023 •

edited

Adversarian commented Nov 29, 2023 •

edited