
Projected and Greedy Randomized Block Coordinate Descent (NeurIPS 2021) #5972

Merged
67 commits merged into pyg-team:master on Jan 21, 2023

Conversation

sigeisler
Contributor

Hi PyG-team,

Thank you for the tremendous effort in putting this code base together and maintaining it to this high level of quality.

Similar to the Explainer and GNNExplainer models, I hereby submit the code for our adversarial attacks based on Randomized Block Coordinate Descent (RBCD), which appeared at NeurIPS 2021. Our efficient (i.e., sparse) gradient-based attacks apply to most models in PyG and cover the most popular threat models (see below for more details).

Adversarial robustness is closely related to explainability (here with respect to the graph structure). Specifically, an adversarial attack asks for the limited set of edges that harms the prediction the most, while (GNN-)Explainer seeks the set of friendly edges (i.e., the minimal set that leaves the prediction unchanged). Due to these similarities, I have integrated the attacks in the same way GNNExplainer was integrated.

Adversarial robustness of GNNs is an emerging field (the two seminal works have been cited more than 1000 times), and it would be of even greater practical relevance if the "entry barrier" were lowered. Moreover, having practical adversarial attacks in PyG's code base comes with opportunities for its users. These include: (1) seamless adversarial training for PyG's large collection of models; (2) adversarial attacks as a building block towards counterfactual explainability (and explainability is already part of PyG's codebase); (3) adversarial attacks for unsupervised representation learning (similar to Deep Graph Infomax).

Importantly, having generically implemented adversarial attacks that can be directly applied to models (i.e., basic adaptive attacks) would counteract a common antipattern in the graph learning community. In our work "Are Defenses for Graph Neural Networks Robust?" (NeurIPS 2022), we show that such adaptive attacks greatly help to obtain more realistic robustness estimates.

Our Projected and Greedy Randomized Block Coordinate Descent attacks are (to the best of my knowledge) the only practical general-purpose gradient-based adversarial attacks on GNNs (i.e., they work with sparse matrices and come with only linear overhead for test-time attacks). Our gradient-based approach relaxes the weights of the adjacency matrix from {0,1} to [0,1] during the attack and therefore seamlessly integrates with all models that support weighted adjacency matrices and do not apply non-differentiable operations to the edge weights. That said, non-differentiable models can often be made differentiable (e.g., using the Gumbel-softmax trick).
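To make the relaxation concrete, here is a minimal, dependency-free sketch of the projection step such attacks perform after each gradient update: clip the relaxed weights back into [0,1] and, if the edge budget is exceeded, shift all weights by a constant found via bisection. The function name and plain-list representation are illustrative only, not the PyG API.

```python
def project_onto_budget(weights, budget, eps=1e-7):
    """Project relaxed edge weights onto {w in [0,1]^n : sum(w) <= budget}.

    Hypothetical plain-Python sketch; the actual PRBCD implementation
    operates on a sparse block of candidate edges as torch tensors.
    """
    clipped = [min(max(w, 0.0), 1.0) for w in weights]
    if sum(clipped) <= budget:
        # Budget already satisfied: clipping to [0,1] is enough.
        return clipped

    # Bisection on the shift `mu` such that sum(clip(w - mu, 0, 1)) == budget.
    lo, hi = min(weights) - 1.0, max(weights)
    while hi - lo > eps:
        mu = (lo + hi) / 2.0
        total = sum(min(max(w - mu, 0.0), 1.0) for w in weights)
        if total > budget:
            lo = mu  # shift too small, still over budget
        else:
            hi = mu  # shift large enough
    mu = (lo + hi) / 2.0
    return [min(max(w - mu, 0.0), 1.0) for w in weights]
```

After the final projection, the continuous weights are typically discretized back to {0,1}, e.g., by sampling or taking the top-budget entries.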

This implementation also covers the most popular tasks and threat models. The defaults are set for node and graph classification (see the unit tests for the latter), and other tasks (e.g., regression) can be handled by providing an appropriate model/loss pair. The threat models covered are:

(a) "global attacks": the adversary targets a large fraction of nodes at once (e.g., the test set);
(b) "local attacks": the adversary targets the prediction of a single node in node classification;

in any combination with:

(I) "evasion": attack at test time (most relevant for inductive learning);
(II) "poisoning": attack at training time (most relevant for transductive learning; see the poisoning example).
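To illustrate what an "appropriate model/loss pair" might look like for classification, here is a hypothetical plain-Python sketch of a probability-margin loss for a single prediction. The function name, signature, and single-example shape are illustrative assumptions, not the PyG API (which consumes batched tensors):

```python
def probability_margin_loss(probs, true_class):
    """Margin between the strongest *other* class and the true class.

    Positive value => the model misclassifies; an attacker therefore
    *maximizes* this loss to push the margin past zero.
    """
    true_prob = probs[true_class]
    best_other = max(p for i, p in enumerate(probs) if i != true_class)
    return best_other - true_prob
```

Maximizing this loss drives the prediction toward misclassification; for regression one would instead supply, e.g., the (positive) squared error between prediction and target as the quantity to maximize.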

I am happy to include further suggestions and answer any questions. Please let me know if you have any feedback.

@codecov

codecov bot commented Nov 14, 2022

Codecov Report

Merging #5972 (b4f831b) into master (2463371) will increase coverage by 0.20%.
The diff coverage is 96.59%.

❗ Current head b4f831b differs from pull request most recent head d300ba9. Consider uploading reports for the commit d300ba9 to get more accurate results

@@            Coverage Diff             @@
##           master    #5972      +/-   ##
==========================================
+ Coverage   85.30%   85.51%   +0.20%     
==========================================
  Files         402      403       +1     
  Lines       21731    22024     +293     
==========================================
+ Hits        18537    18833     +296     
+ Misses       3194     3191       -3     
Impacted Files Coverage Δ
torch_geometric/contrib/nn/models/rbcd_attack.py 96.57% <96.57%> (ø)
torch_geometric/contrib/nn/models/__init__.py 100.00% <100.00%> (+100.00%) ⬆️
torch_geometric/contrib/__init__.py 100.00% <0.00%> (+100.00%) ⬆️
torch_geometric/contrib/nn/__init__.py 100.00% <0.00%> (+100.00%) ⬆️
torch_geometric/contrib/explain/__init__.py 100.00% <0.00%> (+100.00%) ⬆️
torch_geometric/contrib/nn/conv/__init__.py 100.00% <0.00%> (+100.00%) ⬆️
torch_geometric/contrib/datasets/__init__.py 100.00% <0.00%> (+100.00%) ⬆️
torch_geometric/contrib/transforms/__init__.py 100.00% <0.00%> (+100.00%) ⬆️


@sigeisler
Contributor Author

Thank you @wsad1

I am a bit confused, though, that you assigned me. Is there any action required from my side at the moment?

@rusty1s
Member

rusty1s commented Nov 15, 2022

@sigeisler No action required until a review lands. This just means you are the author of this PR :)

@EdisonLeeeee
Contributor

@sigeisler Thanks for adding these. Looks pretty good! This is also related to my current research, and it really helped!

Contributor

@EdisonLeeeee EdisonLeeeee left a comment


Left a few initial comments. Will take a look again soon :)

Contributor

@EdisonLeeeee EdisonLeeeee left a comment


Sorry for the delay. Left some minor points. Will review again in the coming days.

Co-authored-by: Jintang Li <cnljt@outlook.com>
@sigeisler
Contributor Author

Hi @rusty1s I think we are done at the moment and would appreciate input from your side.

@rusty1s
Member

rusty1s commented Dec 16, 2022

Sounds good, thanks for the effort.

Member

@rusty1s rusty1s left a comment


Thank you! Overall, this looks great. I have some questions on the need for adding model arguments as attributes of the module, and on the need for the device transfers. Other than that, most comments are personal nits :)

print(f'Clean accuracy: {clean_accuracy:.3f}')


class PoisoningPRBCDAttack(PRBCDAttack):
Member


Any reason this is implemented in the example rather than as a module in nn/models?

Contributor Author

@sigeisler sigeisler Dec 29, 2022


The main reason is that the training loop can have so many different variations. Of course, we could pass a method that defines the training procedure instead of tons of configuration options. Moreover, the implementation here requires an additional dependency (i.e., higher).

If you prefer to have the "poisoning" variant as a dedicated model or configurable option, I am more than happy to implement it.

@sigeisler
Contributor Author

@rusty1s thank you for your time and the comments!

Here are my responses to the high-level questions:

... the need of adding model arguments as attributes of the module...

This was done in analogy to the (old) implementation of GNNExplainer (I have not checked how it is now). But I agree that we could add it to the attack method. I think the sole advantage of having this as a class attribute is that one can move the attack (including the model) to a specific device. However, the user has to handle the device of the attack method's inputs anyway.

... the need for the device transfers.

The idea is to save (valuable) GPU memory. Note that here we maintain a clean graph and construct its perturbed counterpart. Thus, without moving the clean graph to the CPU, the hardware accelerator would need to store two graphs of similar size.

@rusty1s
Member

rusty1s commented Dec 20, 2022

Thanks for the clarification. This makes sense to me.

@sigeisler
Contributor Author

@rusty1s I worked through your comments and adapted the code accordingly. I left some replies to explain my choices. Please let me know if there are further action items.

Thanks again for your time and the helpful feedback!

@sigeisler
Contributor Author

Hi @rusty1s, I know you must be quite busy; however, I am wondering how to proceed. Thanks.

@rusty1s
Member

rusty1s commented Jan 13, 2023

This should be ready to merge. Will take a last look and merge it soon.

@rusty1s rusty1s enabled auto-merge (squash) January 21, 2023 09:02
@rusty1s rusty1s merged commit 2ca6180 into pyg-team:master Jan 21, 2023
@sigeisler
Contributor Author

Hi @rusty1s thanks for seeing this through :-)

I have seen that you have now moved this to the contrib package. Out of curiosity, what are the intended differences? Does contrib imply a lower level of quality, or is this about maintenance (e.g., you do not plan to support all features in the long haul)?

Btw. if you need any input whatsoever for promoting this as one of the models in the new contrib package, please let me know.

@rusty1s
Member

rusty1s commented Jan 27, 2023

Hi @sigeisler. Yes, I indeed moved this to contrib, since we plan to move every big new model there first going forward. This does not have anything to do with code quality (that one is awesome), but rather with maintenance and API stability. Hope that's okay :)

@sigeisler
Contributor Author

@rusty1s makes a lot of sense! Thanks for clarifying.
