
[Explainability] binary_classification mode + link prediction example #6083

Merged: 25 commits into pyg-team:master on Dec 9, 2022

Conversation

@camillepradel (Contributor, author) commented on Nov 28, 2022

Progress towards #5924

Done:

  • Adds support for a binary classification mode by splitting ModelMode.classification into ModelMode.binary_classification and ModelMode.multiclass_classification.
  • Adds an example script, gnn_explainer_link_pred.py, which trains a link prediction model and explains one of its outputs (see the sketch below).
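For context, here is a minimal, hypothetical sketch of how such an example might wire up the explainer. It is not taken from the PR: it assumes the torch_geometric.explain API as it exists after this PR was merged, and ToyLinkPredictor and the random tensors are stand-ins for the trained model and dataset used in the real script.

```python
import torch
from torch_geometric.explain import Explainer
from torch_geometric.explain.algorithm import GNNExplainer
from torch_geometric.nn import GCNConv


class ToyLinkPredictor(torch.nn.Module):
    # Stand-in for a trained link prediction model (illustrative only; the
    # real example trains a proper encoder/decoder on a dataset).
    def __init__(self):
        super().__init__()
        self.conv = GCNConv(16, 32)

    def forward(self, x, edge_index, edge_label_index):
        z = self.conv(x, edge_index)
        src, dst = edge_label_index
        return (z[src] * z[dst]).sum(dim=-1)  # one raw logit per candidate edge


x = torch.randn(10, 16)
edge_index = torch.randint(0, 10, (2, 40))
edge_label_index = torch.randint(0, 10, (2, 5))

explainer = Explainer(
    model=ToyLinkPredictor(),
    algorithm=GNNExplainer(epochs=200),
    explanation_type='model',
    node_mask_type='attributes',
    edge_mask_type='object',
    model_config=dict(
        mode='binary_classification',  # the mode introduced by this PR
        task_level='edge',
        return_type='raw',
    ),
)

# Explain the prediction for the first candidate edge.
explanation = explainer(x, edge_index, index=0,
                        edge_label_index=edge_label_index)
print(explanation.edge_mask.shape)
```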

@codecov (bot) commented on Nov 28, 2022

Codecov Report

Merging #6083 (a553fb7) into master (f343295) will increase coverage by 0.00%.
The diff coverage is 100.00%.

❗ The current head a553fb7 differs from the pull request's most recent head b01f4af. Consider uploading reports for commit b01f4af to get more accurate results.

@@           Coverage Diff           @@
##           master    #6083   +/-   ##
=======================================
  Coverage   84.48%   84.49%           
=======================================
  Files         371      371           
  Lines       20725    20738   +13     
=======================================
+ Hits        17509    17522   +13     
  Misses       3216     3216           
Impacted Files Coverage Δ
torch_geometric/explain/algorithm/base.py 96.61% <ø> (-0.36%) ⬇️
torch_geometric/explain/algorithm/gnn_explainer.py 96.29% <100.00%> (+0.32%) ⬆️
torch_geometric/explain/config.py 100.00% <100.00%> (ø)
torch_geometric/explain/explainer.py 100.00% <100.00%> (ø)


@avgupta456 (Contributor) commented:

I wonder if we could combine this PR and #6056. This PR contributes an example script while #6056 handles the k_hop_subgraph and adds edge-level tests for GNNExplainer.

@camillepradel (Contributor, author) commented:

Oops, I didn't see your PR! I will have a look, thanks!

@rusty1s (Member) commented on Nov 29, 2022

@camillepradel Any reason to close this? We would still like to integrate an example of this :)

@camillepradel (Contributor, author) commented:

You are right. I wanted to open a new PR, but there is actually no good reason not to keep using this one.

@camillepradel reopened this on Nov 30, 2022.
@camillepradel changed the title from "Link prediction explanation" to "Link prediction explanation example" on Nov 30, 2022.
@camillepradel changed the title from "Link prediction explanation example" to "[Explainability] binary_classification mode + link prediction example" on Dec 2, 2022.
@camillepradel (Contributor, author) commented:

So I updated the example, but since it is a link prediction task, I needed the explainability framework to support binary classification. I ended up adding a new binary_classification mode (and renaming the original classification mode to multiclass_classification).
If somebody can think of a simpler way to support binary classification, I am curious to hear about it.
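A rough sketch of the split described above. The actual enum lives in torch_geometric/explain/config.py; the member values here follow the PR description and may not match the merged code verbatim.

```python
from enum import Enum


class ModelMode(Enum):
    # Previously a single `classification` member covered both cases.
    binary_classification = 'binary_classification'
    multiclass_classification = 'multiclass_classification'
    regression = 'regression'
```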

@camillepradel marked this pull request as ready for review on December 2, 2022, 19:05.
@RBendias (Contributor) left a review comment:

Hi, thanks a lot for the PR! I left some comments in the example file. Regarding the binary_classification type, I also think we should find a simpler solution, e.g., by selecting the loss based on the model's output. I'll take a closer look.

Seven review comments on examples/gnn_explainer_link_pred.py (all outdated or resolved).
@RBendias (Contributor) commented on Dec 5, 2022

I checked the code again. The main reason I see for the additional type is to check if the return_type is set correctly in the ModelConfig. The return_type needs to be probs, as we use binary_cross_entropy. However, we could also measure the MSE between the raw/probs/log_probs values (also using the raw model output in get_prediction instead of setting a threshold of 0.5). @camillepradel What do you think?
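As an illustration of this alternative, here is a hypothetical loss selector driven only by return_type. It is not code from the PR; the function name and structure are made up, and only standard PyTorch calls (binary_cross_entropy, binary_cross_entropy_with_logits, mse_loss) are used.

```python
import torch
import torch.nn.functional as F


def binary_loss(y_hat: torch.Tensor, y: torch.Tensor, return_type: str) -> torch.Tensor:
    """Pick a loss for binary targets based on what the model returns (illustrative)."""
    if return_type == 'raw':
        # Logits: numerically stable BCE on raw outputs.
        return F.binary_cross_entropy_with_logits(y_hat, y.float())
    if return_type == 'probs':
        # Probabilities in [0, 1]: plain BCE.
        return F.binary_cross_entropy(y_hat, y.float())
    # Fallback discussed above: compare the values directly with MSE.
    return F.mse_loss(y_hat, y.float())
```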

@camillepradel (Contributor, author) commented:

The distinction between the multiclass_classification and binary_classification modes is indeed used in ModelConfig, but also when processing the prediction in Explainer and the loss in GNNExplainer. In the latter two cases, it allows the model output to be handled differently according to the mode. I initially thought we could also infer the difference from the shape of the output, but that looked tricky to me (depending on the number of classes and the optional batching, we might not be able to tell), which is why I chose to define two distinct modes explicitly.

You are right, we don't have to enforce return_type to be probs for binary classification (I have never seen a log_probs output in that setup, but it seems legitimate); I will change that.
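To illustrate why the explicit mode matters when post-processing predictions, here is a hypothetical sketch (not the actual Explainer code; the function name is made up):

```python
import torch


def to_hard_prediction(out: torch.Tensor, mode: str) -> torch.Tensor:
    """Turn raw model output into discrete targets, depending on the mode (illustrative)."""
    if mode == 'binary_classification':
        # A 1-D tensor of probabilities cannot be told apart from a batched
        # multiclass output by shape alone, hence the explicit mode.
        return (out > 0.5).long()
    if mode == 'multiclass_classification':
        return out.argmax(dim=-1)
    return out  # regression: keep the raw values
```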

@camillepradel (Contributor, author) commented:

I updated the code to allow the raw and log_probs return types, but I am still a bit confused about log_probs.

More specifically, I don't know how to apply an MSE loss directly on log_probs, since log probabilities range over (−∞, 0]. In the current version, I apply binary_cross_entropy to the exp() of y_hat.
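A minimal sketch of the workaround described above, assuming y_hat holds log probabilities and y holds binary targets (the tensor values are illustrative stand-ins, not the PR code):

```python
import torch
import torch.nn.functional as F

# Stand-in for model output as log probabilities in (-inf, 0].
y_hat = F.logsigmoid(torch.randn(8))
y = torch.randint(0, 2, (8,)).float()   # binary ground-truth targets

# Map log probabilities back to probabilities in (0, 1] before applying BCE,
# as described in the comment above.
loss = F.binary_cross_entropy(y_hat.exp(), y)
```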

@rusty1s (Member) left a review comment:

LGTM, a few nit-picky comments. I think we need to drop log_probs support in binary_classification.

Further review comments on torch_geometric/explain/explainer.py, torch_geometric/explain/algorithm/gnn_explainer.py, and test/explain/algorithm/test_gnn_explainer.py (all resolved).
@rusty1s (Member) left a follow-up review comment:

Thanks for the updates. Looks great!

@rusty1s enabled auto-merge (squash) on December 9, 2022, 13:30.
@rusty1s merged commit 737cc76 into pyg-team:master on Dec 9, 2022.