Pytorch wrapper & Documentation update #132

Merged
merged 27 commits from lucas/torch into master on Sep 7, 2023

Conversation

@lucashervier (Collaborator) commented on Jul 18, 2023

New Features

A PyTorch wrapper

The main purpose of this PR is to provide, within Xplique, a convenient wrapper for PyTorch models (4788b94). The tests were built in 693e3fa and mainly check that it works for most attribution methods and is compatible with metrics. In addition, f5138ae provides the related documentation and points to a Getting Started tutorial. Finally, a test pipeline for cross-configuration between TensorFlow and PyTorch was added in 82231f4. This last commit also puts all the configuration files into a single setup.cfg. The commit e1a29b7 includes all the requested/recommended changes after the first round of reviews.
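
To give a sense of how the wrapper plugs into the existing API, here is a minimal sketch. The import path, the `TorchWrapper` name and its constructor arguments are assumptions based on this PR's description and should be double-checked against docs/pytorch.md:

```python
import numpy as np
import torch
import torchvision
from xplique.wrappers import TorchWrapper   # assumed import path
from xplique.attributions import Saliency

# A regular PyTorch model, set to evaluation mode before wrapping
torch_model = torchvision.models.resnet18(weights="DEFAULT").eval()
device = "cuda" if torch.cuda.is_available() else "cpu"

# Wrap the torch module so it behaves like a TF-compatible callable
wrapped_model = TorchWrapper(torch_model, device=device)

# Dummy channels-last inputs and one-hot targets, as usually expected by Xplique
inputs = np.random.rand(4, 224, 224, 3).astype(np.float32)
targets = np.eye(1000)[[0, 1, 2, 3]].astype(np.float32)

# From here, the usual Xplique workflow applies
explainer = Saliency(wrapped_model)
explanations = explainer(inputs, targets)
```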

Add an Enum for tasks

95ac086 and dc8484f add the Tasks enum, which includes the operators for classification and regression tasks. The possibility to choose an existing operator by its name was also added. These commits also fix the regression operator.
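
As an illustration, operator selection could then look like the sketch below (assuming the enum is importable as `xplique.Tasks`; `model` stands for any TF model or wrapped PyTorch model):

```python
from xplique import Tasks                 # assumed export location of the enum
from xplique.attributions import Saliency

# Pick the classification operator through the enum...
explainer = Saliency(model, operator=Tasks.CLASSIFICATION)

# ...or, as added in this PR, by its name
explainer = Saliency(model, operator="classification")
```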

Add an activation option for metrics

While we recommend using the logits to generate explanations, it might be more relevant to look at the probability of a prediction (after a softmax or sigmoid layer) when computing metrics, for instance when the metric measures 'a drop in probability for the classification of an input occluded in the most relevant parts'. Thus, e60a5ff introduces this option when you build a metric: activation can be either None, 'softmax' or 'sigmoid'. This commit also includes an update of the documentation and the corresponding tests.
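
For instance, building a fidelity metric with this option might look like the following sketch (assuming the `Deletion` metric and the new `activation` keyword introduced in e60a5ff; `model`, `inputs`, `targets` and `explanations` are placeholders):

```python
from xplique.metrics import Deletion

# Explanations are still computed on the logits, but the metric applies a
# softmax on top of the model output so the score reflects probability drops.
metric = Deletion(model, inputs, targets, activation="softmax")
score = metric(explanations)
```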

Warning: @fel-thomas this means the tutorials related to metrics need to be checked.

Documentation Enhancement

Some important documentation was missing. In particular, there was no clear description of the model parameter. Furthermore, since v1.x the operator concept was introduced but did not have a dedicated section. This was added in 1cc4ffd and further enhanced with the feedback on this PR in dc8484f. Additionally, this PR provides better assets for the dark theme, justifies content in the docs, and adds a collapsible section to both the README and the documentation homepage. These changes are meant to make those two pages easier to navigate quickly.

Continuous Integration

Docs workflow

7cad1c4 is dedicated to building a pipeline that automatically versions, builds and deploys the documentation on releases. Thus, it avoids "forgetting" to build the documentation, and versioning will allow mapping each release to its dedicated documentation.

Better issue templates

adc363b introduces improved issue templates.

@lucashervier added the documentation, enhancement, and continuous-integration labels on Jul 18, 2023
@lucashervier requested a review from dv-ai on Jul 18, 2023
@dv-ai (Collaborator) left a comment

Tests:
• Python version: The library was tested with Python 3.6 (python-lints). Any good reason to remove it?

README:
• In the table of attribution methods: Sobol and HSIC are tagged as compatible with "TF, PyTorch", but these methods should be compatible with any callable model. From my point of view, when a method is compatible with any callable model, we should replace the "Callable" tag by "TF, PyTorch, Callable". It will be more understandable for the user, because a dedicated wrapper has been developed for PyTorch.
• Why are metrics only compatible with TF and PyTorch and not with callable models?

api_attributions.md
• Gradient-based approaches: We should explain why the white-box methods are compatible or not: gradient-based white-box methods are supported when the method doesn't need to "modify" some part of the model.

model.md
• In the section "The inputs have expected shape", I recommend adding a section "Input format" at the same level as "General". In this new section, I would put the sub-sections "Images data", "Tabular data" and "Time-Series data".
• In the section "The inputs have expected shape", I would add a new section "Task" where, for each task, including segmentation and detection, there is a description of the output of the model and a description of the target parameter (cf. following comment).
• The targets parameter is always a source of confusion for the user. It should be better described in this section. We should explain how it is used by the library and how the user should fill it. It is done for the image case in the different task explanations, but I think we should have a dedicated paragraph to explain the targets parameter.
• "Time-Series data": The note is not clear enough for me on how to solve the issue for Lime and KernelShap.
• The "trick" that has been done to adapt Xplique to PyTorch should also work for other kinds of libraries like sklearn. Maybe we should mention it.
• We should explain the different kinds of enum values that the user can select for the operator parameter (the Tasks enum is not described).

operator.md
• It is not clear that the operator can be an enum value. We should explain and describe it (the Tasks enum is not described).

pytorch.py
• Why do we need outputs.cpu() in the call to backward?

@AntoninPoche (Collaborator) left a comment

Great work! The PyTorch wrapper works well, even for the segmentation task I tested. And the documentation update is more than welcome!

I have a few other comments on the tutorial:

  • Verbosity, in the beginning, could be improved by:
    • Adding a -q to !pip install xplique
    • Adding verbose=False or something like this to torch.hub.load
  • To get to the point, I would collapse the cells regarding downloading and preprocessing the images and model.
  • For explanations, I would highlight more that the only difference is the wrapping of the model. And maybe talk a bit about the expected shape of the images.
  • Regarding metrics, I do not understand why you used different images than the first ones. Is it to respect the guidelines? If so, you should say it. Otherwise, I think it makes the tutorial heavier.

Review threads on: README.md, docs/api/attributions/api_attributions.md, mkdocs.yml, docs/api/attributions/operator.md, docs/pytorch.md, docs/index.md, xplique/commons/operators.py, xplique/attributions/deconvnet.py, xplique/attributions/base.py, xplique/metrics/base.py
@fel-thomas (Member) left a comment

This is remarkable work, Lucas – undoubtedly the most impressive pull request in Xplique since the library's creation.

There's a lot to verify, and given the substantial changes, it's crucial to ensure a thorough review. To start, I suggest splitting the pull request into two parts: one for the PyTorch wrapper implementation and associated documentation.

Regarding the other aspects such as continuous integration (CI), the remaining documentation, setup configuration (setup.cfg), the updated documentation, and the CI templates – let's address those in a separate pull request.

I propose you start by addressing my first comments and splitting the PR; then we should meet to finish and merge the PyTorch part (this week?).

Again, super job! ;)

Review threads on: xplique/wrappers/pytorch.py, xplique/attributions/base.py, tests/wrappers/test_pytorch_wrapper.py, setup.py, setup.cfg, docs/pytorch.md
…. With this new getter we do not need to rechange the model when using metrics for classification
…ctionnality in the metric instead), fix regression_operator, update the tests and enhance the documentation
…oid) to the model when computing metrics (but not for generating explanations), add the corresponding tests and documentation
…TorchWrapper class without the need of installing pytorch, change methods name to more explicit ones, improve the grad method, assert evaluation mode of pytorch module, update the test for the refactoring of the wrapper and also following operator and metric refactoring, add more information in the documentation
@lucashervier (Collaborator, Author) commented:

@dv-ai, @AntoninPoche and @fel-thomas, the last push should address all your remarks. @fel-thomas and @AntoninPoche, I directly answered your comments and resolved all the conversations. Do not hesitate to unresolve them if you think an issue was not properly addressed.

The documentation was heavily changed so please read the new one carefully before approving :)

@AntoninPoche I took note of your feedback on the notebook. I will modify it ASAP to take your remarks into account!

@dv-ai, as you only added a general comment, I address your points below:

Tests:
• Python version: The library was tested with Python 3.6 (python-lints). Any good reason to remove it?

-> Python 3.6 has reached end of life, hence its removal.

README:
• In the table of attribution methods: Sobol and HSIC are tagged as compatible with "TF, PyTorch", but these methods should be compatible with any callable model. From my point of view, when a method is compatible with any callable model, we should replace the "Callable" tag by "TF, PyTorch, Callable". It will be more understandable for the user, because a dedicated wrapper has been developed for PyTorch.
-> Done

• Why are metrics only compatible with TF and PyTorch and not with callable models?
-> I think it is possible to use most metrics with any callable model. However, it is not straightforward and should be addressed in the future. I will open an issue once the PR is validated.

api_attributions.md
• Gradient-based approaches: We should explain why the white-box methods are compatible or not: gradient-based white-box methods are supported when the method doesn't need to "modify" some part of the model.
-> I added an !!! info box at the end

model.md
• In the section "The inputs have expected shape", I recommend adding a section "Input format" at the same level as "General". In this new section, I would put the sub-sections "Images data", "Tabular data" and "Time-Series data".
• In the section "The inputs have expected shape", I would add a new section "Task" where, for each task, including segmentation and detection, there is a description of the output of the model and a description of the target parameter (cf. following comment).
• The targets parameter is always a source of confusion for the user. It should be better described in this section. We should explain how it is used by the library and how the user should fill it. It is done for the image case in the different task explanations, but I think we should have a dedicated paragraph to explain the targets parameter.

-> For the 3 bullet points above: I changed a lot of the documentation for the model and the operator to address those points. Do not hesitate to tell me if it's better now.
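
As a quick illustration (with hypothetical values) of how targets is typically provided for classification, i.e. a one-hot encoding of the labels to explain, so the classification operator can select the score of the class of interest:

```python
import numpy as np

# Hypothetical example: explain class 207 for each image of a batch of 4,
# with a model that outputs 1000 logits.
labels = np.array([207, 207, 207, 207])
targets = np.eye(1000)[labels].astype(np.float32)   # shape (4, 1000), one-hot

# `explainer` and `inputs` are placeholders for an attribution method and a batch
explanations = explainer(inputs, targets)
```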

• "Time-Series data": The note is not clear enough for me on how to solve the issue for Lime and KernelShap.
-> I modified the note to: "By default Lime & KernelShap will treat such inputs as grey images. You will need to define a custom map_to_interpret_space function when instantiating those methods in order to create a meaningful mapping of Time-Series data into an interpretable space when building such explainers."
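
To make this more concrete, a custom mapping could for example group consecutive time steps into interpretable segments. This is only a hypothetical sketch: the exact contract of map_to_interpret_space (here assumed to take a single (time_steps, features) input and return an integer segment id for each position) should be checked against the Lime documentation.

```python
import tensorflow as tf
from xplique.attributions import Lime

def map_time_series(single_input):
    # single_input: (time_steps, features) tensor (assumed contract)
    # group every 4 consecutive time steps into one interpretable segment
    n_steps = tf.shape(single_input)[0]
    segment_ids = tf.range(n_steps) // 4
    # repeat the segment id across the feature axis so every value of a
    # time step belongs to the same segment
    return tf.tile(segment_ids[:, None], [1, tf.shape(single_input)[1]])

explainer = Lime(model, map_to_interpret_space=map_time_series)
```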

• The "trick" that has been done to adapt Xplique to PyTorch should also work for other kinds of libraries like sklearn. Maybe we should mention it.
-> I modified the last section of the model.md file into: "Then you should take a look on the Callable documentation or you could take inspiration on the PyTorch Wrapper to write a wrapper that will integrate your model into our API!"

• We should explain the different kinds of enum values that the user can select for the operator parameter (the Tasks enum is not described).
-> It's done in the new documentation of the operator

operator.md
• It is not clear that the operator can be an enum value. We should explain and describe it (the Tasks enum is not described).
-> Done in the new operator documentation

pytorch.py
• Why do we need outputs.cpu() in the call to backward?
-> I modified it so it is no longer necessary

@lucashervier (Collaborator, Author) commented:

The last two commits are for you @AntoninPoche! They are related to your feedback on the notebook. I took your remarks concerning brevity into account and split the tutorial in two (one Getting Started and another one for the Metrics). I also added those tutorials to the different places of interest in the documentation, README, etc.

@AntoninPoche (Collaborator) left a comment

Amazing work as always! Most of my comments are small documentation modifications.

But I want to highlight the comment on the metrics/base.py file about activations: their computation should only be done along the class axis, not the batch axis.
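
In other words, for an output of shape (batch_size, num_classes), the activation should presumably be applied per sample over the class axis only, e.g.:

```python
import tensorflow as tf

logits = tf.random.normal((8, 10))               # (batch_size, num_classes)
probabilities = tf.nn.softmax(logits, axis=-1)   # softmax over classes, per sample
```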

Review threads on: README.md, docs/api/attributions/api_attributions.md, docs/api/attributions/model.md, docs/api/attributions/operator.md, mkdocs.yml, tests/metrics/test_common.py, xplique/commons/operators.py, xplique/metrics/base.py, xplique/wrappers/pytorch.py
@fel-thomas (Member) left a comment

Overall review

Once again, congratulations, Lucas, you've done an amazing job. I believe the library will definitely level up after your pull request.

I'm sorry for all the comments, but it's better that we try to do things right now so we can continue to progress smoothly later on.

Once the comments are addressed, I'm okay with merging.

Excellent work once again. 👍

Review threads on: .bumpversion.cfg, .github/ISSUE_TEMPLATE/bug_report.yml, .github/ISSUE_TEMPLATE/feature_request.yml, xplique/commons/operators.py, xplique/metrics/base.py, xplique/metrics/fidelity.py, xplique/wrappers/pytorch.py
@lucashervier (Collaborator, Author) commented:

@fel-thomas, @AntoninPoche, thank you again for your numerous pieces of feedback. Every change concerning the docs is in commit 3852bcf. The other requested changes were addressed in dedicated commits which have explicit names, I believe. I answered all your questions and marked the conversations as resolved, but once again, if you are not satisfied with one of my answers, do not hesitate to re-open it!

Hope you like it!

@AntoninPoche (Collaborator) left a comment

Looks good to me! Really nice job Lucas!

It just seems that the remarks for pytorch.md were not applied in this PR. I can do it in my future PR if you want ;)

Review threads on: docs/pytorch.md
@lucashervier merged commit 4753517 into master on Sep 7, 2023 (15 checks passed)
@lucashervier deleted the lucas/torch branch on September 7, 2023