
Double Softmax in PyTorch image estimator for test cases. #2227

Open
GiulioZizzo opened this issue Jul 26, 2023 · 2 comments
Labels
improvement Improve implementation

Comments

@GiulioZizzo
Collaborator

Many tests use the PyTorch image estimator defined in the test utils.

By default this estimator does not use logits, e.g. the function signature is:

get_image_classifier_pt(from_logits=False, load_init=True, use_maxpool=True)

However, the loss function is loss_fn = torch.nn.CrossEntropyLoss(reduction="sum")

torch.nn.CrossEntropyLoss by default expects logits and will re-apply a softmax: https://pytorch.org/docs/stable/generated/torch.nn.CrossEntropyLoss.html
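A minimal sketch of the double-softmax effect (the tensor values here are illustrative, not from the ART test model): feeding softmax probabilities into CrossEntropyLoss distorts the loss, because the loss applies log-softmax a second time.

```python
import torch

# Illustrative logits and target for a single 3-class sample.
logits = torch.tensor([[2.0, -1.0, 0.5]])
target = torch.tensor([0])

loss_fn = torch.nn.CrossEntropyLoss(reduction="sum")

# Correct: feed raw logits; CrossEntropyLoss applies log-softmax internally.
loss_from_logits = loss_fn(logits, target)

# Incorrect: the model already applied softmax, so the loss
# effectively computes softmax(softmax(logits)) -- a double softmax.
probs = torch.softmax(logits, dim=1)
loss_double_softmax = loss_fn(probs, target)

print(loss_from_logits.item(), loss_double_softmax.item())
# The two losses differ, and the double-softmax gradients are squashed,
# which is why training behaves differently from the TensorFlow model.
```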

Hence, we should aim to make the default configuration mathematically correct. We could:

  1. Have the default from_logits=True
  2. Additionally, make the loss depend on from_logits by using CrossEntropyLoss when it is True and NLLLoss when it is False

This may require updating certain ART tests.
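Option 2 above could be sketched roughly as follows; `make_loss` is a hypothetical helper, not an existing ART function, shown only to illustrate pairing the loss with the from_logits flag:

```python
import torch

def make_loss(from_logits: bool):
    # Hypothetical helper: pick the loss that matches the model's output head.
    if from_logits:
        # Model outputs raw logits; CrossEntropyLoss applies log-softmax itself.
        return torch.nn.CrossEntropyLoss(reduction="sum")
    # Model already outputs softmax probabilities; take the log and use NLLLoss.
    nll = torch.nn.NLLLoss(reduction="sum")
    return lambda probs, target: nll(torch.log(probs), target)

# Illustrative check: both pairings compute the same cross-entropy value.
logits = torch.tensor([[2.0, -1.0, 0.5]])
target = torch.tensor([0])

loss_logits_head = make_loss(True)(logits, target)
loss_softmax_head = make_loss(False)(torch.softmax(logits, dim=1), target)
```

With this pairing, the default from_logits value becomes a presentation choice rather than a correctness issue, since either head gets a mathematically consistent loss.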

System information:

  • OS: MacOS
  • Python version: 3.9
  • ART version or commit number: 1.15
  • TensorFlow / Keras / PyTorch / MXNet version: Torch 1.13
@beat-buesser
Collaborator

Hi @GiulioZizzo, thank you very much for raising this issue! Have you found any tests where the wrong value for from_logits has been used?

GiulioZizzo added a commit to GiulioZizzo/adversarial-robustness-toolbox that referenced this issue Aug 14, 2023
Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>
@GiulioZizzo
Collaborator Author

Hi @beat-buesser! This issue is prevalent in virtually all of the ART test cases that use get_image_classifier_pt, as they all use the default parameters as far as I can see. In many cases this doesn't cause a huge problem when the tests just do forward passes and then compute things like accuracy (the argmax is unaffected, since softmax is monotonic).

However, it does start to cause problems when the neural network is trained and an exact result is expected. I came across the problem when refactoring test_adversarial_trainer for issue #2225 (including Hugging Face support in ART). The TensorFlow and PyTorch models would train in totally different ways even though they ought to converge to almost identical results (allowing for framework-specific numerical deltas). When you change the PyTorch classifier to use the correct logits/loss function combination, the model trains as it should and the framework results match.

There could well be other tests affected by this, so it would be worth investigating and correcting them for current and future tests.

GiulioZizzo added a commit to GiulioZizzo/adversarial-robustness-toolbox that referenced this issue Aug 31, 2023
Signed-off-by: GiulioZizzo <giulio.zizzo@yahoo.co.uk>
@beat-buesser beat-buesser added the improvement Improve implementation label Sep 1, 2023