[ONNX] Fix numerical errors in softmax when dim is not last dimension #37326
Conversation
💊 Dr. CI build summary as of commit 2a9c810: 💚 Looks good so far! There are no failures yet. 💚
Force-pushed from 63c4108 to ea8f934
Thanks @p12tic for the fix! Overall looks good, please see inline comments for some additional updates.
@p12tic could you take a look at the comments and update your PR? Thanks!
@BowenBao Yes, I will do so. This week my priorities were elsewhere, thus the delay.
Force-pushed from ea8f934 to 2a9c810
@BowenBao I've applied your suggestions to the PR. Thanks for suggesting them on the PR instead of making the changes yourself; I got to learn what max normalization in softmax is :-)
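For context, the max normalization mentioned above is the standard numerical-stability identity `softmax(x) == softmax(x - x.max())`: subtracting the per-slice maximum before exponentiating keeps `exp` from overflowing without changing the result, which is why implementations of ONNX `Softmax` can stay finite where an explicit `exp`/`reducesum`/`div` graph produces NaN.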
@p12tic Thanks for the fix! LGTM
@houseroad has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
@houseroad merged this pull request in d16c823.
Pull Request resolved: pytorch#37326
Reviewed By: hl475
Differential Revision: D21389712
Pulled By: houseroad
fbshipit-source-id: 554fd1b98231a28984c30c7e7abd3c0643386ff7
Fixes #34585.
This PR improves the workaround for the mismatch in semantics between ONNX softmax and PyTorch softmax.

In PyTorch, the `dim` parameter specifies the dimension over which to normalize the values. ONNX, on the other hand, always coerces the input into a 2D tensor, and the `axis` parameter specifies which dimensions form the rows and the columns of that tensor. As a result, the two semantics agree only when the last dimension is being normalized (`dim == ndim - 1`).

Previously this was handled by recognizing the `dim == ndim - 1` case and using `softmax` for it. All other cases fell back to computing the result through explicit invocations of the `exp`, `reducesum`, and `div` operators. Unfortunately, this produces numeric errors when the input values are large: `exp` overflows to infinity in both the numerator and the denominator, and dividing the two yields NaN. This can be avoided by transposing the input tensor so that the dimension to normalize becomes the last one, letting us reuse ONNX softmax (see the sketches below).
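To make the failure mode concrete, here is a minimal sketch in plain PyTorch mirroring what the old `exp`/`reducesum`/`div` fallback graph computes (illustrative only, not the exporter code itself):

```python
import torch

x = torch.full((2, 3), 1000.0)  # large activations

# What the old fallback graph computes: exp / reducesum / div,
# with no max subtraction. exp(1000.) overflows to inf, and
# inf / inf produces NaN.
naive = x.exp() / x.exp().sum(dim=0, keepdim=True)
print(naive)                    # all NaN

# torch.softmax subtracts the per-slice max first, so it stays finite.
print(torch.softmax(x, dim=0))  # all 0.5
```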
A similar approach was applied to the `logsoftmax` function in #30433.
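For reference, here is a rough sketch of the transpose-based approach, written in the style of a `torch.onnx` symbolic function (the `g.op` calls emit ONNX `Transpose`/`Softmax` nodes; treat the exact signatures as illustrative rather than the merged implementation):

```python
def softmax(g, input, dim):
    # The rank must be known statically for this path.
    input_dim = input.type().dim()
    if dim < 0:
        dim += input_dim
    needs_transpose = dim != input_dim - 1
    if needs_transpose:
        # Swap the softmax dimension with the last one.
        axes = list(range(input_dim))
        axes[dim], axes[-1] = axes[-1], axes[dim]
        input = g.op("Transpose", input, perm_i=axes)
        dim = input_dim - 1
    # ONNX Softmax over the last axis now matches PyTorch semantics.
    ret = g.op("Softmax", input, axis_i=dim)
    if needs_transpose:
        # Swapping two axes is its own inverse, so the same
        # permutation restores the original layout.
        ret = g.op("Transpose", ret, perm_i=axes)
    return ret
```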