
Gemm optional bias #2330

Merged: 8 commits merged into onnx:master on Sep 18, 2019
Conversation

@JamesAllingham (Contributor)

As discussed (briefly) in #2060 it would be useful for the bias term (the 'C' input) of the 'Gemm' operation to be optional. This PR adds this functionality.

Specifically, if the bias is not specified it is assumed to be 0. This is a common behaviour among deep learning frameworks.

This PR:

  1. Updates the definition (in math/defs.cc) to make the 'C' input optional. The version for the Gemm operator is bumped to 11, and the old definition is moved to math/old.cc. Similarly, operator_sets.h is updated to reflect the new version number.

  2. Updates the gemm_reference_implementation to allow 'C' to be omitted (see the sketch just after this list).

  3. Adds a test case for no bias term.
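A minimal sketch of the behaviour described in point 2, assuming a NumPy-based reference implementation with an optional C argument (the exact signature in the repository may differ):

```python
import numpy as np

def gemm_reference_implementation(A, B, C=None, alpha=1.0, beta=1.0, transA=0, transB=0):
    # Apply the optional transposes.
    A = A.T if transA else A
    B = B.T if transB else B

    # Y = alpha * A' * B' (+ beta * C when 'C' is provided).
    # Omitting 'C' is equivalent to a zero bias.
    Y = alpha * np.dot(A, B)
    if C is not None:
        Y += beta * C
    return Y
```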

@JamesAllingham requested a review from a team as a code owner on September 18, 2019 09:38
@wschin (Contributor) commented Sep 18, 2019

cc some people from the operators SIG (@ebarsoum, @spandantiwari, @gramalingam) for visibility. I personally consider this change reasonable, for the reasons below.

  1. We already have MatMul and Add, yet Gemm was introduced to open up opportunities for further computation optimization. Since plain 2-D matrix multiplication is even more fundamental than Gemm, it seems reasonable to allow it within Gemm (an op that stands for maximal optimization).
  2. Most frameworks can easily implement it given their existing Gemm code.
  3. From the user's perspective, Gemm is sometimes annoying to use because a zero initializer has to be created just to satisfy the spec (see the sketch at the end of this comment).

To be fair, I should also note that this change does not increase the expressiveness of ONNX.
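To illustrate point 3, a minimal sketch using the onnx Python helpers (tensor names and shapes are made up for the example):

```python
import numpy as np
from onnx import helper, numpy_helper

# Before this change (Gemm opset <= 10): 'C' is required, so a zero
# initializer has to be created just to satisfy the spec.
zero_bias = numpy_helper.from_array(
    np.zeros((3, 4), dtype=np.float32), name="C")
gemm_with_dummy_bias = helper.make_node(
    "Gemm", inputs=["A", "B", "C"], outputs=["Y"])

# With this change (Gemm opset 11): 'C' is optional and defaults to 0,
# so the node can simply omit it.
gemm_without_bias = helper.make_node(
    "Gemm", inputs=["A", "B"], outputs=["Y"])
```

The dummy zero initializer, and the extra graph entry it requires, simply disappears.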

@spandantiwari (Contributor) commented Sep 18, 2019

@wschin @JamesAllingham Just curious whether this is being tracked for the 1.6 release. If not, we may have to update the opset version number in this PR from 11 to 12.

(Outdated review comments on docs/Operators.md and onnx/defs/math/defs.cc were resolved.)
@wschin (Contributor) commented Sep 18, 2019

> @wschin @JamesAllingham Just curious whether this is being tracked for the 1.6 release. If not, we may have to update the opset version number in this PR from 11 to 12.

It's a nice-to-have for 1.6. If we can't make it in time, we can bump the opset version if you want.

@prasanthpul added this to the 1.6 milestone Sep 18, 2019
@wschin (Contributor) commented Sep 18, 2019

@JamesAllingham, we need a shape inference test as well.

@JamesAllingham requested a review from a team as a code owner on September 18, 2019 20:44
@JamesAllingham (Contributor, Author)

> @JamesAllingham, we need a shape inference test as well.

I've added a test, but I wasn't 100% sure what you wanted here, so let me know if you'd like something else.
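For context, a standalone sketch of the kind of check such a test performs, using only the public onnx Python API (graph names and shapes are illustrative, and the actual test in this PR may be structured differently):

```python
import onnx
from onnx import helper, TensorProto

# A Gemm node with no 'C' input: (3, 5) x (5, 4) should infer Y as (3, 4).
node = helper.make_node("Gemm", inputs=["A", "B"], outputs=["Y"])
graph = helper.make_graph(
    [node],
    "gemm_no_bias",
    inputs=[
        helper.make_tensor_value_info("A", TensorProto.FLOAT, [3, 5]),
        helper.make_tensor_value_info("B", TensorProto.FLOAT, [5, 4]),
    ],
    outputs=[helper.make_tensor_value_info("Y", TensorProto.FLOAT, None)],
)
model = helper.make_model(
    graph, opset_imports=[helper.make_opsetid("", 11)])

inferred = onnx.shape_inference.infer_shapes(model)
dims = inferred.graph.output[0].type.tensor_type.shape.dim
assert [d.dim_value for d in dims] == [3, 4]
```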

@wschin (Contributor) commented Sep 18, 2019

> > @JamesAllingham, we need a shape inference test as well.
>
> I've added a test, but I wasn't 100% sure what you wanted here, so let me know if you'd like something else.

Ah, my bad. I missed the last file.

@wschin added the operator (Issues related to ONNX operators) label Sep 18, 2019
@JamesAllingham (Contributor, Author)

> > > @JamesAllingham, we need a shape inference test as well.
> >
> > I've added a test, but I wasn't 100% sure what you wanted here, so let me know if you'd like something else.
>
> Ah, my bad. I missed the last file.

No, I only just added it, you didn't miss anything!

@spandantiwari (Contributor) left a review:

LGTM.

auto& first_input_shape = getInputShape(ctx, 0);
auto& second_input_shape = getInputShape(ctx, 1);
if (first_input_shape.dim_size() != 2)
  fail_shape_inference("First input does not have rank 2");
Review comment (Contributor):

nit: Minor point about code style: consider adding braces even for single-line scopes, to be consistent with the coding style in this file.

@JamesAllingham (Contributor, Author) replied:

Will do 👍

@wschin merged commit 23bb6ea into onnx:master Sep 18, 2019
kevinch-nv pushed a commit that referenced this pull request Sep 23, 2019
* Gemm optional bias (#2330)

* Made the 'C' input of Gemm (the bias term) optional.

If missing it defaults to 0.

Also added a test case for no bias.

Updated the Gemm op to version 11.

* Fixed a typo!

* Small tweaks to the Gemm docs.

* Added a shape inference test for Gemm with no bias

* Tweaked coding style slightly by adding braces to single line scopes.

* Fix some backend tests (#2335)

* Fix some node tests

* PR comments and docs

* Update Changelog.md

* Update gen_doc script to validate proto3 files (#2122)

* Update gen_doc script to validate proto3 files

* Update CMakeLists.txt

* Update pybind (#2340)

* Fix node test case model for Gemm scalar bias case (#2342)

* Fix some node tests

* PR comments and docs

* Update Changelog.md

* Fix gemm scalar node test

* Clarify behavior in ConvTranspose (#2343)

* Fix the wrong behavior in ConvTranspose

* Address comments
jcwchen pushed a commit to jcwchen/onnx that referenced this pull request Sep 23, 2020
* Made the 'C' input of Gemm (the bias term) optional.

If missing it defaults to 0.

Also added a test case for no bias.

Updated the Gemm op to version 11.

* Fixed a typo!

* Small tweaks to the Gemm docs.

* Added a shape inference test for Gemm with no bias

* Tweaked coding style slightly by adding braces to single line scopes.