MHA: Fix regression and apply bias flag to both in/out proj #52537
Conversation
💊 CI failures summary and remediations

As of commit 75dfb0f (more details on the Dr. CI page):

💚 💚 Looks good so far! There are no failures yet. 💚 💚

This comment was automatically generated by Dr. CI.
I don't think we need to worry about removing `_LinearWithBias`.
@jbschlosser has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.
Codecov Report

```
@@           Coverage Diff           @@
##           master   #52537   +/-   ##
=======================================
  Coverage   80.64%   80.64%
=======================================
  Files        1969     1969
  Lines      215943   215944       +1
=======================================
+ Hits       174137   174138       +1
  Misses      41806    41806
=======================================
```
```
@@ -65,7 +65,7 @@ def __init__(self, embed_dim: int, num_heads: int,
# TODO: The use of the `_LinearWithBias` increases the quantization noise
```
is this comment still relevant?
I went back and forth on this, but I left it in for the case where an older model is loaded and then quantized. Is this valid? The comment should probably at least be updated to reflect that `out_proj` could or could not be a `_LinearWithBias`.
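For context, a minimal sketch of the situation described above (illustrative only; `_LinearWithBias` is an internal class that this PR deliberately leaves in place, so the import path is valid as of this change but may not survive later releases): a freshly constructed module's `out_proj` is a plain `nn.Linear`, while a fully pickled model saved by an older release can still deserialize with a `_LinearWithBias` instance, so quantization code may encounter either type.

```python
# Sketch only: shows why quantization code may meet either class. Assumes the
# internal torch.nn.modules.linear._LinearWithBias still exists (true as of
# this PR, which intentionally does not remove it).
import torch.nn as nn
from torch.nn.modules.linear import _LinearWithBias

mha = nn.MultiheadAttention(embed_dim=8, num_heads=2)
print(type(mha.out_proj).__name__)                # 'Linear' after this PR
print(isinstance(mha.out_proj, _LinearWithBias))  # False for newly built modules;
                                                  # True for older pickled models
```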
LGTM. @vkuzo, mind taking a look at the quantization code?
@jbschlosser merged this pull request in a39b1c4.
cc @z-a-f as an FYI
…52537)

Summary:
Fixes pytorch#52257

## Background

Reverts MHA behavior for the `bias` flag to that of v1.5: the flag enables or disables both the in and out projection biases. Updates type annotations for both in and out projection biases from `Tensor` to `Optional[Tensor]` for `torch.jit.script` usage.

Note: With this change, `_LinearWithBias` defined in `torch/nn/modules/linear.py` is no longer utilized. Completely removing it would require updates to quantization logic in the following files:

```
test/quantization/test_quantized_module.py
torch/nn/quantizable/modules/activation.py
torch/nn/quantized/dynamic/modules/linear.py
torch/nn/quantized/modules/linear.py
torch/quantization/quantization_mappings.py
```

This PR takes a conservative initial approach and leaves these files unchanged.

**Is it safe to fully remove `_LinearWithBias`?**

Pull Request resolved: pytorch#52537

Test Plan:
```
python test/test_nn.py TestNN.test_multihead_attn_no_bias
```

## BC-Breaking Note

In v1.6, the behavior of `MultiheadAttention`'s `bias` flag was incorrectly changed to affect only the in projection layer. That is, setting `bias=False` would fail to disable the bias for the out projection layer. This regression has been fixed, and the `bias` flag now correctly applies to both the in and out projection layers.

Reviewed By: bdhirsh

Differential Revision: D26583639

Pulled By: jbschlosser

fbshipit-source-id: b805f3a052628efb28b89377a41e06f71747ac5b
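To make the annotation change concrete, here is a minimal sketch (the function name and shapes are illustrative, not taken from the PR): under `torch.jit.script`, an argument annotated as `Tensor` cannot be passed `None`, so the bias-free path requires `Optional[Tensor]`.

```python
from typing import Optional

import torch
from torch import Tensor

@torch.jit.script
def proj(x: Tensor, weight: Tensor, bias: Optional[Tensor]) -> Tensor:
    # With `bias: Tensor`, passing None would be rejected at the TorchScript
    # call boundary; Optional[Tensor] permits the disabled-bias path.
    return torch.nn.functional.linear(x, weight, bias)

out = proj(torch.randn(2, 8), torch.randn(8, 8), None)  # bias disabled
```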
Fixes a post-1.8 regression in nn.MultiheadAttention + quantization scriptability introduced in #52537. Passes the new test introduced in that PR, and fixes the repro found by @ngimel [here](https://gist.github.com/bhosmer/ef517d0774f2f10336b8140116fd6b62). Per comments in #52537 there's definitely a carnal dependency between quantization and the `_LinearWithBias` class by name that I'm reinstating here, but there may be cleaner ways to solve this - I don't really know what I'm doing 😁 . @jbschlosser @z-a-f LMK if you have ideas, happy to change this as desired. It'd be nice to get a fix into 1.9. _[Update: now using a better name instead of `_LinearWithBias`, but this remains a short-term fix to re-suppress a quantization API usage error that should properly be raised upstream. See issue #58969]_ Differential Revision: [D28593830](https://our.internmc.facebook.com/intern/diff/D28593830) [ghstack-poisoned]
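For reference, a hedged sketch of the failure mode described above (this is the assumed shape of the repro; the exact code is in the linked gist): dynamically quantizing a model containing `nn.MultiheadAttention` converts `out_proj` when it is a plain `nn.Linear`, and the resulting module then fails to script until a dedicated out-projection class suppresses that conversion.

```python
# Assumed repro shape, not the gist's exact code: dynamically quantize an MHA,
# then script it. Post-#52537 (plain nn.Linear out_proj) the scripting step
# failed; the follow-up's NonDynamicallyQuantizableLinear keeps out_proj out of
# the {nn.Linear} mapping so scripting works again.
import torch
import torch.nn as nn

mha = nn.MultiheadAttention(embed_dim=16, num_heads=4)
quantized = torch.quantization.quantize_dynamic(mha, {nn.Linear}, dtype=torch.qint8)
scripted = torch.jit.script(quantized)
```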
Fixes #52257

Background

Reverts MHA behavior for the `bias` flag to that of v1.5: the flag enables or disables both the in and out projection biases. Updates type annotations for both in and out projection biases from `Tensor` to `Optional[Tensor]` for `torch.jit.script` usage.

Note: With this change, `_LinearWithBias` defined in `torch/nn/modules/linear.py` is no longer utilized. Completely removing it would require updates to quantization logic in the following files:

```
test/quantization/test_quantized_module.py
torch/nn/quantizable/modules/activation.py
torch/nn/quantized/dynamic/modules/linear.py
torch/nn/quantized/modules/linear.py
torch/quantization/quantization_mappings.py
```

This PR takes a conservative initial approach and leaves these files unchanged.

**Is it safe to fully remove `_LinearWithBias`?**

Test Plan

```
python test/test_nn.py TestNN.test_multihead_attn_no_bias
```

BC-Breaking Note

In v1.6, the behavior of `MultiheadAttention`'s `bias` flag was incorrectly changed to affect only the in projection layer. That is, setting `bias=False` would fail to disable the bias for the out projection layer. This regression has been fixed, and the `bias` flag now correctly applies to both the in and out projection layers.
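As a minimal illustration of the restored behavior (a sketch, not the PR's actual test), both projection biases should now be disabled together:

```python
import torch.nn as nn

# With the fix, bias=False disables both the in and out projection biases.
mha = nn.MultiheadAttention(embed_dim=8, num_heads=2, bias=False)
assert mha.in_proj_bias is None
assert mha.out_proj.bias is None
```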