Add batch_first support in MHA and update docs #839

zhangguanheng66 · 2020-06-23T16:00:27Z

No description provided.

codecov · 2020-06-23T19:00:01Z

Codecov Report

Merging #839 into master will increase coverage by 0.00%.
The diff coverage is 87.50%.

@@           Coverage Diff           @@
##           master     #839   +/-   ##
=======================================
  Coverage   77.43%   77.44%           
=======================================
  Files          43       44    +1     
  Lines        3045     3055   +10     
=======================================
+ Hits         2358     2366    +8     
- Misses        687      689    +2

Impacted Files	Coverage Δ
torchtext/nn/modules/__init__.py	`100.00% <ø> (ø)`
torchtext/nn/modules/multiheadattention.py	`92.40% <85.71%> (ø)`
torchtext/__init__.py	`88.00% <100.00%> (ø)`
torchtext/nn/__init__.py	`100.00% <100.00%> (ø)`

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 07abf6d...c74a914. Read the comment docs.

cpuhrsch · 2020-07-14T16:54:02Z

torchtext/modules/multiheadattention.py

        r""" A multi-head attention container

        Args:
            nhead: the number of heads in the multiheadattention model
            in_proj_container: A container of multi-head in-projection linear layers (a.k.a nn.Linear).
-            attention_layer: The attention layer.
+            attention_layer: The custom attention layer. The input sent from MHA container to the attention layer


Does this also take care of broadcasting?

The custom attention layer needs to take care of broadcasting. Updated the doc to reflect this.

I'd then augment the shape to (..., seq, batch, feature) and explain what that means and also that it's optional, i.e. enough to only handle 3-dim.

cpuhrsch · 2020-07-14T16:57:52Z

torchtext/modules/multiheadattention.py


        Examples::
-            >>> SDP = torchtext.models.ScaledDotProduct(0.1)
+            >>> SDP = torchtext.modules.ScaledDotProduct(dropout=0.1)


Should we mirror the pytorch path conventions here?

torchtext.nn and torchtext.nn.functional?

The other two domains use torchvision/audio.models.

Yes, but this isn't a model, right?

OK, Will fix it.

checkpoint

9a867b0

zhangguanheng66 marked this pull request as draft June 23, 2020 16:00

Guanheng Zhang added 4 commits June 23, 2020 09:48

update test

94528ad

update docs

abecdfe

checkpoint

2bfb5bd

checkpoint

6764ebb

Guanheng Zhang added 6 commits June 30, 2020 09:14

Merge branch 'master' into revised_mha

533ae55

checkpoint

f28701b

Merge branch 'master' into revised_mha

63deff1

doc typo

6882a7a

Merge branch 'master' into revised_mha

2e058e8

mha padding

10fb958

cpuhrsch reviewed Jul 14, 2020

View reviewed changes

Guanheng Zhang added 10 commits July 14, 2020 12:29

update third_party

408013b

Merge branch 'master' into revised_mha

6658afc

update broadcast doc

bcb752c

swtich to torchtext.nn

ae996ca

update test_jit

d85de52

resolve flake error

2d6e782

udpate submodules

da7d63d

flake8

b03d474

Merge branch 'master' into revised_mha

a73075f

broadcast doc

c74a914

zhangguanheng66 marked this pull request as ready for review July 15, 2020 00:47

cpuhrsch approved these changes Jul 15, 2020

View reviewed changes

zhangguanheng66 merged commit b021bf9 into pytorch:master Jul 15, 2020

zhangguanheng66 deleted the revised_mha branch July 16, 2020 17:53

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add batch_first support in MHA and update docs #839

Add batch_first support in MHA and update docs #839

zhangguanheng66 commented Jun 23, 2020

codecov bot commented Jun 23, 2020 •

edited

Loading

cpuhrsch Jul 14, 2020

zhangguanheng66 Jul 14, 2020

cpuhrsch Jul 14, 2020

cpuhrsch Jul 14, 2020

zhangguanheng66 Jul 14, 2020

cpuhrsch Jul 14, 2020

zhangguanheng66 Jul 14, 2020

Add batch_first support in MHA and update docs #839

Add batch_first support in MHA and update docs #839

Conversation

zhangguanheng66 commented Jun 23, 2020

codecov bot commented Jun 23, 2020 • edited Loading

Codecov Report

cpuhrsch Jul 14, 2020

Choose a reason for hiding this comment

zhangguanheng66 Jul 14, 2020

Choose a reason for hiding this comment

cpuhrsch Jul 14, 2020

Choose a reason for hiding this comment

cpuhrsch Jul 14, 2020

Choose a reason for hiding this comment

zhangguanheng66 Jul 14, 2020

Choose a reason for hiding this comment

cpuhrsch Jul 14, 2020

Choose a reason for hiding this comment

zhangguanheng66 Jul 14, 2020

Choose a reason for hiding this comment

codecov bot commented Jun 23, 2020 •

edited

Loading