This repository has been archived by the owner on Jan 15, 2024. It is now read-only.

[Transformer] Skip dropout layer when rate=0 #597

Merged: 8 commits into dmlc:master on Mar 4, 2019

Conversation

@TaoLv (Member) commented Feb 18, 2019

Description

#596
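
For context (issue #596 itself is not quoted in this thread): a `nn.Dropout` layer constructed with rate 0 passes data through unchanged even in training mode, so instantiating and calling it only adds an extra operator to the graph. A small self-contained check of that property, not taken from the PR:

```python
import mxnet as mx
from mxnet.gluon import nn

x = mx.nd.random.uniform(shape=(2, 4))
drop = nn.Dropout(0.0)              # rate 0 keeps every element and rescales by 1/(1-0) = 1
with mx.autograd.record():          # training mode, where dropout would normally mask inputs
    y = drop(x)
assert (y == x).asnumpy().all()     # identity output, so the layer can be skipped safely
```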

Checklist

Essentials

  • PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage
  • Code is well-documented

Changes

  • Feature1, tests, (and when applicable, API doc)
  • Feature2, tests, (and when applicable, API doc)

Comments

  • If this change is a backward incompatible change, why must this change be made.
  • Interesting edge cases to note here

@TaoLv requested a review from szha as a code owner on February 18, 2019 02:37
@TaoLv changed the title from "Skip dropout layer when rate=0" to "[Transformer] Skip dropout layer when rate=0" on Feb 18, 2019
@mli (Member) commented Feb 18, 2019

Job PR-597/1 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-597/1/index.html

@codecov (bot) commented Feb 18, 2019

Codecov Report

Merging #597 into master will decrease coverage by 0.51%.
The diff coverage is 100%.

@@            Coverage Diff            @@
##           master    #597      +/-   ##
=========================================
- Coverage   65.12%   64.6%   -0.52%     
=========================================
  Files         135     135              
  Lines       12397   12199     -198     
=========================================
- Hits         8073    7881     -192     
+ Misses       4324    4318       -6
Flag        Coverage Δ
#PR597      64.6% <100%> (ø) ⬆️
#PR617      ?
#master     ?
#notserial  44.19% <0%> (+0.37%) ⬆️
#py2        64.34% <100%> (-0.53%) ⬇️
#py3        64.48% <100%> (-0.52%) ⬇️
#serial     49.62% <100%> (-0.78%) ⬇️

@eric-haibin-lin (Member) left a comment

Thanks for the contribution! One comment:

@@ -106,7 +107,8 @@ def __init__(self, units=512, hidden_size=2048, dropout=0.0, use_residual=True,
weight_initializer=weight_initializer,
bias_initializer=bias_initializer,
prefix='ffn_2_')
self.dropout_layer = nn.Dropout(dropout)
@eric-haibin-lin (Member) commented on this hunk:

Might as well change the other occurrences of

    self.dropout_layer = nn.Dropout(dropout)

in the same way?

@TaoLv (Member, Author) replied:

Sure. Will change them accordingly.
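
The full merged diff is not reproduced in this thread. Below is a minimal sketch of the guard pattern being agreed on above, using a hypothetical block name; the real code is the Transformer feed-forward and attention cells in `gluonnlp.model.transformer`, where `self.dropout_layer = nn.Dropout(dropout)` appears several times.

```python
from mxnet.gluon import nn

class FFNSketch(nn.HybridBlock):
    """Hypothetical stand-in for a Transformer sub-block touched by this PR."""

    def __init__(self, units=512, hidden_size=2048, dropout=0.0, **kwargs):
        super(FFNSketch, self).__init__(**kwargs)
        self._dropout = dropout
        with self.name_scope():
            self.ffn_1 = nn.Dense(hidden_size, flatten=False, prefix='ffn_1_')
            self.ffn_2 = nn.Dense(units, flatten=False, prefix='ffn_2_')
            if dropout:
                # only build the layer when the rate is non-zero
                self.dropout_layer = nn.Dropout(dropout)

    def hybrid_forward(self, F, inputs):  # pylint: disable=arguments-differ
        outputs = self.ffn_2(self.ffn_1(inputs))
        if self._dropout:
            # skipped entirely when dropout == 0, so no no-op ends up in the graph
            outputs = self.dropout_layer(outputs)
        return outputs
```

Applied to every occurrence, a model built with dropout=0 contains no `Dropout` blocks at all, which is what the PR title describes.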

@eric-haibin-lin (Member) replied:

Thanks. Would you mind also adding the case when dropout is set to 0 in the unit test? https://github.com/dmlc/gluon-nlp/blob/master/tests/unittest/test_models.py#L75
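
The linked test file is not reproduced here; the following is a hypothetical sketch of what "adding the case when dropout is set to 0" could look like, parametrizing a toy stand-in block over both rates (the real test exercises the GluonNLP transformer models in tests/unittest/test_models.py):

```python
import mxnet as mx
import pytest
from mxnet.gluon import nn

@pytest.mark.parametrize('dropout', [0.0, 0.1])
def test_dropout_rate_zero_is_skipped(dropout):
    # toy stand-in for the Transformer block; mirrors the guard discussed above
    net = nn.HybridSequential()
    with net.name_scope():
        net.add(nn.Dense(32, flatten=False))
        if dropout:
            net.add(nn.Dropout(dropout))
        net.add(nn.Dense(16, flatten=False))
    net.initialize()
    net.hybridize()
    out = net(mx.nd.random.uniform(shape=(2, 5, 16)))
    assert out.shape == (2, 5, 16)
    # with rate 0 the network should contain no Dropout block at all
    assert any(isinstance(block, nn.Dropout) for block in net) == (dropout > 0)
```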

@szhengac (Member) commented:

Looks ok to me.

@TaoLv (Member, Author) commented Feb 23, 2019

Looks like the failure in CI is not related to the changes in this PR. Any idea how to get it to pass?

@szha (Member) commented Feb 23, 2019

@TaoLv I triggered the CI again just now. I will make the linkcheck optional and instead let @mli send a report to the PR if anything is broken.

@eric-haibin-lin (Member) left a comment

Looks good pending unit tests.

@TaoLv (Member, Author) commented Feb 26, 2019

Sure, will change accordingly.

@mli (Member) commented Mar 3, 2019

Job PR-597/5 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-597/5/index.html

@mli (Member) commented Mar 3, 2019

Job PR-597/6 is complete.
Docs are uploaded to http://gluon-nlp-staging.s3-accelerate.dualstack.amazonaws.com/PR-597/6/index.html

@eric-haibin-lin merged commit cad5fc2 into dmlc:master on Mar 4, 2019
paperplanet pushed a commit to paperplanet/gluon-nlp that referenced this pull request Jun 9, 2019
* skip droput layer when rate=0

* address comments

* fix complain of CI

* retrigger

* trigger CI

* add test for dropout=0