This repository has been archived by the owner on Oct 17, 2021. It is now read-only.

Adding dropout implementation to RNNs #203

Merged
merged 10 commits into master from add-dropout-to-rnn on Jul 3, 2018

Conversation

@ericdnielsen (Member) commented on May 21, 2018

Not ready for review yet; still needs testing, etc. But putting it into a PR for easier comparison.


This change is Reviewable

@caisq self-requested a review on June 29, 2018 13:40
@ericdnielsen (Member Author)

Tests added, ready for review

@caisq (Contributor) commented on Jul 3, 2018

Review status: 0 of 1 LGTMs obtained


src/layers/recurrent.ts, line 1018 at r2 (raw file):

    this.dropoutMask = generateDropoutMask(() => {
      return tfc.onesLike(inputs as Tensor);
    }, this.dropout, training) as Tensor;

This can be simplified as () => tfc.onesLike(inputs as Tensor). Same elsewhere.
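For reference, a minimal sketch of the suggested change, assuming generateDropoutMask keeps the signature visible in the quoted code (mask factory, dropout rate, training flag):

    // Same mask, written as a concise arrow expression instead of a block body.
    this.dropoutMask = generateDropoutMask(
        () => tfc.onesLike(inputs as Tensor), this.dropout, training) as Tensor;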


src/layers/recurrent.ts, line 1024 at r2 (raw file):

    generateDropoutMask(() => {
      return tfc.onesLike(prevOutput);
    }, this.recurrentDropout, training) as Tensor;

This can be simplified as () => tfc.onesLike(prevOutput). Same elsewhere.


src/layers/recurrent.ts, line 1474 at r2 (raw file):

Tensor[]

Optional: Would be slightly safer to type this as [Tensor, Tensor, Tensor]. Same below.
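As a hedged illustration of why the tuple type is safer (the variables below are hypothetical, not from the PR):

    import * as tfc from '@tensorflow/tfjs-core';
    import {Tensor} from '@tensorflow/tfjs-core';

    // With a plain array type, the length is unchecked:
    const loose: Tensor[] = [tfc.zeros([2]), tfc.zeros([2])];
    // loose[2] still compiles, but is undefined at runtime.

    // A fixed-length tuple makes the compiler enforce exactly three tensors:
    const strict: [Tensor, Tensor, Tensor] =
        [tfc.zeros([2]), tfc.zeros([2]), tfc.zeros([2])];
    const [output, hState, cState] = strict;  // checked destructuring.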


src/layers/recurrent.ts, line 2513 at r2 (raw file):

training = training)

s/training = training/training/

Same below.
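For context, a hedged sketch of why the flagged pattern works but should go (the call site below is illustrative, not the actual source): training = training is a self-assignment expression that evaluates to training, so it behaves correctly but reads like Python keyword-argument syntax.

    // Hypothetical call site illustrating the s/training = training/training/ fix.
    declare function cellCall(inputs: object, training: boolean): void;
    let training = true;
    // Before: works only because the assignment expression evaluates to `training`.
    cellCall({}, training = training);
    // After: pass the flag directly.
    cellCall({}, training);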


src/layers/recurrent_test.ts, line 540 at r2 (raw file):

dropouts = [0.0, 0.1];

Can we add tests for recurrentDropout > 0 and < 1?
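A hedged sketch of what such a case might look like (layer construction, shapes, and the tfl/tfc import aliases are illustrative, not from the PR):

    // Two training-mode passes should differ when 0 < recurrentDropout < 1.
    const layer = tfl.layers.simpleRNN({units: 4, recurrentDropout: 0.1});
    const x = tfc.ones([1, 5, 3]);  // [batch, timeSteps, inputDim]
    const y1 = layer.apply(x, {training: true}) as Tensor;
    const y2 = layer.apply(x, {training: true}) as Tensor;
    // Assert y1 and y2 are not identical, e.g. via their element-wise diff.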


src/layers/recurrent_test.ts, line 548 at r2 (raw file):

1

For generality, can we make this a larger number?


src/layers/recurrent_test.ts, line 565 at r2 (raw file):

1

This should be 1 regardless of timeSteps, right? (No change required. Just confirming.)


src/layers/recurrent_test.ts, line 582 at r2 (raw file):

1

As above, can we make this a larger number?


src/layers/recurrent_test.ts, line 602 at r2 (raw file):

}

Add an else branch with assertions in it.
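A minimal sketch of the requested shape, with hypothetical tensors y1 and y2 from two stochastic runs (the exact assertions are assumptions; expectTensorsClose is the comparison helper used in tfjs-layers tests):

    if (dropout === 0) {
      // No dropout: repeated runs must produce identical outputs.
      expectTensorsClose(y1, y2);
    } else {
      // Dropout active: assert the runs actually differ, so this branch
      // never silently passes without checking anything.
      expect(y1.sub(y2).abs().max().dataSync()[0]).toBeGreaterThan(0);
    }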


src/layers/recurrent_test.ts, line 912 at r2 (raw file):

}

This needs an else branch with assertions in it.


src/layers/recurrent_test.ts, line 1235 at r2 (raw file):

}

This needs an else branch with assertions in it.



@ericdnielsen (Member Author)

Review status: 0 of 1 LGTMs obtained


src/layers/recurrent.ts, line 1018 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
    this.dropoutMask = generateDropoutMask(() => {
      return tfc.onesLike(inputs as Tensor);
    }, this.dropout, training) as Tensor;

This can be simplified as () => tfc.onesLike(inputs as Tensor). Same elsewhere.

Done.


src/layers/recurrent.ts, line 1024 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
    generateDropoutMask(() => {
      return tfc.onesLike(prevOutput);
    }, this.recurrentDropout, training) as Tensor;

This can be simplified as () => tfc.onesLike(prevOutput). Same elsewhere.

Done.


src/layers/recurrent.ts, line 1474 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
Tensor[]

Optional: Would be slightly safer to type this as [Tensor, Tensor, Tensor]. Same below.

Done.


src/layers/recurrent.ts, line 2513 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
training = training)

s/training = training/training/

Same below.

Done.


src/layers/recurrent_test.ts, line 540 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
dropouts = [0.0, 0.1];

Can we add tests for recurrentDropout > 0 and < 1?

Discussed offline.


src/layers/recurrent_test.ts, line 548 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
1

For generality, can we make this a larger number?

Done.


src/layers/recurrent_test.ts, line 565 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
1

This should be 1 regardless of timeSteps, right? (No change required. Just confirming.)

Done.


src/layers/recurrent_test.ts, line 582 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
1

As above, we can make this a larger number?

Done.


src/layers/recurrent_test.ts, line 602 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
}

Add an else branch with assertions in it.

Done.


src/layers/recurrent_test.ts, line 912 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
}

this needs an else branch with assertions in it.

Done.


src/layers/recurrent_test.ts, line 1235 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…
}

This needs an else branch with assertions in it.

Done.



@caisq (Contributor) commented on Jul 3, 2018

Review status: 0 of 1 LGTMs obtained


src/layers/recurrent.ts, line 1474 at r2 (raw file):

Previously, ericdnielsen wrote…

Done.

You say it's done, but I don't see any changes made here?


src/layers/recurrent_test.ts, line 542 at r3 (raw file):

for (const training of trainings) {

For tests like this, can we add the following check for the absence of memory leak?

After the 1st iteration, call tfc.memory().numTensors to get the number of tensors in the backend.
After the 2nd iteration, get that number again and assert that it equals the previous number.
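A minimal sketch of that check (tfc.memory().numTensors is the real counter; model, x, and the predict call are placeholders for the test's actual forward pass):

    let numTensors = -1;
    for (let i = 0; i < 2; ++i) {
      tfc.dispose(model.predict(x));  // the op under test, cleaned up each pass.
      if (i === 0) {
        // Record the backend tensor count after the 1st iteration...
        numTensors = tfc.memory().numTensors;
      } else {
        // ...and assert later iterations allocate nothing on top of it.
        expect(tfc.memory().numTensors).toEqual(numTensors);
      }
    }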


src/layers/recurrent_test.ts, line 566 at r3 (raw file):

1  *

Is there any reason to keep this 1 * here? Same below?



@ericdnielsen (Member Author)

Review status: 0 of 1 LGTMs obtained


src/layers/recurrent.ts, line 1474 at r2 (raw file):

Previously, caisq (Shanqing Cai) wrote…

You say it's done, but I don't see any changes made here?

Ah, I changed the LSTM ones, not the GRU ones. Both are changed now.


src/layers/recurrent_test.ts, line 542 at r3 (raw file):

Previously, caisq (Shanqing Cai) wrote…
for (const training of trainings) {

For tests like this, can we add the following check for the absence of memory leak?

After the 1st iteration, call tfc.memory().numTensors to get the number of tensors in the backend.
After the 2nd iteration, get that number again and assert that it equals the previous number.

Done.


src/layers/recurrent_test.ts, line 566 at r3 (raw file):

Previously, caisq (Shanqing Cai) wrote…
1  *

Is there any reason to keep this 1 * here? Same below?

I think the parallelism with the GRU & LSTM ones, where it's not 1 *, is useful.



@caisq (Contributor) commented on Jul 3, 2018

:lgtm_strong:

Thanks!


Review status: :shipit: complete! 1 of 1 LGTMs obtained


src/layers/recurrent.ts, line 2009 at r4 (raw file):

[Tensor, Tensor, Tensor];

Isn't this supposed to be four (instead of three) Tensors? Same in the line below.



@ericdnielsen (Member Author)

Review status: :shipit: complete! 1 of 1 LGTMs obtained


src/layers/recurrent.ts, line 2009 at r4 (raw file):

Previously, caisq (Shanqing Cai) wrote…
[Tensor, Tensor, Tensor];

Isn't this supposed to be four (instead of three) Tensors? Same in the line below.

Looks like it should be; not sure why TS & lint don't complain.



@ericdnielsen merged commit 1a9ac50 into master on Jul 3, 2018
@ericdnielsen deleted the add-dropout-to-rnn branch on July 3, 2018 20:51