Fix a bug in the handling of array of step outputs by TimeDistributed layer #315

caisq · 2018-09-06T13:52:18Z

Fixes: tensorflow/tfjs#681

BUG


davidsoergel

Reviewable status: 0 of 1 approvals obtained (waiting on @caisq, @davidsoergel, @ericdnielsen, and @bileschi)

src/layers/wrappers.ts, line 230 at r1 (raw file):

        // TODO(cais): Add useLearningPhase.
        const output =
            getExactlyOneTensor(this.layer.call(inputs, kwargs) as Tensor);

Maybe worth adding a comment about what is going on here (i.e., under what circumstances does call() return a length-1 array?)

src/layers/wrappers.ts, line 230 at r1 (raw file):

        // TODO(cais): Add useLearningPhase.
        const output =
            getExactlyOneTensor(this.layer.call(inputs, kwargs) as Tensor);

The cast should be unnecessary now
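Both comments above concern a helper that accepts either a single tensor or a length-1 array of tensors. A minimal sketch of that idea follows; it is not the actual tfjs-layers implementation of getExactlyOneTensor. `TensorLike` and `exactlyOneTensor` are hypothetical names used only for illustration. Because the parameter type is already a union, the caller does not need a cast:

```typescript
// Hypothetical stand-in for a tensor; the real code operates on tfjs Tensors.
interface TensorLike {
  shape: number[];
}

// Sketch only: accept a single tensor or a length-1 array and return the tensor.
// Arrays of any other length are rejected with an error.
function exactlyOneTensor(xs: TensorLike | TensorLike[]): TensorLike {
  if (Array.isArray(xs)) {
    if (xs.length !== 1) {
      throw new Error(`Expected exactly 1 tensor, but got ${xs.length}`);
    }
    return xs[0];
  }
  return xs;
}

const single: TensorLike = {shape: [2, 3]};
// Both call styles type-check without an `as` cast.
const fromTensor = exactlyOneTensor(single);
const fromArray = exactlyOneTensor([single]);
```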

caisq

Thanks for the review!

Reviewable status: complete! 1 of 1 approvals obtained (waiting on @davidsoergel, @ericdnielsen, and @bileschi)

src/layers/wrappers.ts, line 230 at r1 (raw file):

Previously, davidsoergel (David Soergel) wrote…

Maybe worth adding a comment about what is going on here (i.e., under what circumstances does call() return a length-1 array?)

Done.

src/layers/wrappers.ts, line 230 at r1 (raw file):

Previously, davidsoergel (David Soergel) wrote…

The cast should be unnecessary now

Done.

rodrigopivi · 2018-09-12T13:15:08Z

hi @caisq

Thanks for this fix. I manually built tfjs-layers and tested the use case of passing a model as a layer. The fix works when training with a validationSplit of 0.5. If I use any other validation split value, the training and validation tensors have different shapes and tfjs throws an error. Here is example code that reproduces the problem:

const inputs = tf.input({ dtype: 'float32', shape: [1, 2] });
const lstm = tf.layers.lstm({ units: 2, returnSequences: true }).apply(inputs) as tf.SymbolicTensor;
const timeAttention = new TimeSeriesAttention({}).apply(lstm) as tf.SymbolicTensor;
const model = tf.model({ inputs, outputs: timeAttention });
const optimize = tf.train.adam(0.0066, 0.0025, 0.1);
model.compile({ loss: 'categoricalCrossentropy', metrics: ['accuracy'], optimizer: optimize });

const inp = tf.tensor3d([[[1,1]],[[2,2]],[[3,3]],[[4,4]],[[5,5]],[[6,6]]],[6,1,2]);
const out = tf.tensor3d([[[1,0]],[[2,0]],[[3,0]],[[4,0]],[[5,0]],[[6,0]]], [6, 1, 2]);

(async () => {
    // NOTE: this works if validationSplit is 0.5; otherwise it fails because the
    //       training and validation tensors have different shapes.
    await model.fit(inp, out, { validationSplit: 0.2 });
})();

NOTE: TimeSeriesAttention is just a model
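To illustrate why 0.5 is the special case in this report, here is a small sketch of how the 6 samples above would divide between training and validation. It assumes the split index is computed as floor(numSamples * (1 - validationSplit)); that rule is an assumption made for illustration, and `splitSizes` is a hypothetical helper, not a tfjs API.

```typescript
// Sketch only. Assumption: the first floor(n * (1 - validationSplit)) samples
// are used for training and the remainder for validation.
function splitSizes(
    numSamples: number,
    validationSplit: number): {train: number; val: number} {
  const splitAt = Math.floor(numSamples * (1 - validationSplit));
  return {train: splitAt, val: numSamples - splitAt};
}

const even = splitSizes(6, 0.5);    // {train: 3, val: 3}: equal leading dimensions
const uneven = splitSizes(6, 0.2);  // {train: 4, val: 2}: different leading dimensions
console.log(even, uneven);
```

Under that assumption, only a split of 0.5 gives the training and validation tensors the same first dimension for this data set, which matches the behavior described above.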

caisq · 2018-09-13T15:54:41Z

@rodrigopivi Thanks for the report and the code for reproducing the error. It's on my TODO list to look at this issue, but my schedule is a little tight currently, so expect a delay of 1-2 weeks.

rodrigopivi · 2018-09-13T16:26:25Z

thank you

caisq added 6 commits August 31, 2018 21:37

WIP

efbafb7

Merge branch 'master' into core-0.12.16

7f4d109

Use tfc.serialization.registerClass()

ddb1416

Merge branch 'master' into core-0.12.16

7bce19e

Switch from 0.12.16 to 0.12.18

3d8ea85

0.12.18 contains the fix to the bug about concat gradient

Fix a bug in the way TimeDistributed handles array of outputs

d0fa91e

caisq changed the title ~~Time dist fix~~ [WIP; DO NOT REVIEW YET] Fix a bug in the handling of array of step outputs by TimeDistributed layer Sep 6, 2018

caisq added 2 commits September 6, 2018 13:26

Merge branch 'master' of github.com:caisq/tfjs-layers-1 into time-dist-fix

714b905

cleanup

e31730f

caisq changed the title ~~[WIP; DO NOT REVIEW YET] Fix a bug in the handling of array of step outputs by TimeDistributed layer~~ Fix a bug in the handling of array of step outputs by TimeDistributed layer Sep 6, 2018

Clean up test

cfb9314

caisq requested review from ericdnielsen, bileschi and davidsoergel September 6, 2018 17:36

davidsoergel approved these changes Sep 6, 2018


Respond to davidsoergel@'s comments

b83964a

caisq commented Sep 6, 2018


caisq merged commit e0b73c1 into tensorflow:master Sep 6, 2018
