Different input dimension compared to output dimension #12

Hi, I'm trying to implement a naive version of this paper in Keras, and was wondering how the case n_in != n_out is handled. I went through the code a few times, and couldn't understand the element-wise multiplication of (1 - r_t) with x_t, if x_t has a different shape than r_t.

Comments
Hi, when n_in != n_out, we simply add one more linear transform (say W) at the highway connection, so the highway term becomes (1 - r_t) ⊙ (W x_t) instead of (1 - r_t) ⊙ x_t. Again, this multiplication can be batched together with the other matrix multiplications. That said, this (W x) is being put into U, and I guess that's why you find the code confusing. We should document the implementation better. Thanks for pointing this out!
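For anyone landing here with the same question, here is a minimal, unfused PyTorch sketch of the scheme described above. The class and argument names (`NaiveSRUCell`, `n_in`, `n_out`) are mine, not from this repo, and the real implementation fuses the recurrence into a CUDA kernel; this only illustrates how the extra W x column block rides along in the batched matmul U:

```python
import torch
import torch.nn as nn

class NaiveSRUCell(nn.Module):
    """Unfused SRU-style cell, written to show the n_in != n_out case."""

    def __init__(self, n_in: int, n_out: int):
        super().__init__()
        self.project = n_in != n_out
        k = 4 if self.project else 3  # 4th column block carries the extra W x
        # One batched matmul produces [x_tilde; f_pre; r_pre(; W x)]; no bias here.
        self.w = nn.Linear(n_in, k * n_out, bias=False)
        # Only the two gates carry bias terms.
        self.bias_f = nn.Parameter(torch.zeros(n_out))
        self.bias_r = nn.Parameter(torch.zeros(n_out))

    def forward(self, x: torch.Tensor, c: torch.Tensor):
        # x: (batch, n_in), c: (batch, n_out)
        u = self.w(x)
        if self.project:
            x_tilde, f_pre, r_pre, x_hw = u.chunk(4, dim=-1)
        else:
            x_tilde, f_pre, r_pre = u.chunk(3, dim=-1)
            x_hw = x  # identity highway when shapes already match
        f = torch.sigmoid(f_pre + self.bias_f)    # forget gate
        r = torch.sigmoid(r_pre + self.bias_r)    # reset gate
        c = f * c + (1.0 - f) * x_tilde           # internal state
        h = r * torch.tanh(c) + (1.0 - r) * x_hw  # highway output
        return h, c
```

As a quick smoke test, `cell = NaiveSRUCell(128, 256)` followed by `h, c = cell(torch.randn(8, 128), torch.zeros(8, 256))` should give `h` and `c` of shape (8, 256).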
Also, in the speech task, the (W x) transform is always included; this is discussed in the appendix.
Thanks for the quick reply! Just for further clarification: this new W is different from the W in equation (3)? And does it have a bias term?
Yes, the new W is different from the one in (3), and there is no bias term for it. Only the neural gates have biases, as in (4) and (5).
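For readers without the paper at hand, here is my reading of the equations being referenced (the numbering is taken from the arXiv version of the paper; treat it as an assumption if your copy numbers them differently):

```latex
\tilde{x}_t = W x_t                                     % (3): input transform, no bias
f_t = \sigma(W_f x_t + b_f)                             % (4): forget gate, with bias
r_t = \sigma(W_r x_t + b_r)                             % (5): reset gate, with bias
c_t = f_t \odot c_{t-1} + (1 - f_t) \odot \tilde{x}_t   % internal state
h_t = r_t \odot g(c_t) + (1 - r_t) \odot x_t            % highway output
% When n_in != n_out, the last term becomes (1 - r_t) \odot (W' x_t),
% with W' a separate bias-free matrix, distinct from W in (3).
```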
Fantastic. Thank you very much for the clarification, and great work on this efficient CUDA implementation.
You are very welcome!