Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Different input dimention compared to output dimension #12

Closed
titu1994 opened this issue Sep 13, 2017 · 6 comments
Closed

Different input dimention compared to output dimension #12

titu1994 opened this issue Sep 13, 2017 · 6 comments
Labels

Comments

@titu1994
Copy link

titu1994 commented Sep 13, 2017

Hi, I'm trying to implement a naive version of this paper in Keras, and was wondering how is the case that - n_in != n_out handled.

I went through the code a few times, and couldn't understand the element wise multiplication of (1 - r_t) with x_t, if x_t is of a different shape than r_t.

@taolei87
Copy link
Contributor

Hi,

When n_in != n_out, we simply add one more linear transform (say W) at the highway connection:
h'[t] = r[t] * h[t] + (1-r[t]) * (Wx[t])

Again, this multiplication can be batched together with other mm as well. that said, this (Wx) is being put into U. and I guess that's why you find the code confusing.

We should better document the implementation. Thanks for pointing this out!

@taolei87
Copy link
Contributor

Also, in speech task, the (Wx) is always included, which is discussed in the appendix.

@titu1994
Copy link
Author

titu1994 commented Sep 13, 2017

Thanks for the quick reply!

Just for further clarification, this new W will infact be different W from equation (3) in the paper, and will not have any bias?

@taolei87
Copy link
Contributor

yes, the new W will be different to (3). and no bias term for them.

only the neural gates have biases (4) (5)

@titu1994
Copy link
Author

Fantastic. Thank you very much for the clarification, and fantastic work on this efficient implementation in CUDA.

@taolei87
Copy link
Contributor

You are very welcome!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants