This repository has been archived by the owner on Sep 27, 2020. It is now read-only.

about concat #4

Closed
chencodeX opened this issue Apr 25, 2018 · 5 comments

Comments

@chencodeX

https://arxiv.org/pdf/1506.04214.pdf
In this paper:
image
the LSTM computes i(t), f(t), and o(t) using x(t), h(t-1), and c(t-1). But in your code, convolution_lstm.py line 24:
combined = torch.cat((input, h), dim=1)

you only use x and h. Why?

@automan000
Owner

Hi Chen,
I think you might be right. I forgot why I did this when I wrote the code.
I have checked several implementations, and many of them make the same decision.
I will investigate further and let you know once I have finished.
Thanks anyway.

@chencodeX
Author

I have a guess of my own: it may be to reduce code complexity and the number of parameters.

@Jackie-Chou

@automan000 I guess you referenced some implementations of LSTM instead of ConvLSTM. Note the difference between their formulas.
LSTM
image
convLSTM
image
The plain LSTM does not involve c when computing the gates, while ConvLSTM does.
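
For reference, the ConvLSTM update equations as given in the paper linked at the top of the thread can be written out as below. Here $*$ is convolution and $\circ$ is the Hadamard (elementwise) product. The $W_{c\cdot} \circ \mathcal{C}$ terms are the ones a plain `torch.cat((input, h), dim=1)` followed by a convolution leaves out. Note that the output gate uses the new cell state $\mathcal{C}_t$, not $\mathcal{C}_{t-1}$.

```latex
\begin{aligned}
i_t &= \sigma\left(W_{xi} * \mathcal{X}_t + W_{hi} * \mathcal{H}_{t-1} + W_{ci} \circ \mathcal{C}_{t-1} + b_i\right) \\
f_t &= \sigma\left(W_{xf} * \mathcal{X}_t + W_{hf} * \mathcal{H}_{t-1} + W_{cf} \circ \mathcal{C}_{t-1} + b_f\right) \\
\mathcal{C}_t &= f_t \circ \mathcal{C}_{t-1} + i_t \circ \tanh\left(W_{xc} * \mathcal{X}_t + W_{hc} * \mathcal{H}_{t-1} + b_c\right) \\
o_t &= \sigma\left(W_{xo} * \mathcal{X}_t + W_{ho} * \mathcal{H}_{t-1} + W_{co} \circ \mathcal{C}_t + b_o\right) \\
\mathcal{H}_t &= o_t \circ \tanh\left(\mathcal{C}_t\right)
\end{aligned}
```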

@automan000
Owner

@Jackie-Chou Actually, I was referring to implementations such as BasicConvLSTMCell.
But your argument is very convincing. I will update the code ASAP. Thanks a lot.

@automan000
Owner

Bug fixed. Closing this issue.
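
The thread does not show the fix itself. As a rough illustration of what adding the cell-state terms to the gates can look like, here is a minimal sketch assuming PyTorch. The class name `PeepholeConvLSTMCell`, its constructor arguments, and the zero initialisation of the peephole weights are all hypothetical choices made for this example, not the repository's code.

```python
import torch
import torch.nn as nn


class PeepholeConvLSTMCell(nn.Module):
    """Hypothetical ConvLSTM cell with peephole terms (illustration only)."""

    def __init__(self, in_channels, hidden_channels, height, width, kernel_size=3):
        super().__init__()
        self.hidden_channels = hidden_channels
        # One convolution produces the x- and h-contributions of all four gates.
        self.conv = nn.Conv2d(
            in_channels + hidden_channels,
            4 * hidden_channels,
            kernel_size,
            padding=kernel_size // 2,
        )
        # Peephole weights are applied elementwise (Hadamard product), so they
        # are plain parameters shaped like a single cell state. Zero
        # initialisation is an assumption made for this sketch.
        shape = (1, hidden_channels, height, width)
        self.w_ci = nn.Parameter(torch.zeros(shape))
        self.w_cf = nn.Parameter(torch.zeros(shape))
        self.w_co = nn.Parameter(torch.zeros(shape))

    def forward(self, x, state):
        h_prev, c_prev = state
        # Same concatenation as the line quoted in the issue.
        combined = torch.cat((x, h_prev), dim=1)
        cc_i, cc_f, cc_o, cc_g = torch.split(
            self.conv(combined), self.hidden_channels, dim=1
        )
        # Input and forget gates also look at the previous cell state.
        i = torch.sigmoid(cc_i + self.w_ci * c_prev)
        f = torch.sigmoid(cc_f + self.w_cf * c_prev)
        c = f * c_prev + i * torch.tanh(cc_g)
        # The output gate looks at the updated cell state.
        o = torch.sigmoid(cc_o + self.w_co * c)
        h = o * torch.tanh(c)
        return h, c


# Usage: one step on a batch of two 3-channel 16x16 inputs.
cell = PeepholeConvLSTMCell(in_channels=3, hidden_channels=8, height=16, width=16)
x = torch.randn(2, 3, 16, 16)
h0 = torch.zeros(2, 8, 16, 16)
c0 = torch.zeros(2, 8, 16, 16)
h1, c1 = cell(x, (h0, c0))
```

Because the peephole weights are tied to a fixed spatial size, a cell written this way only accepts inputs of that height and width. That extra bookkeeping may be one reason some implementations drop the terms, as suggested earlier in the thread.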
