
SeqLSTM #207

Merged: 16 commits into Element-Research:master on Apr 21, 2016

Conversation

nicholas-leonard
Member

Super fast LSTM code from Justin's torch-rnn

@jcjohnson

@nicholas-leonard Looks great!

I'm not sure about the best way to handle (N x T x D) vs (T x N x D) layouts (where N = minibatch, T = sequence length, D = input size). TND fits better with the rest of rnn and is more memory-friendly, so it will probably be a bit faster; however, NTD seems like a more natural fit with the rest of nn, which is why I chose to use that layout.

Thoughts?

@nicholas-leonard
Member Author

@jcjohnson Yeah, my thoughts exactly. I want to make SeqLSTM default to TND to make the transition more seamless for rnn users. The reason I chose TND for rnn was, as you said, the memory-friendliness. But then there are the rest of the torch users and the torch-rnn package...

I was thinking that we could add something like SeqLSTM.batchfirst = true to make it easy to switch to NTD instead of the default TND. If you insist, since it is your code and it will affect your backwards-compatibility, I don't really mind making it default to false.

So it is up to you really :) What do you choose?

@SeanNaren
Contributor

Really awesome to see this in rnn, nice job guys. @nicholas-leonard I basically created a version of BiSequencer for the torch-rnn package which may be of use; I opened a PR here. I would be more than happy to create a PR after you implement this!

@nicholas-leonard
Member Author

@SeanNaren Awesome! Would really appreciate that!

@jcjohnson

@nicholas-leonard TND default sounds good to me; that way the default behavior is also the fastest.

I like the idea of a batchFirst field that switches over to NTD layout. I'm not too worried about the default behavior maintaining backward compatibility with torch-rnn since I am probably the main consumer of the SeqLSTM right now, and a line to set batchFirst in my own code isn't a big deal.
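For users, it would look roughly like this (a minimal sketch; the sizes are arbitrary and it assumes the batchfirst flag behaves as proposed):

require 'rnn'

local T, N, D, H = 20, 32, 10, 25   -- seqlen, batchsize, inputsize, outputsize

-- default TND layout: input is seqlen x batchsize x inputsize
local lstm = nn.SeqLSTM(D, H)
local out = lstm:forward(torch.randn(T, N, D))    -- out is T x N x H

-- NTD layout via the proposed flag: input is batchsize x seqlen x inputsize
local lstm2 = nn.SeqLSTM(D, H)
lstm2.batchfirst = true
local out2 = lstm2:forward(torch.randn(N, T, D))  -- out2 is N x T x H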

willfrey and others added 2 commits on April 15, 2016:
- Fixed remember comparisons (= to ==)
- Update SeqLSTM.lua [changed = to == in comparison]
@northanapon

A couple of questions after trying out the code. SeqLSTM.batchfirst does not work when remember_state = true; I think the indices of self.cell and self._output might need to be swapped.

Second, when the batch size changes (e.g. during test time), SeqLSTM does not reset its previous states properly, so the batch sizes mismatch. This might also affect training. I am not sure how to implement :forget(). Maybe:

function SeqLSTM:forget()
   parent.forget(self)  -- call the base class forget on this instance
   self:resetStates()   -- clear c0/h0 so the next batch size is accepted
end

@nicholas-leonard
Member Author

@northanapon I fixed your first bug. I wasn't able to reproduce your second bug (see the unit test); maybe it got fixed by the fix to the first. Can you test your second use case again with the newest commit?

@nicholas-leonard changed the title from "SeqLSTM (work in progress)" to "SeqLSTM" on Apr 20, 2016
@nicholas-leonard merged commit 6a71750 into Element-Research:master on Apr 21, 2016
@northanapon

northanapon commented Apr 21, 2016

I tried the new SeqLSTM (from master). The batch size problem still exists when using :remember('both'); the test case resets with :remember('neither'). Here is code to reproduce the error:

lstm = nn.SeqLSTM(10, 10)
lstm.batchfirst = true                   -- NTD layout: batchsize x seqlen x inputsize
lstm:remember('both')                    -- carry hidden/cell state across calls to forward
lstm:training()
lstm:forward(torch.Tensor(32, 20, 10))   -- batch size 32
lstm:evaluate()
lstm:forget()                            -- should reset the remembered states
lstm:forward(torch.Tensor(1, 1, 10))     -- batch size 1 -> size mismatch error

I have to set :remember('both') in order to sample output one step at a time, but :forget() does not reset the c0 and h0 sizes.
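Continuing from the snippet above, this is the sampling pattern I am after once :forget() properly resets the state (illustrative only; one time step per call with batch size 1):

-- after lstm:forget(), sample one step at a time with batch size 1;
-- :remember('both') carries the cell/hidden state over between calls
for t = 1, 5 do
   local y = lstm:forward(torch.randn(1, 1, 10))   -- y is 1 x 1 x 10
end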

@nicholas-leonard
Member Author

@northanapon I see what you mean now. Fixed in 7116a3d. Thanks for pointing this out!
