
Added implementation for GridRNN #1665

Merged: 15 commits into tensorflow:master, Apr 14, 2016
Conversation

@phvu (Contributor) commented Mar 27, 2016

As discussed here: #1453

The implementation is generic: users can specify the number of dimensions and various configurations for those dimensions (input/output/priority/non-recurrent). The type of cell used along the dimensions can also be chosen among LSTM, GRU, and vanilla RNN.

It comes with unit tests for the basic types: 2LSTM (tied weights, non-recurrent), 2BasicLSTM, and 2RNN.

I made a simple test of Grid2LSTM for character-level language modeling: https://github.com/phvu/grid-lstm-tensorflow/tree/master/char-rnn.
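For illustration, here is a minimal sketch of how the pre-built cell classes from this PR might be constructed, assuming the contrib-era API (tensorflow.contrib.grid_rnn). The class names match this thread, but the keyword arguments shown are assumptions:

import tensorflow as tf
from tensorflow.contrib import grid_rnn

# Pre-built configurations corresponding to the unit-tested types above.
# The tied / non_recurrent_fn keyword arguments are assumptions.
cell_tied = grid_rnn.Grid2LSTMCell(num_units=128, tied=True)  # "2LSTM", tied weights
cell_nonrec = grid_rnn.Grid2LSTMCell(num_units=128,
                                     non_recurrent_fn=tf.nn.relu)  # non-recurrent variant
cell_basic = grid_rnn.Grid2BasicLSTMCell(num_units=128)  # "2BasicLSTM"
cell_rnn = grid_rnn.Grid2BasicRNNCell(num_units=128)  # "2RNN"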

@tensorflow-jenkins (Collaborator) commented

Can one of the admins verify this patch?

@ebrevdo (Contributor) commented Mar 29, 2016

Will take a look tomorrow.

from __future__ import print_function

# pylint: disable=unused-import,wildcard-import,line-too-long
from tensorflow.contrib.grid_rnn.python.ops.grid_rnn_cell import *
@ebrevdo (Contributor) commented on the snippet above:

add newline

@phvu (Contributor, Author) replied:

done

@ebrevdo (Contributor) commented Mar 30, 2016

Thanks for the hard work! Some comments.

@phvu (Contributor, Author) commented Mar 31, 2016

@ebrevdo Thanks for the comments. I updated the code.

  • Good to know about input_size and output_size; I updated the code to not depend on them. However, since LSTMCell still accepts input_size, I kept input_size as a parameter in the cell_fn callback. Should it be removed as well?
  • The tests are real. I computed the expected values using this. Since the initializer is fixed in the tests (weights are initialized to 0.5 or 0.2, depending on the test; biases are initialized to zero), the outputs are deterministic; a sketch of this test style is below. But of course I can switch to lightweight asserts if that is preferred.
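(For illustration, a minimal sketch of this style of deterministic test, assuming the contrib-era test API; the real tests assert against the precomputed values, which are not reproduced here:)

import numpy as np
import tensorflow as tf
from tensorflow.contrib import grid_rnn

class GridRNNCellTest(tf.test.TestCase):

  def testGrid2LSTMCellDeterministic(self):
    with self.test_session() as sess:
      # Fix all weights to 0.5 and biases to 0 so the outputs are
      # fully deterministic across runs.
      with tf.variable_scope("root",
                             initializer=tf.constant_initializer(0.5)):
        x = tf.placeholder(tf.float32, [1, 3])
        m = tf.placeholder(tf.float32, [1, 8])  # (c + m) * num_dims = 4 * 2
        cell = grid_rnn.Grid2LSTMCell(2)
        g, s = cell(x, m)
        sess.run(tf.initialize_all_variables())
        g_val, s_val = sess.run([g, s], {x: np.ones([1, 3]),
                                         m: np.zeros([1, 8])})
        self.assertEqual(g_val.shape, (1, 2))
        self.assertEqual(s_val.shape, (1, 8))
        # The real tests additionally compare g_val and s_val
        # elementwise against the precomputed values.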

@ebrevdo (Contributor) commented Apr 4, 2016

Without a unit test that calls tf.nn.rnn or tf.nn.dynamic_rnn, it's not clear that these cells interact correctly with those methods. Can you add such a test for your classes?

@phvu (Contributor, Author) commented Apr 4, 2016

OK, will do.

@phvu (Contributor, Author) commented Apr 6, 2016

I added tests for Grid1LSTM, Grid2LSTM, and Grid3LSTM (with ReLU), trained with tf.nn.rnn. Can you take a look?
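(For context, such a test drives the cell through the standard RNN constructor roughly as follows; a sketch assuming the TF 0.x API, where tf.nn.rnn takes a Python list of per-step input tensors, with arbitrary sizes:)

import tensorflow as tf
from tensorflow.contrib import grid_rnn

batch_size, input_size, num_units, num_steps = 2, 5, 8, 4  # arbitrary

cell = grid_rnn.Grid2LSTMCell(num_units)
# tf.nn.rnn statically unrolls the cell over the list of inputs.
inputs = [tf.placeholder(tf.float32, [batch_size, input_size])
          for _ in range(num_steps)]
outputs, final_state = tf.nn.rnn(cell, inputs, dtype=tf.float32)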

@ebrevdo (Contributor) commented Apr 8, 2016

Will look tomorrow, thanks!

@ebrevdo (Contributor) commented Apr 13, 2016

Jenkins, test this please?

@gunan (Contributor) commented Apr 13, 2016

Can one of the admins verify this patch?

@ebrevdo (Contributor) commented Apr 13, 2016

Sorry for not reviewing earlier. Let's retest this and we can get it merged.

@ebrevdo (Contributor) commented Apr 14, 2016

@phvu can you rebase on HEAD and run the tests?

@vrv commented Apr 14, 2016

@ebrevdo: why does he have to rebase to HEAD? Do you think there will be a conflict in doing so?

Also, only we can trigger tests.

@vrv commented Apr 14, 2016

@tensorflow-jenkins: test this please

@phvu (Contributor, Author) commented Apr 14, 2016

Cool, tests passed. So I don't need to rebase onto master for now?

@ebrevdo (Contributor) commented Apr 14, 2016

Excellent, we're good to go. Thanks for the contribution! If you have any example code using this we can consider adding it elsewhere in the repo (e.g. under models).

@ebrevdo ebrevdo merged commit 97fb7dd into tensorflow:master Apr 14, 2016
@phvu (Contributor, Author) commented Apr 15, 2016

Cool, thanks for all the help.
Unfortunately, I can't share what I am working on with this yet. I have an application of it to character-level language modeling (https://github.com/phvu/grid-lstm-tensorflow/tree/master/char-rnn), but it isn't entirely my code, so I put it in a separate repo.

I think at some point I can try reproducing some of the experiments in the paper. I will submit PRs then.

@phvu phvu deleted the enhancement/grid-rnn branch May 12, 2016 23:33
@jstaker7 commented Jul 18, 2016

Hi @phvu,

Thanks very much for your work. I'm a little confused about how this can be applied generally. For example, in the paper a 3-LSTM network was applied to image patches of the MNIST dataset, where each patch is c * m units long (which is equal to the depth dimension), so we have a 3D grid of c * m vectors. But your implementation of __call__ expects a tensor of size batch * c * m * num_dims. For MNIST, shouldn't it support b * c * m * dim1_size * dim2_size? This would require input_dims to be something like [1, 2] (where dim 0 is batch), but I don't see any tests covering cases other than input_dims=0. Am I missing something?

@phvu (Contributor, Author) commented Jul 19, 2016

Hi @jstaker7,
There are some nuances here.
First, the __call__ method expects a state tensor of shape batch_size * ((c + m) * num_dims). When c = m = 2, c * m happens to equal c + m. Sorry for the confusing test case; I should have used other values.

Second, in the MNIST case (as far as I understand it), I suppose you could use Grid3LSTMCell (https://github.com/tensorflow/tensorflow/blob/master/tensorflow/contrib/grid_rnn/python/ops/grid_rnn_cell.py#L242). This cell receives input and gives output in the first dimension (index 0), and uses LSTM in the other two dimensions.
To replicate Figure 11 in the paper, you will need to construct four 3-LSTM cells, each handling one scan direction of the input image. The first hidden LSTM layer receives the original pixels as input, while every other hidden LSTM layer receives the output of the LSTM just below it.
How to do that is up to you. One simple way is to loop over the image in a given scan direction (say left-to-right, top-to-bottom), which gives a sequence of vectors, and then feed that sequence into the corresponding 3-LSTM cell for that scan direction; see the sketch below. For bigger images, you might want to use tf.scan().

So your assumption that input_dims should be [1, 2] is not true: the cells should receive input at dimension 0. To me this is the only reasonable interpretation of the paper, one which also takes the 1-LSTM and 2-LSTM cells into account (unless the authors release their implementation so that we can compare).
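(To make the scan-direction idea concrete, here is a rough sketch for one of the four directions, left-to-right top-to-bottom, assuming the contrib-era API; the patch and unit sizes are hypothetical:)

import tensorflow as tf
from tensorflow.contrib import grid_rnn

batch_size, patch = 32, 4  # hypothetical sizes
image = tf.placeholder(tf.float32, [batch_size, 28, 28])  # MNIST-shaped input

# Scan the image left-to-right, top-to-bottom into a sequence of
# flattened patch vectors.
patches = []
for i in range(0, 28, patch):
  for j in range(0, 28, patch):
    p = tf.slice(image, [0, i, j], [batch_size, patch, patch])
    patches.append(tf.reshape(p, [batch_size, patch * patch]))

# One 3-LSTM cell handles this scan direction; replicating Figure 11
# would need three more cells, one per remaining direction.
cell = grid_rnn.Grid3LSTMCell(num_units=64)
outputs, state = tf.nn.rnn(cell, patches, dtype=tf.float32)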

@jstaker7 replied:

Hi @phvu,

Thank you, this helps quite a lot. Since each dimension will always have its own LSTM, does that mean we will never use an input_dims other than 0?

@phvu (Contributor, Author) commented Jul 19, 2016

To replicate the experiments in the paper, yes. But I am not sure about that in general.

When we set dimension i to be in input_dims, it simply means that on each __call__, the cell expects an input tensor for dimension i. For non-input dimensions, the cell takes the recurrent values (extracted from the state tensor) as input.

So I guess we can be creative and feed inputs into more than one dimension. I used dimension 0 as the input and output dimension just as a convention. You can just as well construct your own GridRNNCell with any configuration; a sketch follows.
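(As a sketch of such a custom configuration, assuming the constructor parameters discussed in this thread, including the cell_fn(num_units, input_size) callback mentioned above; treat the exact signature as an assumption:)

from tensorflow.contrib import grid_rnn
from tensorflow.python.ops import rnn_cell  # contrib-era import path

# A hypothetical 2-D grid that receives input along both dimensions
# instead of only dimension 0.
cell = grid_rnn.GridRNNCell(
    num_units=16,
    num_dims=2,
    input_dims=[0, 1],   # expect an input tensor for both dimensions
    output_dims=[0],     # emit output from dimension 0 only
    priority_dims=[0],
    tied=False,
    cell_fn=lambda n, i: rnn_cell.LSTMCell(num_units=n, input_size=i))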

@jstaker7 commented Jul 19, 2016

Perfect, thanks so much for the clarification. Super helpful and makes a lot of sense.

This might also be interesting to look into in the future. Grid LSTMs were mentioned and it looks like there was a recent merge: #2560
