
allow symbolic in reshape #287

Merged (1 commit, Jun 8, 2015)
Conversation

skaae (Member) commented Jun 5, 2015

Allow symbolic variables in the shape specification when using the reshape layer.

It's discussed here: https://groups.google.com/forum/#!topic/lasagne-users/eA995V73K8I

One use case is to allow both a variable batch size and a variable sequence length when using recurrent nets.

It passes the tests, but I'm not sure whether it has any undesired consequences.

benanne (Member) commented Jun 5, 2015

Maybe we should check that the TensorVariable is a scalar as well? As it stands, I think you could pass in any tensor and it wouldn't complain until runtime.

Then we can also get rid of that pass statement which looks a bit odd :)

f0k (Member) commented Jun 5, 2015

Your test is missing get_output_shape_for(). All the tests I wrote first check layer.output_shape and then check layer.get_output_for(inputdata).eval(). Add that and you will see you need to adapt ReshapeLayer.get_output_shape_for() as well. Specifically, you need a second for loop after the one taking care of [i]:

# Secondly, replace all symbolic shapes with `None`, as we cannot
# infer their size here.
for dim, o in enumerate(output_shape):
    if isinstance(o, TensorVariable):
        output_shape[dim] = None
        masked_output_shape[dim] = None

You can also merge it with the first for loop if you find a good way of documenting it, otherwise just copy and paste my suggestion as is.
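For reference, a rough sketch of the two-step test pattern described above (first check layer.output_shape, then evaluate get_output_for()), assuming this PR's behavior; the test name, shapes and values are made up for illustration:

import numpy as np
import theano.tensor as T
from lasagne.layers import InputLayer, ReshapeLayer

def test_reshape_with_symbolic_dim():
    n = T.iscalar()                      # symbolic size used in the shape spec
    l_in = InputLayer((None, None, 10))
    l_rs = ReshapeLayer(l_in, (n, -1, 10))
    # 1) static shape inference: symbolic dimensions should come out as None
    assert l_rs.output_shape == (None, None, 10)
    # 2) the actual output, evaluated on concrete data
    x = np.zeros((4, 3, 10), dtype='float32')
    out = l_rs.get_output_for(T.constant(x)).eval({n: 2})
    assert out.shape == (2, 6, 10)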


Apart from that, I don't understand your use case. Why do you need a ReshapeLayer after the EmbeddingLayer? Shouldn't the embedding layer give you the correct shape already?

skaae (Member Author) commented Jun 6, 2015

Yeah, I see the test is not correct.

My use case is for recurrent nets. When you combine recurrent nets and feed-forward nets you need to reshape between the two layouts:

Recurrent: (batch_size, seq_len, num_units)  <->  Feedforward: (batch_size*seq_len, num_units)

If you want both seq_len and batch_size to be flexible, you need to let the reshape depend on a symbolic variable.
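For illustration only, a minimal sketch of this use case, assuming the symbolic-shape support from this PR (the sizes and layer names are made up):

import numpy as np
import theano.tensor as T
import lasagne
from lasagne.layers import InputLayer, ReshapeLayer

num_units = 12
x = T.tensor3()                     # (batch_size, seq_len, num_units)
batch_size, seq_len, _ = x.shape    # symbolic scalars, known only at run time

l_in = InputLayer((None, None, num_units))
# Recurrent -> feedforward: -1 is enough to collapse the two leading axes.
l_flat = ReshapeLayer(l_in, (-1, num_units))
# Feedforward -> recurrent: going back needs the symbolic sizes themselves.
l_seq = ReshapeLayer(l_flat, (batch_size, seq_len, num_units))

out = lasagne.layers.get_output(l_seq, x).eval(
    {x: np.ones((2, 5, num_units), dtype='float32')})
assert out.shape == (2, 5, num_units)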

skaae (Member Author) commented Jun 6, 2015

I addressed @f0k's and @benanne's comments.

elif isinstance(s, T.TensorVariable):
    if s.ndim != 0:
        raise ValueError(
            "The symbolic variable specifying shape must be a "
benanne (Member) commented on this code:

Maybe "A symbolic variable in a shape specification must be a scalar, but had %i dimensions"? The variable technically doesn't specify the shape, it only specifies one dimension.

benanne (Member) commented Jun 7, 2015

Couldn't resist a bit of nitpicking, sorry! :) Apart from that error message everything looks great. I'm okay with merging either way, as I said myself we shouldn't be nitpicking about docs / error messages too much just yet (see #279).

ReshapeLayer is getting pretty complicated... but I guess it already was before anyway.

skaae (Member Author) commented Jun 7, 2015

No worries. I think it's important to have meaningful errors, and your suggestion is more precise :)

Maybe I should also add an example?

skaae (Member Author) commented Jun 7, 2015

Updated with @benanne's error message and added an example with a symbolic dimension specification.

benanne (Member) commented Jun 7, 2015

Sweet, looks good to me. I'll leave this to @f0k to merge in case he has any further comments.

>>> l_in = InputLayer((None, None, 10))
>>> l1 = ReshapeLayer(l_in, (batch_size*seq_len, [2]))
>>> l1.output_shape
(None, 10)
f0k (Member) commented on this code:

That's not a good example, because you could just do ReshapeLayer(l_in, (-1, [2])) for that. Better to remove the example than to keep people wondering.

skaae (Member Author) replied:

True. Maybe we should remove the example from here and add an example in the recurrent docs?

f0k (Member) commented Jun 7, 2015

Minor comments, but the code and the tests look good! So the use case for this is going from (seqlen*bs, N) back to (bs, seqlen, N), right? Going from (bs, seqlen, N) to (seqlen*bs, N) doesn't need this PR, that's why I'm asking. I think we don't need to illustrate this in the docstring then; it gets too involved to construct a case that cannot be done without symbolic variables.

skaae (Member Author) commented Jun 7, 2015

I removed the blanks. I agree with your comment about the example; it is probably more confusing than helpful.
You are correct about the use case: you can go from (bs, seqlen, N) to (bs*seqlen, N) with (-1, N), but not in the opposite direction.

As I mentioned above, I think we should add an example in the recurrent docs. Do you agree?

f0k (Member) commented Jun 7, 2015

As I mentioned above, I think we should add an example in the recurrent docs. Do you agree?

Yes, that seems to be a good place! You'd need some example of how to use a custom subnetwork for the input-to-hidden or hidden-to-hidden connections anyway, and this will include the reshapes (unless it's hidden in the recurrent layer class, but I'm not sure if that'd be good).

skaae (Member Author) commented Jun 7, 2015

Yes. I'll create an issue in @craffel's repo so we won't forget.

skaae (Member Author) commented Jun 8, 2015

I removed the new example and removed the blanks.

benanne (Member) commented Jun 8, 2015

Nice work, merging!

benanne added a commit that referenced this pull request Jun 8, 2015
allow symbolic in reshape
benanne merged commit b5f9794 into Lasagne:master on Jun 8, 2015
skaae (Member Author) commented Jun 8, 2015

I'm writing the example with recurrent layers + dynamic reshape and I think I found a bug / undesired behavior.
My code is:

from lasagne.layers import *
import lasagne
from lasagne.nonlinearities import softmax
import theano.tensor as T
import numpy as np
x, y = T.tensor3(), T.matrix()
batchsize, seqlen, _ = x.shape

num_inputs, num_units, num_classes = 10, 12, 5
l_inp = InputLayer((batchsize, seqlen, 10))
l_lstm_fwd = LSTMLayer(l_inp, num_units=num_units)
l_lstm_bck = LSTMLayer(l_inp, num_units=num_units, backwards=True)
l_lstm = ConcatLayer([l_lstm_fwd, l_lstm_bck], axis=2)
l_shp1 = ReshapeLayer(l_lstm, (-1, 2*num_units)) #<<<<<<<<<problem
l_softmax = DenseLayer(l_shp1, num_units=num_classes, nonlinearity=softmax)
l_out = ReshapeLayer(l_softmax, (batchsize, seqlen, num_classes))

lasagne.layers.get_output(
    l_out, x).eval({x: np.ones((10, 5, num_inputs), dtype='float32')})

If I have -1 in the marked line I get an error from the sanity check in line 155. If I replace it with
batchsize*seqlen it runs.

I haven't fully figured out what the sanity check does, but thought I would mention it anyway.

I think that the error message in line 98 should be changed to something like:

raise ValueError("shape must be a tuple of int, [int], -1 or symbolic variables")

skaae (Member Author) commented Jun 8, 2015

The problem seems to be that line 130 ff. replaces TensorVariables with None:

        for dim, o in enumerate(output_shape):
            if isinstance(o, T.TensorVariable):
                output_shape[dim] = None
                masked_output_shape[dim] = None

which then causes output_size to become None in line 137:

        output_size = (None if any(x is None for x in masked_output_shape)
                       else np.prod(masked_output_shape))

I think it can be solved by checking for tensors:

        def has_tensor_input(lst):
            return any(map(lambda v: isinstance(v, T.TensorVariable), lst))

        has_tensor = (has_tensor_input(masked_input_shape)) or (
            has_tensor_input(masked_output_shape))
        del masked_input_shape, masked_output_shape
        # Finally, infer value for -1 if needed
        if -1 in output_shape:
            dim = output_shape.index(-1)
            if (input_size is None) or (output_size is None):
                output_shape[dim] = None
                output_size = None
            else:
                output_size *= -1
                output_shape[dim] = input_size // output_size
                output_size *= output_shape[dim]
        # Sanity check
        if (input_size is not None) and (output_size is not None) \
           and (input_size != output_size) and not has_tensor:
            raise ValueError("%s cannot be reshaped to specification %s. "
                             "The total size mismatches." %
                             (input_shape, self.shape))

skaae (Member Author) commented Jun 8, 2015

Sorry for making this so long :) But I figured that maybe I'm not supposed to give input sizes that are not int/[int] in the input layer? The reshape layer works if I use

l_inp = InputLayer((None, None, 10))

benanne (Member) commented Jun 8, 2015

Indeed, symbolic scalars are not supported in the shape specification of InputLayer. Maybe they should be? It'd be better to open a new issue for this though.

skaae (Member Author) commented Jun 8, 2015

OK. I guess it's old habit to pass anything called batchsize into the input layer.

f0k (Member) commented Jun 16, 2015

Indeed, symbolic scalars are not supported in the shape specification of InputLayer. Maybe they should be?

No, we want the shapes to be non-symbolic. InputLayer((None, None, 10)) is the correct solution if the first two sizes are only known at run-time. The only thing we could do is replace symbolic dimensions with None automatically in the constructor, but that would complicate the documentation.
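Putting this together with the snippet above, the working version would presumably look as follows (a sketch only; it assumes LSTMLayer and ConcatLayer are available exactly as used in the earlier snippet):

from lasagne.layers import *
import lasagne
from lasagne.nonlinearities import softmax
import theano.tensor as T
import numpy as np

x = T.tensor3()
batchsize, seqlen, _ = x.shape

num_inputs, num_units, num_classes = 10, 12, 5
# Non-symbolic shape in the InputLayer; unknown sizes are simply None.
l_inp = InputLayer((None, None, num_inputs))
l_lstm_fwd = LSTMLayer(l_inp, num_units=num_units)
l_lstm_bck = LSTMLayer(l_inp, num_units=num_units, backwards=True)
l_lstm = ConcatLayer([l_lstm_fwd, l_lstm_bck], axis=2)
l_shp1 = ReshapeLayer(l_lstm, (-1, 2*num_units))
l_softmax = DenseLayer(l_shp1, num_units=num_classes, nonlinearity=softmax)
# The symbolic sizes are only needed to reshape back to (bs, seqlen, classes).
l_out = ReshapeLayer(l_softmax, (batchsize, seqlen, num_classes))

lasagne.layers.get_output(
    l_out, x).eval({x: np.ones((10, 5, num_inputs), dtype='float32')})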

skaae deleted the reshape branch on June 16, 2015, 21:27