
Recurrent layers: Accept layer for hid_init #462

Closed
f0k opened this issue Oct 15, 2015 · 11 comments
@f0k
Member

f0k commented Oct 15, 2015

The recurrent layers currently have some custom behaviour when hid_init is a TensorVariable rather than a shared variable, callable or numpy array: they assume hid_init is a tensor of one order higher than usual (to include the batch dimension), and they assume hid_init is not to be learned.

With #404, parameters can be arbitrary Theano expressions anyway, and overriding that behaviour to assume a different dimensionality is a bit awkward. As discussed in #11 (comment) and following, a better solution would be allowing hid_init to be a Layer, and assuming it would include the batch dimension in this case. This would also be a step towards supporting the encoder-decoder architecture discussed in #391 (comment) and following.
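As a rough sketch, the proposed interface could look like the following (this is not the behaviour at the time of writing; layer names, shapes and unit counts are made up for illustration):

```python
import lasagne

# Main input sequence: (batch_size, seq_len, num_features)
l_in = lasagne.layers.InputLayer(shape=(None, 20, 30))

# Some layer producing a (batch_size, num_units) output, e.g. an encoder summary
l_enc_in = lasagne.layers.InputLayer(shape=(None, 50))
l_enc = lasagne.layers.DenseLayer(l_enc_in, num_units=40)

# Proposed: pass the Layer directly as hid_init; its output (which
# includes the batch dimension) would serve as the initial hidden state.
l_rec = lasagne.layers.RecurrentLayer(l_in, num_units=40, hid_init=l_enc)
```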

@craffel
Member

craffel commented Oct 19, 2015

This seems reasonable to me. I think @skaae is better suited to comment, as he added that functionality with a specific use case (next-character prediction) in mind. I think the new recurrent containers will be a better way to handle that use case anyway.

@skaae
Member

skaae commented Oct 19, 2015

Yes, it's better to require `hid_init` to be a layer. For "my" use case you can just wrap the tensor in an input layer.
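For reference, a minimal sketch of how the existing TensorVariable use case could be expressed once `hid_init` must be a layer (shapes and names are illustrative):

```python
import theano.tensor as T
import lasagne

# Symbolic initial hidden state with an explicit batch dimension:
# shape (batch_size, num_units)
hid_init_var = T.matrix('hid_init')

# Wrap it in an InputLayer so it can be passed as hid_init
l_hid_init = lasagne.layers.InputLayer(shape=(None, 40),
                                       input_var=hid_init_var)
```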

@craffel
Member

craffel commented Oct 19, 2015

Ok, cool. @f0k, can you assign this to me? Unless one of you wants to do it.

@f0k
Member Author

f0k commented Oct 19, 2015

You're welcome to tackle this! :) And I'll be glad to review.

@skaae
Member

skaae commented Oct 26, 2015

Remember to return the parameters when hid_init/cell_init is a layer. We got a question about this on the mailing list.

@f0k
Member Author

f0k commented Oct 29, 2015

Remember to return the parameters when hid_init/cell_init is a layer.

No need to do that manually: hid_init would become part of the layer's incoming layers (i.e., it would be passed to the MergeLayer super constructor). Otherwise it couldn't properly participate in get_output() anyway.
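For instance (a sketch, assuming hid_init is wired in through the MergeLayer constructor as described above; layer names and shapes are made up), any parameters of the hid_init layer would be collected automatically:

```python
import lasagne

l_in = lasagne.layers.InputLayer(shape=(None, 20, 30))
l_init = lasagne.layers.DenseLayer(
    lasagne.layers.InputLayer(shape=(None, 50)), num_units=40)
l_rec = lasagne.layers.RecurrentLayer(l_in, num_units=40, hid_init=l_init)

# Because l_init would be an incoming layer of l_rec, its W and b are
# returned here without any manual bookkeeping:
params = lasagne.layers.get_all_params(l_rec, trainable=True)
```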

@skaae
Member

skaae commented Oct 30, 2015

aah yes :)

@skaae
Member

skaae commented Dec 18, 2015

@craffel: Do you plan to work on this soon? Otherwise I can create a PR.

@f0k
Member Author

f0k commented Dec 18, 2015

Do you plan to work on this soon? Otherwise I can create a PR.

Look above your comment: there is a PR, #522! :)

You could start a PR on top of that which removes the TensorVariable special case, though. (I.e., do not base it on master, but base it on #522, as we're going to merge that anyway.)

@skaae
Member

skaae commented Dec 18, 2015

Missed that :) I'll remove the TensorVariable special case when #522 is merged.

@craffel
Member

craffel commented Dec 18, 2015

@craffel: Do you plan to work on this soon? Otherwise I can create a PR.

I was planning on it, but I'm glad someone else did it :)
