
Looping over tensor / using symbolic variable. #23

Open
elanmart opened this issue Sep 10, 2015 · 3 comments

@elanmart

Hello, if I'm understanding the code correctly, there is currently no way to write loops with a symbolic number of iterations (or to loop over the leading dimension of a tensor, as Theano's Scan op does).

Are there plans to add such functionality to CGT? If not, what would be the recommended way of processing variable-length sequences? Is there a Switch operator, so that one could write:

```python
output = init_output()
for t in range(max_num_steps):
    output = cgt.switch(X.shape[0] > t, make_step(X[t], output), output)
```

But wouldn't this create huge overhead if the difference between the shortest and longest sequences in the dataset is large?

Hope it makes sense to post this question here instead of the mailing list.

@joschu
Owner

joschu commented Sep 10, 2015

Right, this functionality currently isn't implemented.
It'll certainly be possible to implement a Scan-like Op (in fact, I've implemented something similar before).
But I think it'll take some thought to figure out what's the right way to do it here.
Theano's scan doesn't have the friendliest syntax, and the Scan code is very intricate, suggesting that it might not be exactly the right abstraction.

Unfortunately, the switch method you suggested won't currently work: if you write a Switch Op, both of its operands will be evaluated.
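For intuition, an eager switch behaves like an ordinary function call: both branch expressions are computed before the selection happens. A minimal plain-Python sketch of the point (not CGT code; the names here are made up):

```python
def eager_switch(cond, a, b):
    # By the time we get here, a and b have both been computed already.
    return a if cond else b

def expensive_branch():
    print("branch evaluated")  # runs even when cond is False
    return 1

# Both arguments are evaluated before eager_switch is entered,
# so the "unused" branch still does its work:
result = eager_switch(False, expensive_branch(), 0)  # prints, then returns 0
```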

I think the right solution to this problem is to have some mechanism for branching/looping that gets built into the "execution graph" (the final data structure used to perform the computation).

The other temporary solution, which would work, is to make careful use of "masks", which zero out some components of the data or recurrent state and allow you to train on variable-length inputs in batch form.
AFAIK most high-performance RNN code uses this sort of scheme.
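To make the mask idea concrete, here's a minimal NumPy sketch (not CGT code; the tanh update is a stand-in for a real recurrent step):

```python
import numpy as np

# Batch of B=3 padded sequences with true lengths 4, 2, 3; T=4 steps, D=2 features.
lengths = np.array([4, 2, 3])
T, B, D = 4, 3, 2
X = np.random.randn(T, B, D)

# mask[t, b] = 1.0 while sequence b is still active at step t, else 0.0.
mask = (np.arange(T)[:, None] < lengths[None, :]).astype(X.dtype)  # (T, B)

h = np.zeros((B, D))
for t in range(T):
    h_new = np.tanh(X[t] + h)      # placeholder recurrent update
    m = mask[t][:, None]           # (B, 1), broadcasts over the feature dim
    h = m * h_new + (1 - m) * h    # state freezes once a sequence has ended
```

Because the update is a plain elementwise blend, the whole batch runs for T steps, but finished sequences just carry their final state forward unchanged.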

Edit: using switch would work (though there's overhead); you'd just have to clip the index so you don't get an out-of-bounds error. The following hacky solution almost works (one just needs to implement minimum):

```python
output = (X.shape[0] > t) * make_step(X[cgt.minimum(t, X.shape[0] - 1)], output) + (X.shape[0] <= t) * output
```
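As a sanity check on the arithmetic-select trick, here is a plain NumPy analogue (make_step is a placeholder; this is not the CGT API):

```python
import numpy as np

def make_step(x, prev):
    return np.tanh(x + prev)  # stand-in recurrent update

X = np.random.randn(3, 2)     # actual sequence length 3
output = np.zeros(2)
max_num_steps = 5
for t in range(max_num_steps):
    active = float(X.shape[0] > t)   # 1.0 while t is in range, else 0.0
    i = min(t, X.shape[0] - 1)       # clipped index: no out-of-bounds read
    # Both branches are still computed (hence the overhead), but the clipped
    # index keeps the dead branch from indexing past the end of X.
    output = active * make_step(X[i], output) + (1 - active) * output
```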

@elanmart
Author

elanmart commented Oct 5, 2015

Thanks. Is it possible to implement a lazy IfElse op in cgt?

@joschu
Owner

joschu commented Oct 8, 2015

Not yet, because of how the graph execution works. I'm going to open up an issue for further discussion on this topic; @hojonathanho and I have discussed it a bit.
