WIP: state preserving LSTM #4

freewym · 2015-12-23T19:14:18Z

No description provided.

vijayaditya · 2015-12-29T19:52:02Z

egs/wsj/s5/steps/nnet3/components.py

+    if (state_preserving == "false"):
+        component_nodes.append("component-node name={0}_f1 component={0}_W_f-xr input=Append({1}, IfDefined(Offset({0}_{2}, {3})))".format(name, input_descriptor, recurrent_connection, lstm_delay))
+    else:
+        component_nodes.append("component-node name={0}_f1 component={0}_W_f-xr input=Append({1}, Failover(Offset({0}_{2}, {3}), Offset(output_{0}_{2}_STATE_PREVIOUS_MINIBATCH, {3})))".format(name, input_descriptor, recurrent_connection, lstm_delay))
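To see what the two branches of this hunk emit, the formatting can be pulled into a small standalone helper. This is only a sketch, not code from the PR: the function name `forget_gate_node`, the boolean `state_preserving` argument (the diff compares against the string "false"), and the sample argument values are hypothetical. Only the format strings are taken from the diff.

```python
def forget_gate_node(name, input_descriptor, recurrent_connection,
                     lstm_delay, state_preserving):
    """Build the config line for the forget-gate node (hypothetical helper).

    The format strings mirror the two branches in the diff; everything else
    is an assumption made for illustration.
    """
    if state_preserving:
        # Fall back to the state saved from the previous minibatch when the
        # in-chunk recurrent input is not available.
        recurrent = ("Failover(Offset({0}_{1}, {2}), "
                     "Offset(output_{0}_{1}_STATE_PREVIOUS_MINIBATCH, {2}))"
                     ).format(name, recurrent_connection, lstm_delay)
    else:
        # Original behaviour: the recurrent input is used only if defined.
        recurrent = "IfDefined(Offset({0}_{1}, {2}))".format(
            name, recurrent_connection, lstm_delay)
    return ("component-node name={0}_f1 component={0}_W_f-xr "
            "input=Append({1}, {2})").format(name, input_descriptor, recurrent)


if __name__ == "__main__":
    # Sample values are made up for the example.
    print(forget_gate_node("Lstm1", "input", "r_t", -1, False))
    print(forget_gate_node("Lstm1", "input", "r_t", -1, True))
```

The only difference between the two outputs is the second argument of `Append`, which is where the question below about `Offset` on the `STATE_PREVIOUS_MINIBATCH` input applies.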


Do we need to use the Offset descriptor even for the STATE_PREVIOUS_MINIBATCH variable? Isn't its value constant at all time steps?

If the lstm delay reaches further back than -1, e.g. -3, then we need the 3 previous states at t=-3, -2, -1 for the frames at t=0, 1, 2 respectively, in which case we need the Offset descriptor with value -3.
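The arithmetic in this reply can be checked with a short sketch. The function name `frames_needing_previous_state` and the `chunk_size` parameter are hypothetical, introduced only for illustration. The sketch assumes frames inside a chunk are numbered from t=0 and that the delay is negative.

```python
def frames_needing_previous_state(chunk_size, lstm_delay):
    """Map each frame t whose recurrent input lies before the chunk start
    to the time index (t + lstm_delay) it needs from the previous minibatch.

    Assumes frames in the chunk are numbered 0 .. chunk_size - 1 and that
    lstm_delay is a negative integer.
    """
    if lstm_delay >= 0:
        raise ValueError("lstm_delay is expected to be negative")
    return {t: t + lstm_delay
            for t in range(chunk_size)
            if t + lstm_delay < 0}


if __name__ == "__main__":
    # With a delay of -3, frames 0, 1, 2 need the states at t = -3, -2, -1.
    print(frames_needing_previous_state(20, -3))
    # With a delay of -1, only frame 0 reaches back, to t = -1.
    print(frames_needing_previous_state(20, -1))
```

With a delay of -1 a single stored time step is enough. With a longer delay the number of stored time steps equals the magnitude of the delay.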

I don't think it is possible to store the previous state from multiple time steps in the same variable in the current framework. Did you try using this network with larger offsets and check that it works? I think we need separate variables to store each time offset from the previous minibatch.

In my implementation, I add to each example an additional input io with multiple time steps of the same variable (lines 108-130 in nnet3-add-recurrent-io-to-egs.cc), in the same way as an ordinary input. I think this works as long as the Index of each additional input io is correct (in terms of computability of all the component nodes after building the computation graph).

Nice that you managed to get this done with minimal variables.

freewym force-pushed the splstm branch 28 times, most recently from 717e6ca to 3024543 Compare December 29, 2015 19:11

vijayaditya reviewed Dec 29, 2015
freewym force-pushed the splstm branch 7 times, most recently from c41ddd2 to 342c665 Compare January 23, 2016 17:01

freewym force-pushed the splstm branch 10 times, most recently from 4be93e2 to 6a43192 Compare July 1, 2016 22:27

starting state preserving LSTM

0d55353

freewym force-pushed the splstm branch from 6a43192 to 0d55353 Compare July 2, 2016 00:03

changes to extract recurrent offsets from GeneralDescriptor

07e08c2

freewym force-pushed the splstm branch from 3fcee02 to 07e08c2 Compare July 2, 2016 07:31

add the option to use SplitIntoRanges() in chain/ to get starts of the egs for large chunks. It replaces the function of the previous option left-shift-window, but I have not changed its name and description yet

509af43

freewym force-pushed the splstm branch 4 times, most recently from 7014a93 to f17b524 Compare July 3, 2016 16:21

add state preserving training support to python scripts

d8a9a26

freewym force-pushed the splstm branch from f17b524 to d8a9a26 Compare July 6, 2016 23:33

fix a possible bug in counting num_minibatches_processed when updating stats in training if we have more than one output node

cc61676

freewym pushed a commit that referenced this pull request Apr 6, 2017

Merge pull request #4 from freewym/shortcut2

ae9b986

cuda kernels for sparse matrix affine forward/backward prop
