About net topology and activation function in eesen #65
We use the standard activation functions for the (BI)LSTM; you can find more details in Section 2 of the following paper. The cell and hidden output dimensions are the same. The dimension can be set by changing "lstm_cell_dim" in run_ctc_xxx.sh; by default it is 320.
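As a minimal illustration of the point above, here is a single step of a vanilla LSTM in numpy, using the standard activations (logistic sigmoid for the gates, tanh for the cell candidate and output). The weight layout makes it explicit why the cell and hidden output share one dimension; the shapes and variable names here are illustrative, not Eesen's actual code.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, W, b):
    """One step of a vanilla LSTM (no peepholes, for brevity).
    Gates use the logistic sigmoid; cell candidate and output use
    tanh. W has shape (4*dim, input_dim + dim), so the cell state
    and the hidden output necessarily share the same dimension."""
    dim = h_prev.shape[0]
    z = W @ np.concatenate([x, h_prev]) + b
    i = sigmoid(z[0 * dim:1 * dim])   # input gate
    f = sigmoid(z[1 * dim:2 * dim])   # forget gate
    o = sigmoid(z[2 * dim:3 * dim])   # output gate
    g = np.tanh(z[3 * dim:4 * dim])   # cell candidate
    c = f * c_prev + i * g            # new cell state
    h = o * np.tanh(c)                # new hidden output
    return h, c

# lstm_cell_dim = 320, matching the default in run_ctc_xxx.sh;
# input_dim is an arbitrary illustrative feature dimension.
dim, input_dim = 320, 120
rng = np.random.default_rng(0)
W = rng.standard_normal((4 * dim, input_dim + dim)) * 0.01
b = np.zeros(4 * dim)
h, c = lstm_step(rng.standard_normal(input_dim),
                 np.zeros(dim), np.zeros(dim), W, b)
print(h.shape, c.shape)  # both (320,)
```

Because h is produced as o * tanh(c), its dimension is tied to that of c; separating them requires an extra projection, which is what the later comment on affine transforms is about.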
Thanks a lot!
When you said "sub-sampling", I assume you meant sub-sampling the frames. This is hard because we pad the utterances to the same length within a batch: your sub-sampling layer would need to figure out which frames are padding and do the backpropagation accordingly. You may simply try inserting an affine transform between two LSTM layers, which projects the LSTM outputs into a presumably lower-dimensional vector. But this may introduce training instability, as the projection layer is not recurrent. In its vanilla definition, an LSTM has the same cell and output dimension; Google's LSTM with a recurrent projection layer is not supported.
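The affine-transform workaround described above can be sketched as follows. This is a hypothetical numpy illustration of the idea only; in Eesen the component would be configured in the network prototype, and all names and dimensions here (a 320-dimensional LSTM output projected to 128) are assumptions for the example.

```python
import numpy as np

def affine_project(H, W, b):
    """Apply a (non-recurrent) affine transform to per-frame LSTM
    outputs, projecting each frame to a lower dimension before it
    is fed to the next LSTM layer."""
    return H @ W.T + b

T, lstm_dim, proj_dim = 50, 320, 128   # illustrative sizes
rng = np.random.default_rng(1)
H = rng.standard_normal((T, lstm_dim))           # first LSTM layer's outputs
W = rng.standard_normal((proj_dim, lstm_dim)) * 0.01
b = np.zeros(proj_dim)
P = affine_project(H, W, b)                      # input to the next LSTM layer
print(P.shape)  # (50, 128)
```

Note the projection acts independently on each frame, which is exactly why it carries no recurrent state and can destabilize training, unlike the recurrent projection in Google's LSTMP architecture mentioned above.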
Thanks! |
Hi, Yajie
What activation function is used in the BILSTM model in eesen? How can I change the activation function? And how can I set the cell dimension and the recurrent (output) dimension separately in eesen?
Thanks!