
About LSTM Layer in GroundHog #23

Closed
gaoyuankidult opened this issue Nov 11, 2014 · 6 comments

Comments

@gaoyuankidult

Hello

I have checked the LSTM layer in GroundHog/groundhog/layers/rec_layers.py. I wonder whether this is a complete, standard LSTM layer (e.g., as described in Alex Graves's paper, http://arxiv.org/pdf/1308.0850v5.pdf) or a prototype for now.

I didn't see a bias term in the `# input/output gate update` section. Did I miss it? By the way, do you have an example that uses this layer?

Thanks.

By the way, I could write a wiki (tutorial) about it if there is an example.

Formulas described in the paper:

[image: LSTM equations from Graves's paper]

Code of the LSTM layer:

[image: LSTM implementation from rec_layers.py]
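For reference, the Graves-style LSTM update with bias terms on every gate can be sketched as follows. This is a minimal NumPy sketch, not GroundHog's actual code: the parameter names are my own, and the peephole connections from Graves's paper are omitted for brevity.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def lstm_step(x, h_prev, c_prev, p):
    """One Graves-style LSTM step. Note the bias terms b_i, b_f, b_c, b_o:
    these are the terms the question is about (peepholes omitted here)."""
    i = sigmoid(x @ p["W_xi"] + h_prev @ p["W_hi"] + p["b_i"])  # input gate
    f = sigmoid(x @ p["W_xf"] + h_prev @ p["W_hf"] + p["b_f"])  # forget gate
    c = f * c_prev + i * np.tanh(x @ p["W_xc"] + h_prev @ p["W_hc"] + p["b_c"])
    o = sigmoid(x @ p["W_xo"] + h_prev @ p["W_ho"] + p["b_o"])  # output gate
    h = o * np.tanh(c)                                           # new hidden state
    return h, c
```

Dropping the `b_*` vectors from the gate pre-activations gives the bias-free variant the GroundHog code appears to implement.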

@rizar

rizar commented Nov 13, 2014

@kyunghyuncho , you wrote the LSTM layer.

@kyunghyuncho

Sorry about the late reply!

As @gaoyuankidult correctly noticed, the implementation in GroundHog lacks the bias terms for the gaters, which I believe wouldn't make much difference. Also, note that there are a number of variants of LSTMs (see, e.g., http://arxiv.org/pdf/1409.2329.pdf).

You can train a neural machine translation model using an LSTM by setting:

    state['enc_rec_layer'] = 'LSTMLayer'                                                          
    state['enc_rec_gating'] = False
    state['enc_rec_reseting'] = False
    state['dec_rec_layer'] = 'LSTMLayer'                                                          
    state['dec_rec_gating'] = False
    state['dec_rec_reseting'] = False                                                             
    state['dim_mult'] = 4

That said, this feature was only tested internally, quite some time ago. If you run into any issues with it, please leave a comment here with the details and I'll take a look.
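For concreteness, GroundHog experiments configure models through a plain Python dict of state variables, so overrides like these are typically layered on top of a prototype state. A minimal sketch (the helper name and the base dict are illustrative, not GroundHog's actual code):

```python
def lstm_state(base_state):
    """Return a copy of a GroundHog-style state dict with the encoder and
    decoder recurrent layers switched to LSTMLayer. The LSTM computes four
    gate pre-activations per step, hence dim_mult = 4."""
    state = dict(base_state)  # don't mutate the shared prototype
    state['enc_rec_layer'] = 'LSTMLayer'
    state['enc_rec_gating'] = False
    state['enc_rec_reseting'] = False
    state['dec_rec_layer'] = 'LSTMLayer'
    state['dec_rec_gating'] = False
    state['dec_rec_reseting'] = False
    state['dim_mult'] = 4
    return state
```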

@infinitezxc
Copy link

Hello @kyunghyuncho

When I try to train a model using LSTM, I run into this error:

ValueError: dimension mismatch in args to gemm (512,2000)x(1000,1000)->(512,1000)
Apply node that caused the error: GpuDot22(GpuReshape{2}.0, W_0_dec_repr_readout)
Inputs types: [CudaNdarrayType(float32, matrix), CudaNdarrayType(float32, matrix)]
Inputs shapes: [(512, 2000), (1000, 1000)]
Inputs strides: [(2000, 1), (1000, 1)]
Inputs values: ['not shown', 'not shown']

Do you have any suggestions? Thanks!
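The error itself is a plain shape check in a matrix multiply: the readout weight `W_0_dec_repr_readout` is sized (1000, 1000) but the incoming activations are (512, 2000), i.e. twice the expected width. One possible cause (speculation on my part, not confirmed in this thread) is that the LSTM's state is wider than the readout was sized for, e.g. because of `dim_mult` or concatenated hidden/cell states. The shape mismatch can be reproduced in plain NumPy:

```python
import numpy as np

batch, width, dim = 512, 2000, 1000
activations = np.zeros((batch, width), dtype=np.float32)  # decoder output
W_readout = np.zeros((dim, dim), dtype=np.float32)        # readout weight

try:
    activations @ W_readout  # (512,2000) x (1000,1000): same mismatch gemm reports
except ValueError as e:
    print("shape mismatch:", e)

# The product only goes through once the weight's input dimension
# matches the activation width:
W_fixed = np.zeros((width, dim), dtype=np.float32)
out = activations @ W_fixed
print(out.shape)  # (512, 1000)
```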

@kyunghyuncho

Can you provide your state variables?


@infinitezxc

Thanks for the quick reply :) @kyunghyuncho

I am using the default state from prototype_phrase_lstm_state(), with the source and target paths modified and a prototype_phrase_lstm_state entry added at the end of __init__.py.

@guxd

guxd commented Jan 13, 2016

@infinitezxc Have you solved the LSTM error? I ran into the same error.
