lstm_layer mask #4

ThinkingSlow · 2017-09-22T02:10:51Z

Hello, I'm following your work, and try to reimplement ESIM by tensorflow.
I noticed in you lstm_layer() , you masked c and h, I'm wondering how much will the mask improve the model compared with the no-mask(just basic LSTM).
And how much will the ortho_weight help?
Thank you so much.

lukecq1231 · 2017-09-22T20:57:30Z

Hi, the mask is used to deal with the sentences with difference lengths in one minibatch. Actually, I did not try the experiments about mark/no-mask and ortho_weight/no ortho_weight. I will test it. Thanks for your questions.

lukecq1231 · 2017-09-28T16:34:35Z

I try some simple experiments. Baseline is 88.0% on test set; if no-mask, the accuracy is 87.7%; if no ortho_weight, the accuracy is 87.5%. I hope that answered your question.

ThinkingSlow · 2017-11-20T11:41:16Z

Thank you soso much~~

ThinkingSlow changed the title ~~parameters in code and paper are different~~ lstm_layer mask Sep 22, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

lstm_layer mask #4

lstm_layer mask #4

ThinkingSlow commented Sep 22, 2017 •

edited

Loading

lukecq1231 commented Sep 22, 2017

lukecq1231 commented Sep 28, 2017

ThinkingSlow commented Nov 20, 2017

lstm_layer mask #4

lstm_layer mask #4

Comments

ThinkingSlow commented Sep 22, 2017 • edited Loading

lukecq1231 commented Sep 22, 2017

lukecq1231 commented Sep 28, 2017

ThinkingSlow commented Nov 20, 2017

ThinkingSlow commented Sep 22, 2017 •

edited

Loading