Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

lstm_layer mask #4

Open
ThinkingSlow opened this issue Sep 22, 2017 · 3 comments
Open

lstm_layer mask #4

ThinkingSlow opened this issue Sep 22, 2017 · 3 comments

Comments

@ThinkingSlow
Copy link

ThinkingSlow commented Sep 22, 2017

Hello, I'm following your work, and try to reimplement ESIM by tensorflow.
I noticed in you lstm_layer() , you masked c and h, I'm wondering how much will the mask improve the model compared with the no-mask(just basic LSTM).
And how much will the ortho_weight help?
Thank you so much.

@ThinkingSlow ThinkingSlow changed the title parameters in code and paper are different lstm_layer mask Sep 22, 2017
@lukecq1231
Copy link
Owner

Hi, the mask is used to deal with the sentences with difference lengths in one minibatch. Actually, I did not try the experiments about mark/no-mask and ortho_weight/no ortho_weight. I will test it. Thanks for your questions.

@lukecq1231
Copy link
Owner

I try some simple experiments. Baseline is 88.0% on test set; if no-mask, the accuracy is 87.7%; if no ortho_weight, the accuracy is 87.5%. I hope that answered your question.

@ThinkingSlow
Copy link
Author

Thank you soso much~~

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants