This repo cannot reproduce the result of original paper #49

Open
DC-Swind opened this issue Dec 23, 2018 · 2 comments

Comments

@DC-Swind

DC-Swind commented Dec 23, 2018

Thank you for your implementation; it has been very helpful to me.
I ran this code and got similar results when the number of heads equals 1. However, I cannot reproduce the result of the original paper (73.6/82.7) when I use 8 heads, a batch size of 32, 150k training steps, and a char dimension of 200 (the same settings as the original paper). I only get around 71.27/80.58.
The same thing occurred when I ran the PyTorch repo (https://github.com/andy840314/QANet-pytorch-).

Any suggestions?

@JACKHAHA363

Have you been able to reproduce the original paper's result?

@DC-Swind
Author

I tried my best, but the result was still two points below.
