Regarding results #2
Comments
Hi,
Nope. With this repo I got about the same, roughly 10% lower. Definitely nothing close to the 70s. In fact, it's worse than LSTMs. I am trying to reimplement the model myself from scratch, and there too the results seem to be around 65. I have tried various hyperparameters, including different optimizers, learning rates, etc. Still the same.
@vanzytay As I've mentioned in the README, I guess the difference in the results is due to the unreported hyperparameters. But now that the original author has released his code, I've added the task of reaching the state-of-the-art performance to my to-do list.
As I just glanced through the author's code, I see some serious differences:
Can you share where the original author's code is located? Thank you.
Thanks @ganeshjawahar, unfortunately it is written in Java :(
I tried to improve the performance by tweaking the model and parameters, and finally got an accuracy of 0.69 for Laptop and 0.77 for Restaurant, which is still not perfect, but much closer to those of the original paper (0.72 for Laptop, 0.809 for Restaurant). You can check out my fix here https://github.com/pcgreat/mem_absa if you are interested. @ganeshjawahar thanks for your great implementation, especially for the attention part. I also want to take a look at the original author's code, but the link is invalid now. Would you please share it with me if the license permits?
@pcgreat The original author's code can't be downloaded anymore. Do you have a copy of the code?
@pcgreat In your code I noticed that you feed the target (the sentiment ground truth) to the graph in the test method (model.py). Can you explain why, please?
@nkooli Yeah, the target is fed into the model to calculate the test loss. But don't worry, the trained model won't be changed by this, as long as the
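For anyone following along, here is a minimal sketch of the general idea that feeding labels at test time only to report a loss does not, by itself, change a model. It is a toy one-weight example in plain Python, not code from either repository; the names `predict`, `loss`, `grad`, `train_step`, and `evaluate` are hypothetical and chosen only for illustration.

```python
# Toy illustration (hypothetical, not from mem_absa): a one-weight linear
# model with mean squared error. Labels are needed to compute the loss,
# but the weight only changes when an explicit update step is applied.

def predict(w, x):
    return w * x

def loss(w, xs, ys):
    # Mean squared error over the batch.
    return sum((predict(w, x) - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def grad(w, xs, ys):
    # Derivative of the mean squared error with respect to w.
    return sum(2 * (predict(w, x) - y) * x for x, y in zip(xs, ys)) / len(xs)

def train_step(w, xs, ys, lr=0.1):
    # The only place where the weight is updated.
    return w - lr * grad(w, xs, ys)

def evaluate(w, xs, ys):
    # Uses the labels ys only to report a loss; returns no new weight.
    return loss(w, xs, ys)

w_before = 0.5
xs, ys = [1.0, 2.0], [2.0, 4.0]

test_loss = evaluate(w_before, xs, ys)      # labels fed, weight untouched
w_after_eval = w_before
w_after_train = train_step(w_before, xs, ys)  # weight changes only here
```

In a graph-based framework the analogous distinction would be between fetching only the loss and fetching the update operation as well, but how that applies to model.py is an assumption on my part.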
@ganeshjawahar The code at http://ir.hit.edu.cn/~dytang/paper/aspect_memnet/src.zip cannot be downloaded. Would you please share it with me, if possible? Thanks.
@pcgreat I downloaded the code from https://github.com/pcgreat/mem_absa, and the result does not seem to match what you reported (I get 0.7 for Restaurant). You mentioned in the README that the model can improve the accuracy by 8% and that the Restaurant accuracy is around 78%. Do you have any suggestions for me?
Hi,
I was wondering if you managed to get any results close to the paper with this repository?
I downloaded the code and seem to get way worse results than the paper. Is it because of the unreported hyperparameters?
Thanks!