Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

training model for WLP -- stuck in suboptimal solution #65

Open
nupoorgandhi opened this issue Aug 24, 2023 · 0 comments
Open

training model for WLP -- stuck in suboptimal solution #65

nupoorgandhi opened this issue Aug 24, 2023 · 0 comments

Comments

@nupoorgandhi
Copy link

I'm trying to train the relation extraction model for the Wet Labs Protocol dataset. My loss stays fairly constant, and the model always predicts no relation between each span pair. When I look at the logits I can see that for each relation, the score is pretty much the same for all the examples, so it doesn't seem like anything is being learned. The entity extraction task is working for the WLP dataset, and I am certain that the data format is correct. I have tried learning rates between 1e-3 and 1e-7, and batch sizes 1 and 32. Does anyone have suggestions for how to debug?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant