Is there a way to learn off-topic data? #153

Open
fbaeumer opened this issue Sep 15, 2017 · 3 comments

Comments

@fbaeumer

fbaeumer commented Sep 15, 2017

My question is whether it is useful to train on sentences that contain no named entities in order to increase precision and recall. That way, the model could learn which sentences/contexts contain named entities and which do not (such as off-topic data). Or should I only provide training data that contains named entities?

@davisking
Contributor

I don't understand what you are trying to ask.

@grafael

grafael commented Jan 3, 2018

If I understand correctly, you are trying to extend your dataset with data that is not labeled. In that case, your precision and recall will only increase for the "O" tag (BILOU). It may also make your named entity scores worse. Give it a try and run the conlleval script; it will clarify what I'm trying to explain.
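
To illustrate the point about the "O" tag, here is a minimal sketch in Python. It is not the conlleval script and does not use any MITIE API; the helper names (`bilou_spans`, `entity_scores`, `token_accuracy`) and the tag sequences are made up for the example. It only shows the evaluation side: appending correctly predicted "O" tokens raises token-level accuracy while entity-level precision and recall stay where they were. Whether entity scores actually get worse after retraining is something only an experiment on your own data can show.

```python
from typing import List, Set, Tuple

# (start index, end index inclusive, entity type); hypothetical helper type
Span = Tuple[int, int, str]


def bilou_spans(tags: List[str]) -> Set[Span]:
    """Collect well-formed entity spans from a BILOU tag sequence."""
    spans: Set[Span] = set()
    open_span = None  # (start index, label) of a span begun by a B- tag
    for i, tag in enumerate(tags):
        prefix, _, label = tag.partition("-")
        if prefix == "U":
            spans.add((i, i, label))
            open_span = None
        elif prefix == "B":
            open_span = (i, label)
        elif prefix == "L" and open_span is not None and open_span[1] == label:
            spans.add((open_span[0], i, label))
            open_span = None
        elif prefix == "O":
            open_span = None
        # An "I-" tag simply keeps the current span open.
    return spans


def entity_scores(gold: List[str], pred: List[str]) -> Tuple[float, float]:
    """Entity-level (precision, recall) based on exact span matches."""
    g, p = bilou_spans(gold), bilou_spans(pred)
    true_positives = len(g & p)
    precision = true_positives / len(p) if p else 0.0
    recall = true_positives / len(g) if g else 0.0
    return precision, recall


def token_accuracy(gold: List[str], pred: List[str]) -> float:
    """Fraction of tokens whose predicted tag equals the gold tag."""
    return sum(a == b for a, b in zip(gold, pred)) / len(gold)


# Made-up evaluation data: the model finds the person but misses the organization.
gold = ["U-PER", "O", "B-ORG", "L-ORG"]
pred = ["U-PER", "O", "O", "O"]

# The same data extended with an off-topic sentence that has no entities.
gold_extended = gold + ["O"] * 6
pred_extended = pred + ["O"] * 6

print(token_accuracy(gold, pred), entity_scores(gold, pred))
print(token_accuracy(gold_extended, pred_extended),
      entity_scores(gold_extended, pred_extended))
```

In this toy case the token accuracy goes from 0.5 to 0.8, while entity-level precision and recall remain 1.0 and 0.5.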

@playfulart

playfulart commented Jan 3, 2018 via email

4 participants