-
Notifications
You must be signed in to change notification settings - Fork 535
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is there a way to learn off-topic data? #153
Comments
I don't understand what you are trying to ask. |
If I understand right, you are trying to extend your dataset with data that is not labeled. In this case, your Precision and Recall will only increase for "O" (BILOU). Also, it may let your Named Entities Scores even worse. Give it a try and run the conneval script, it will clarify what I'm trying to explain. |
Unsubscribe
…On Tue, Jan 2, 2018 at 9:14 PM, Rafael Antonio Ribeiro Gomes < ***@***.***> wrote:
If I understand right, you are trying to extend your dataset with data
that is not labeled. In this case, your Precision and Recall will only
increase for "O" (BILOU). Also, it may let your Named Entities Scores even
worst. Give it a try and run the *conneval* script, it will clarify what
I'm trying to explain.
—
You are receiving this because you are subscribed to this thread.
Reply to this email directly, view it on GitHub
<#153 (comment)>, or mute
the thread
<https://github.com/notifications/unsubscribe-auth/AANybUWWqIPXGm4TuGFhI1Le3_OhuWzBks5tGuKFgaJpZM4PY1Qj>
.
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
My question is whether it is useful to train sentences that contain no named entities to increase the Precision and Recall. On this way it could be learned which sentences/context contain NE and which do not (like off-topic data). Or should I only provide trainings data containing named entities?
The text was updated successfully, but these errors were encountered: