Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to threshold by probability (end-to-end model)? #52

Closed
mnskim opened this issue Aug 8, 2021 · 3 comments
Closed

How to threshold by probability (end-to-end model)? #52

mnskim opened this issue Aug 8, 2021 · 3 comments

Comments

@mnskim
Copy link

mnskim commented Aug 8, 2021

Hello,
First of all, thank you for a fantastic code release!

I'm wondering how to threshold the results of end-to-end genre model - In the disambiguation model, since probability is given for the single entity being disambiguated, I'm able to get a good filtering with a single value of confidence, i.e 75% probability (by exponentiating the logprob),

But in the case of the end-to-end model I'm not sure how we should filter by confidence. Would it be sufficient to normalize by the number of entities found? Thanks!

@nicola-decao
Copy link
Contributor

Hello, I'm not sure what you want to do. The probability of the output is of the whole sequence (i.e. both mention detection and entity disambiguation).

@mnskim
Copy link
Author

mnskim commented Aug 9, 2021

Ah I see, from the output of the end-to-end model, I was hoping to threshold the entity disambiguation probability by some value like 75%, so that I can keep only the disambiguation results that the model is confident about - would this be possible?

@nicola-decao
Copy link
Contributor

Yes it is possible but it is up to you to define the rules for thresholding.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants