How to threshold by probability (end-to-end model)? #52

mnskim · 2021-08-08T16:56:49Z

Hello,
First of all, thank you for a fantastic code release!

I'm wondering how to threshold the results of the end-to-end GENRE model. In the disambiguation model, the probability is given for the single entity being disambiguated, so I get good filtering with a single confidence value, e.g. 75% probability (obtained by exponentiating the logprob).

But in the case of the end-to-end model, I'm not sure how we should filter by confidence. Would it be sufficient to normalize the sequence logprob by the number of entities found? Thanks!
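
To make the question concrete, here is a minimal sketch of both ideas. It assumes each prediction is a dict with `text` and `logprob` keys holding plain floats; that shape, and the helper names `keep_confident` and `per_entity_probability`, are illustrative and not part of the released code. Dividing by the entity count is only a heuristic, not a calibrated per-entity probability.

```python
import math


def keep_confident(candidates, threshold=0.75):
    """Keep disambiguation candidates whose probability meets the threshold.

    Each candidate is assumed to be a dict with a float "logprob" entry.
    """
    return [c for c in candidates if math.exp(c["logprob"]) >= threshold]


def per_entity_probability(sequence_logprob, num_entities):
    """Heuristic: spread the whole-sequence logprob evenly over the entities.

    This is the geometric mean of the sequence probability over the entities,
    not a true per-entity probability.
    """
    if num_entities <= 0:
        raise ValueError("num_entities must be positive")
    return math.exp(sequence_logprob / num_entities)
```

For example, `keep_confident(preds)` would drop a candidate with probability 0.5 and keep one with probability 0.9, while `per_entity_probability(math.log(0.25), 2)` gives 0.5 for each of two entities.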

nicola-decao · 2021-08-09T08:36:26Z

Hello, I'm not sure what you want to do. The output probability is for the whole sequence (i.e. both mention detection and entity disambiguation).
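
As a rough illustration of why a single score cannot be attributed to one entity: in an autoregressive model the sequence log-probability is the sum of the per-token log-probabilities, so every generated token contributes to it. The token values below are made up, and whether the score a given setup reports is additionally length-normalized depends on the generation settings, which this sketch does not model.

```python
import math


def sequence_logprob(token_logprobs):
    """Sum per-token log-probabilities into one sequence log-probability."""
    return sum(token_logprobs)


# Made-up per-token probabilities for a three-token output.
example_token_logprobs = [math.log(0.9), math.log(0.8), math.log(0.5)]
example_sequence_prob = math.exp(sequence_logprob(example_token_logprobs))
```

Here `example_sequence_prob` is the product 0.9 × 0.8 × 0.5, so one uncertain token lowers the score of the entire output.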

mnskim · 2021-08-09T08:57:01Z

Ah, I see. From the output of the end-to-end model, I was hoping to threshold the entity disambiguation probability at some value like 75%, so that I keep only the disambiguation results the model is confident about. Would this be possible?

nicola-decao · 2021-08-13T08:26:57Z

Yes, it is possible, but it is up to you to define the rules for thresholding.
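
One possible rule, sketched under strong assumptions: if per-token log-probabilities are available and you know which output tokens spell each predicted entity name, you could score each entity by the product of its own token probabilities and threshold that. Obtaining token-level scores and the token spans is not shown here, and `EntitySpan` and `filter_entities` are hypothetical names, not part of the library.

```python
import math
from typing import NamedTuple


class EntitySpan(NamedTuple):
    name: str
    start: int  # index of the first entity token (inclusive)
    end: int    # index one past the last entity token (exclusive)


def filter_entities(token_logprobs, spans, threshold=0.75):
    """Keep entities whose own tokens have joint probability >= threshold."""
    kept = []
    for span in spans:
        prob = math.exp(sum(token_logprobs[span.start:span.end]))
        if prob >= threshold:
            kept.append((span.name, prob))
    return kept
```

With this rule, an entity whose two tokens each have probability 0.9 scores 0.81 and is kept at a 75% threshold, while one with token probabilities 0.6 and 0.9 scores 0.54 and is dropped. Longer entity names are penalized by the product, so a per-token average is another rule to consider.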

nicola-decao closed this as completed Aug 13, 2021
