[RuleMetrics] Overall precision should be calculated from overall correct/incorrect #1045

For example, in the screenshot below, the overall precision should be 170 / (170 + 45) = 0.79.

I think this is more meaningful than the average of the per-rule precisions, since it weights each rule's precision by its annotated coverage. For example, if I have one 100% precise rule that covers almost the whole dataset and one 0% precise rule that covers only a single record, the current metric would show a 50% precision, even though the weak labels from a simple majority voter would almost always be correct.

Right now, I think we compute the average of the per-rule precisions, counting the NaNs as 0, which seems like a bug.

@frascuchon @leiyre Not sure who to assign this one to?
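For illustration, here is a minimal sketch of the difference between the two computations. The rule names and per-rule counts are made up (chosen so the totals match the 170/45 example above); this is not Rubrix's actual code:

```python
# Minimal sketch of the proposed fix (illustrative numbers, not Rubrix's code).
# Per-rule (correct, incorrect) counts on the annotated records.
rules = {
    "rule_a": (150, 20),  # high-coverage, high-precision rule
    "rule_b": (20, 25),   # low-coverage, noisy rule
    "rule_c": (0, 0),     # rule with no annotated coverage -> precision is NaN
}

# Current (buggy) behavior: average the per-rule precisions, treating NaN as 0.
per_rule = [c / (c + i) if (c + i) > 0 else 0.0 for c, i in rules.values()]
buggy_avg = sum(per_rule) / len(per_rule)

# Proposed behavior: sum correct/incorrect across rules first, then divide once.
total_correct = sum(c for c, _ in rules.values())
total_incorrect = sum(i for _, i in rules.values())
overall = total_correct / (total_correct + total_incorrect)

print(f"average of precisions (NaN as 0): {buggy_avg:.2f}")  # 0.44
print(f"overall precision:                {overall:.2f}")    # 0.79
```

Note that the summed version is algebraically a coverage-weighted average of the per-rule precisions (weighted by correct + incorrect), so zero-coverage rules simply drop out instead of dragging the average down.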
Comments
dcfidalgo added the type: bug label (Indicates an unexpected problem or unintended behavior) on Jan 27, 2022
leiyre added a commit that referenced this issue on Feb 2, 2022
frascuchon pushed a commit that referenced this issue on Feb 2, 2022
frascuchon pushed a commit that referenced this issue on Feb 2, 2022
frascuchon pushed a commit that referenced this issue on Feb 2, 2022
leiyre added a commit that referenced this issue on Feb 2, 2022
frascuchon pushed a commit that referenced this issue on Feb 2, 2022
frascuchon pushed a commit that referenced this issue on Feb 2, 2022
dvsrepo added a commit that referenced this issue on Feb 10, 2022
* 'master' of https://github.com/recognai/rubrix: (33 commits)
  fix(#1045): fix overall precision (#1087)
  fix(#1081): prevent add records of different task (#1085)
  fix(#1045): calculate overall precision from overall correct/incorrect in rules (#1086)
  fix(#924): parse new error format in UI (#1082)
  fix(#1054): Optimize Long records (#1080)
  docs(#949): change note to admonition (#1071)
  fix(#1053): metadata modal position (#1068)
  fix(#1067): fix rule definition link when no labels are defined (#1069)
  fix(#1065): 'B' tag for beginning tokens (#1066)
  feat(#1054): optimize long records view (#1064)
  feat(#924): parse validation error, including submitted information (#1056)
  fix(#1058): sort by % data in rules list (#1062)
  fix(#1050): generalizes entity span validation (#1055)
  fix: missing Optional import
  fix(cleanlab): set cleanlab n_jobs=1 as default (#1059)
  feat(#982): Show filters in labelling rules view (#1038)
  feat(#932): label models now modify the prediction_agent when calling LabelModel.predict (#1049)
  fix(#821): Token classifier QA 2 (#1057)
  ci: fix path filter condition
  refactor(#924): normalize API error responses (#1031)
  ...