Recommend term deletion/modification once a specific TP/FP ratio has been reached #31
Comments
So we'll have |
I was kinda thinking of |
The correct ratios depend on the amount of spam during the measurement
On Wed, Nov 5, 2014 at 11:19 AM, Sam notifications@github.com wrote:
|
Yes, using the sens/spec (sensitivity/specificity) metrics would seem to be a much better solution. The exact ratios may need some adjusting; shall we start with <25% = recommend deletion, <50% = recommend review? |
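A minimal sketch of the starting thresholds suggested above, in Python. The function names are hypothetical, and the thread does not settle which score the percentages apply to (sensitivity, specificity, or some combination), so `recommend` simply takes one percentage. The sensitivity and specificity formulas are the standard ones, counting spam posts as the positive class.

```python
def sensitivity(tp: int, fn: int) -> float:
    """Percentage of actual spam that the term matched: TP / (TP + FN)."""
    total = tp + fn
    return 100.0 * tp / total if total else 0.0


def specificity(tn: int, fp: int) -> float:
    """Percentage of non-spam that the term left alone: TN / (TN + FP)."""
    total = tn + fp
    return 100.0 * tn / total if total else 0.0


def recommend(score_pct: float) -> str:
    """Map a percentage score to a suggestion (thresholds from the thread)."""
    if score_pct < 25:
        return "delete"   # <25% = recommend deletion
    if score_pct < 50:
        return "review"   # <50% = recommend review
    return "keep"
```

Both thresholds are starting points and would likely need tuning against real report data.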
spec + sens is always <= 200% .. or should be, if the estimates were at least somewhat sensible, which
On Wed, Nov 5, 2014 at 12:04 PM, Sam notifications@github.com wrote:
|
@Vincentyification I edited my comment after discovering a bug with the sens/spec calculations. |
@honnza Agreed. |
Would it be a good idea to keep a list of removed terms? This way, if a user attempts to re-add a term on said list, Pham would reply with how it was removed and the term's stats (preserved from the time of deletion), then listen for a y/n command on whether to add it back. |
Good idea. I'll add that too. |
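One way the removed-terms list could work, as a rough Python sketch. Everything here (`RemovedTerm`, `RemovedTermLog`, the method names, the prompt wording) is made up for illustration and is not taken from Pham's code.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class RemovedTerm:
    term: str
    how_removed: str  # free-text note, e.g. "manual" (hypothetical label)
    tp: int           # stats frozen at the time of deletion
    fp: int


class RemovedTermLog:
    def __init__(self) -> None:
        self._log: Dict[str, RemovedTerm] = {}

    def record_removal(self, term: str, how_removed: str, tp: int, fp: int) -> None:
        """Remember a deleted term together with its stats at deletion time."""
        self._log[term] = RemovedTerm(term, how_removed, tp, fp)

    def readd_prompt(self, term: str) -> Optional[str]:
        """Return a y/n prompt if the term was removed before, else None."""
        entry = self._log.get(term)
        if entry is None:
            return None
        return (f"'{term}' was removed before ({entry.how_removed}; "
                f"TP={entry.tp}, FP={entry.fp}). Add it back? (y/n)")

    def confirm_readd(self, term: str, answer: str) -> bool:
        """Return True only for a 'y' answer on a previously removed term."""
        if term not in self._log or answer.strip().lower() != "y":
            return False
        del self._log[term]  # the term is live again, so drop it from the list
        return True
```

A chat handler would call `readd_prompt` when someone tries to add a term, and `confirm_readd` once the y/n reply arrives.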
For example, whenever someone FP/TPs a report, Pham could quickly check the TP/FP ratio of each blacklist term found in it to determine whether the term is returning an unusually high number of FPs. If, say, the term's TP:FP ratio is below 1:5 (more than five FPs per TP), Pham would suggest that the term needs editing (to try to improve its "accuracy"). And if the ratio is worse still, below 1:10 (more than ten FPs per TP), Pham would suggest deleting the term.
Any thoughts?
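A rough Python sketch of that check, reading the ratios as "FPs per TP": more than 5 suggests editing, more than 10 suggests deletion. The function name and the handling of terms with zero TPs are assumptions, not something specified in this issue.

```python
def recommend_by_ratio(tp: int, fp: int,
                       edit_at: float = 5.0,
                       delete_at: float = 10.0) -> str:
    """Suggest "ok", "edit" or "delete" from a term's TP and FP counts."""
    if fp == 0:
        return "ok"        # no false positives recorded yet
    if tp == 0:
        return "delete"    # assumption: only FPs so far counts as the worst case
    fps_per_tp = fp / tp
    if fps_per_tp > delete_at:
        return "delete"
    if fps_per_tp > edit_at:
        return "edit"
    return "ok"


# Run after each FP/TP acknowledgement, for every term found in the report.
print(recommend_by_ratio(tp=3, fp=20))  # 20/3 is about 6.7 FPs per TP
```

A minimum number of reports before any suggestion is made would probably be sensible, since a single early FP would otherwise trigger "delete".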