This sample invariance test is currently applied to the ranking tests, and that seems strange.
For ranking, the scores should have a different shape and be integers (right now y_true and y_pred are binary, so they are valid scores, but it's not a great test).
Also, the same should be tested for multi-class.
Via #10481.
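To make the concern concrete, here is a minimal sketch of a sample-order invariance check that feeds continuous scores rather than binary predictions to a thresholded metric. It is an illustration only, not the code in the scikit-learn test suite: `pairwise_auc` is a hand-rolled stand-in for a metric such as `roc_auc_score`, and `check_sample_order_invariance` is a hypothetical helper name.

```python
import random


def pairwise_auc(y_true, y_score):
    """Stand-in thresholded metric: fraction of (positive, negative)
    pairs where the positive gets the higher score; ties count as 0.5."""
    pos = [s for t, s in zip(y_true, y_score) if t == 1]
    neg = [s for t, s in zip(y_true, y_score) if t == 0]
    if not pos or not neg:
        raise ValueError("both classes must be present")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def check_sample_order_invariance(metric, y_true, y_score, seed=0):
    """Hypothetical helper: shuffle samples jointly and compare the metric."""
    order = list(range(len(y_true)))
    random.Random(seed).shuffle(order)
    shuffled_true = [y_true[i] for i in order]
    shuffled_score = [y_score[i] for i in order]
    before = metric(y_true, y_score)
    after = metric(shuffled_true, shuffled_score)
    return abs(before - after) < 1e-12


# Continuous scores instead of binary predictions, so the metric's
# handling of real-valued input is actually exercised.
rng = random.Random(42)
y_true = [i % 2 for i in range(50)]
y_score = [rng.random() for _ in range(50)]

print(check_sample_order_invariance(pairwise_auc, y_true, y_score))
```

With binary `y_pred` the same check would still pass, but it would only ever compare a handful of tied score values, which is why it is a weak test for score-based metrics.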
Similarly test_thresholded_invariance_string_vs_numbers_labels uses binary outcomes instead of scores and doesn't use multiclass data.
amueller changed the title from "test_sample_order_invariance in common metrics tests applied to ranking" to "test_sample_order_invariance in common metrics tests applied to threshold metrics" on Oct 5, 2018
I'm confused by your notion of ranking. I'm pretty sure ranking as far as our metrics are concerned is about having the estimator assign a score to each candidate label, and for the ranking to then be implied, not for the rank to be provided necessarily as an integer.
Not sure exactly what you're looking for here. But yes, it can be hard to know if a particular metric is being fully covered by the common tests.
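As a small illustration of the point about rankings being implied by scores, the sketch below derives a label ranking from real-valued scores and shows that a strictly increasing transformation of the scores leaves that ranking unchanged. The helper name `implied_ranking` and the example values are assumptions made for this illustration, not part of scikit-learn.

```python
import math


def implied_ranking(scores):
    """Return label indices ordered from highest to lowest score."""
    return sorted(range(len(scores)), key=lambda i: -scores[i])


# Real-valued scores for four candidate labels of one sample.
scores = [0.2, 1.7, -0.4, 0.9]
ranking = implied_ranking(scores)

# exp is strictly increasing, so the implied ranking is the same.
transformed = [math.exp(s) for s in scores]
same_ranking = implied_ranking(transformed) == ranking

print(ranking, same_ranking)
```

A metric that depends only on this implied ranking never needs integer ranks as input, so float scores are a sufficient test input.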