
Inconsistent behavior in averaging choices micro and samples. #10706

Open
amueller opened this issue Feb 26, 2018 · 2 comments

Comments

@amueller (Member)

For the multi-class case, precision, recall and F-score with micro averaging all produce accuracy, while with samples they produce an error.
That seems inconsistent. Using the definitions in the docs, they should all be accuracy, I think.

I'd propose deprecating micro averaging for multiclass.

The docs actually give an example of micro-averaged recall for multiclass, which is really weird imho.
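A minimal hand-rolled sketch (not scikit-learn's implementation, hypothetical data) of why this happens: in single-label multiclass, every wrong prediction is simultaneously one false positive (for the predicted class) and one false negative (for the true class), so the pooled micro-averaged precision and recall both collapse to accuracy.

```python
# Micro-averaging pools TP/FP/FN over all classes before dividing.
y_true = [0, 1, 2, 2, 1, 0, 1, 2]
y_pred = [0, 2, 2, 1, 1, 0, 1, 0]

# In single-label multiclass, each sample contributes exactly one
# prediction: a correct one is a TP; a wrong one is an FP for the
# predicted class AND an FN for the true class.
tp = sum(1 for t, p in zip(y_true, y_pred) if t == p)
fp = sum(1 for t, p in zip(y_true, y_pred) if t != p)
fn = sum(1 for t, p in zip(y_true, y_pred) if t != p)

micro_precision = tp / (tp + fp)
micro_recall = tp / (tp + fn)
accuracy = tp / len(y_true)

# micro_precision == micro_recall == accuracy (and hence micro F1 too)
print(micro_precision, micro_recall, accuracy)
```

Since precision and recall coincide, the micro F1 (their harmonic mean) is the same number again.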

@amueller (Member, Author) commented Feb 26, 2018

The docs at the top actually recommend micro average for multi-class:

Micro-averaging may be preferred in multilabel settings,
including multiclass classification where a majority class is to be ignored.

That seems weird to me, given that it's just accuracy.

@jnothman (Member)

The wording could be clearer, but the intention there is that using multiclass with a majority class ignored (e.g. labels=np.setdiff1d(classes_, 'default class')) will return something other than accuracy. I think it is hard to deprecate because of how it is used in classification_report and elsewhere.
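A sketch of that point (hand-rolled micro averaging, hypothetical data, not scikit-learn's code): once a majority class is dropped from the label set, the per-sample accounting is no longer symmetric, so the pooled precision and recall diverge from accuracy.

```python
# Class 0 plays the role of the majority "default" class to be ignored.
y_true = [0, 0, 0, 0, 0, 1, 2, 1]
y_pred = [0, 0, 0, 1, 0, 1, 2, 2]
labels = [1, 2]  # restrict micro averaging to the minority classes

tp = fp = fn = 0
for c in labels:
    tp += sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
    fp += sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
    fn += sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)

micro_precision = tp / (tp + fp)   # 2 / 4  = 0.5
micro_recall = tp / (tp + fn)      # 2 / 3  ≈ 0.667
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)  # 6 / 8 = 0.75

# With class 0 excluded, micro precision and recall no longer match
# each other or the plain accuracy.
print(micro_precision, micro_recall, accuracy)
```

So the docs' recommendation is not vacuous once `labels` excludes a class; it is only in the all-classes multiclass case that micro averaging degenerates to accuracy.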
