Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add multi-label support to the confusion matrix metric #3452

Closed
arjoly opened this issue Jul 20, 2014 · 12 comments · Fixed by #11179
Closed

Add multi-label support to the confusion matrix metric #3452

arjoly opened this issue Jul 20, 2014 · 12 comments · Fixed by #11179
Labels
Easy Well-defined and straightforward way to resolve Enhancement

Comments

@arjoly
Copy link
Member

arjoly commented Jul 20, 2014

Currently the confusion_matrix support binary and multi-class classification, but not multi-label data yet.

@jnothman
Copy link
Member

What does confusion matrix mean in a multilabel context?

On 20 July 2014 20:34, Arnaud Joly notifications@github.com wrote:

Currently the confusion_matrix support binary and multi-class
classification, but not multi-label data yet.


Reply to this email directly or view it on GitHub
#3452.

@arjoly
Copy link
Member Author

arjoly commented Jul 20, 2014

The same as in the multiclass case, but the notion of true positive, true negative, false negative are different the multi-label case.

For example, this is explained in http://www.cnts.ua.ac.be/~vincent/pdf/microaverage.pdf

@jnothman
Copy link
Member

Okay, not actually a confusion matrix in the multiclass sense (which label
is confused with which), but a binarized error analysis.

On 20 July 2014 22:50, Arnaud Joly notifications@github.com wrote:

The same as in the multiclass case, but the notion of true positive,
true negative, false negative are different the multi-label case.

For example, this is explain
http://www.cnts.ua.ac.be/~vincent/pdf/microaverage.pdf


Reply to this email directly or view it on GitHub
#3452 (comment)
.

@arjoly
Copy link
Member Author

arjoly commented Jul 20, 2014

Okay, not actually a confusion matrix in the multiclass sense (which label
is confused with which), but a binarized error analysis.

Then we might want to add a function. It might be binarized_confusion_matrix or multilabel_confusion_matrix.

Magellanea added a commit to Magellanea/scikit-learn that referenced this issue Aug 31, 2014
@Magellanea
Copy link

@arjoly @jnothman This implements the multilabel_confusion_matrix function which computes the values True positives, true negatives, false positives and false negatives for a multi-label classification problem, was that what you expected to be the behavior of the multilabel_confusion_matrix

raghavrv pushed a commit to raghavrv/scikit-learn that referenced this issue Jan 19, 2015
raghavrv pushed a commit to raghavrv/scikit-learn that referenced this issue Jan 20, 2015
raghavrv pushed a commit to raghavrv/scikit-learn that referenced this issue Jan 27, 2015
raghavrv pushed a commit to raghavrv/scikit-learn that referenced this issue Feb 1, 2015
raghavrv pushed a commit to raghavrv/scikit-learn that referenced this issue Feb 4, 2015
@Florents-Tselai
Copy link

What's the status on this?

@arjoly
Copy link
Member Author

arjoly commented Sep 28, 2015

The pr #3452 has stalled feel free to continue if you want to finish it.

@pramitchoudhary
Copy link

Is there any plan, or is someone working on pr #3452 ?

@Spandan-Madan
Copy link

Spandan-Madan commented Apr 5, 2017

I am planning to add multi label classification metrics, including confusion matrix. There is a NIPS 2012 paper which explains great metrics for multi-label context. Will put in the effort if I think there's enough people who still need it. Is it still needed? Or is to not that important for the community?

@ChristianSch
Copy link

ChristianSch commented Apr 5, 2017

Which paper exactly are you referring to? I find the metrics quite ambiguous in general use. Some use F_1/precision/recall/subset accuracy micro/macro and some use other metrics. (I use the former)

I'm always interested in bringing more native ml support to sklearn. But also be aware of other approaches like scikit-multilearn, or my own small skml for scikit compatible multi label classification. I find the things for ml in sklearn lacking and quite unusable.

@arjoly
Copy link
Member Author

arjoly commented Apr 13, 2017

Adding the most important multi-label algorithms is very interesting in my opinion. This is indeed a very long term project.

@carlthome
Copy link

@Spandan-Madan, which NIPS 2012 paper are you thinking of?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Easy Well-defined and straightforward way to resolve Enhancement
Projects
None yet
8 participants