Multi-label classifiers and recommendation for prune method #31

Closed
avostryakov opened this issue Jun 22, 2020 · 2 comments
Labels
question A question for Cleanlab maintainers

Comments

@avostryakov

Thanks for cleanlab!

Do you have examples of multi-label classification tasks? I'm curious what prune method to use in this case: prune by noise rate, prune by class, or both?

@sylvainma

sylvainma commented Aug 13, 2020

I am also looking for an example where the target y is multi-output:

[[0 1 0 0 1], 
 [1 0 0 0 1], 
 ...]

Any trick?

@anishathalye anishathalye added the question A question for Cleanlab maintainers label Mar 28, 2022
@jwmueller jwmueller assigned aditya1503 and unassigned aditya1503 Dec 23, 2022
@jwmueller
Member

In case there is any chance this information is still useful to you:

In cleanlab v2.2, we did a major overhaul of the functionality for multi-label classification. cleanlab's new label error detection capabilities for multi-label classification are much more powerful and easy to use!

Get started here: https://docs.cleanlab.ai/stable/tutorials/multilabel_classification.html

This tutorial demonstrates how to handle multi-output targets in the format you have.
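
For readers landing here, a minimal sketch of the idea (not taken from the tutorial verbatim). It assumes cleanlab >= 2.2, where multi-label `labels` are passed as one list of class indices per example, so a binary indicator matrix like the one above is converted first. The helper names `multihot_to_class_lists` and `find_multilabel_issues` are hypothetical, and `pred_probs` is assumed to be an out-of-sample array of shape (n_examples, n_classes) from your own model:

```python
from typing import List, Sequence


def multihot_to_class_lists(y: Sequence[Sequence[int]]) -> List[List[int]]:
    """Convert binary indicator rows into per-example lists of class indices."""
    return [[k for k, flag in enumerate(row) if flag] for row in y]


def find_multilabel_issues(labels: List[List[int]], pred_probs):
    """Rank examples by how likely they are mislabeled (assumes cleanlab >= 2.2)."""
    # Imported lazily so the conversion helper works without cleanlab installed.
    from cleanlab.filter import find_label_issues

    return find_label_issues(
        labels=labels,
        pred_probs=pred_probs,  # assumed shape: (n_examples, n_classes)
        multi_label=True,
        return_indices_ranked_by="self_confidence",
    )


# The two rows shown earlier in this thread.
y = [[0, 1, 0, 0, 1],
     [1, 0, 0, 0, 1]]
labels = multihot_to_class_lists(y)
print(labels)  # [[1, 4], [0, 4]]
```

Calling `find_multilabel_issues(labels, pred_probs)` should then return the indices of examples flagged as likely label errors. Check the tutorial for the exact arguments supported by your installed version.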

We also published a paper detailing the new algorithms cleanlab v2.2 uses to estimate label errors in multi-label datasets:
Identifying Incorrect Annotations in Multi-Label Classification Data

Through extensive benchmarking, this paper answers the question of which label error detection algorithms to use for multi-label data. The recommended algorithms are the cleanlab defaults as of v2.2.

And definitely please open a new issue (stating which cleanlab version you are using) if you still encounter any problems using cleanlab for multi-label datasets!

5 participants