
Major API change. Introducing Cleanlab 2.0 #128

Merged: 38 commits, Mar 16, 2022

Conversation

cgnorthcutt
Member

@cgnorthcutt cgnorthcutt commented Mar 10, 2022

CHANGELOG Cleanlab 1.0.1 --> 2.0:

At a high level, this update redesigns cleanlab to be scalable and extensible for growth (e.g., adding new ways to rank data and labels, new methods for computing data and label quality, and new tasks like regression and object detection). This update also simplifies most of the naming conventions, redesigning cleanlab to be more developer-friendly and less academic.

Module name changes:

  • pruning.py --> filter.py
  • latent_estimation.py --> count.py
  • parent module/folder models/ --> example_models/

New module created:

  • rank.py
    • moved all ranking and ordering functions from pruning.py/filter.py to here

Method name changes:

  • pruning.get_noise_indices() --> filter.find_label_issues()
  • count.num_label_errors() --> count.num_label_issues()

Methods added:

  • rank.py adds
    • two ranking functions that rank every example in a dataset by label quality (not just the examples with label issues)
    • get_self_confidence_for_each_label()
    • get_normalized_margin_for_each_label()
  • filter.py adds
    • two more filtering methods for filter.find_label_issues() (select via the filter_by parameter):
      • confident_learning, which has been shown to work very well and may become the default in the future, and
      • predicted_neq_given, which is useful for benchmarking a simple baseline approach but underperforms the other filter_by methods
  • classification.py adds
    • LearningWithNoisyLabels.get_label_issues()
      • canonical one-line usage: LearningWithNoisyLabels().fit(X, y).get_label_issues()
      • no need to compute predicted probabilities in advance
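The two new ranking scores are simple to state. Below is a minimal NumPy sketch (illustrative, not cleanlab's actual implementation) of what they compute, following the self-confidence and normalized-margin definitions quoted later in this thread:

```python
import numpy as np

def get_self_confidence_for_each_label(labels, pred_probs):
    """Quality score: the model's predicted probability of the given label."""
    return pred_probs[np.arange(len(labels)), labels]

def get_normalized_margin_for_each_label(labels, pred_probs):
    """Quality score: p(label = k) - max(p(label != k)); low margin = likely issue."""
    self_confidence = pred_probs[np.arange(len(labels)), labels]
    masked = pred_probs.copy()
    masked[np.arange(len(labels)), labels] = -np.inf  # exclude the given label
    return self_confidence - masked.max(axis=1)

# Toy example: 3 examples, 3 classes
labels = np.array([0, 1, 2])
pred_probs = np.array([
    [0.9, 0.05, 0.05],  # confident, correct-looking label
    [0.2, 0.3, 0.5],    # given label 1 is not the argmax -> negative margin
    [0.1, 0.1, 0.8],
])
print(get_self_confidence_for_each_label(labels, pred_probs))   # [0.9 0.3 0.8]
print(get_normalized_margin_for_each_label(labels, pred_probs)) # [0.85 -0.2 0.7]
```

Sorting a dataset by either score (ascending) orders it from most to least likely to contain a label issue.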

Naming conventions changed in method names, comments, parameters, etc.

  • s --> labels
  • psx -> pred_probs
  • label_errors --> label_issues
  • noise_mask --> label_issues_mask
  • label_errors_bool --> label_issues_mask
  • prune_method --> filter_by
  • prob_given_label --> self_confidence
  • pruning --> filtering

Parameter re-ordering:

  • re-ordered (labels, pred_probs) parameters to be consistent (in that order) in all methods.
  • re-ordered parameters (e.g. frac_noise) in filter.find_label_issues()

Parameter changes:

  • in order_label_issues()
    • param: sorted_index_method --> rank_by
  • in find_label_issues()
    • param: sorted_index_method --> return_indices_ranked_by
    • param: prune_method --> filter_by

Global variables changed:

  • filter.py
    • Only require 1 example to be left in each class
    • MIN_NUM_PER_CLASS = 5 --> MIN_NUM_PER_CLASS = 1
    • enables cleanlab to work for toy-sized datasets

@codecov-commenter

codecov-commenter commented Mar 10, 2022

Codecov Report

Merging #128 (71a79d7) into master (b1ea583) will increase coverage by 0.38%.
The diff coverage is 94.80%.

@@            Coverage Diff             @@
##           master     #128      +/-   ##
==========================================
+ Coverage   86.50%   86.89%   +0.38%     
==========================================
  Files          12       11       -1     
  Lines         956      908      -48     
  Branches      163      166       +3     
==========================================
- Hits          827      789      -38     
+ Misses        115      103      -12     
- Partials       14       16       +2     
| Impacted Files | Coverage | Δ |
| --- | --- | --- |
| cleanlab/coteaching.py | 0.00% <ø> | (ø) |
| cleanlab/utils/util.py | 100.00% <ø> | (ø) |
| cleanlab/version.py | 100.00% <ø> | (ø) |
| cleanlab/filter.py | 91.77% <91.77%> | (ø) |
| cleanlab/count.py | 93.96% <96.22%> | (ø) |
| cleanlab/__init__.py | 100.00% <100.00%> | (ø) |
| cleanlab/classification.py | 100.00% <100.00%> | (+2.24%) ⬆️ |
| cleanlab/example_models/mnist_pytorch.py | 97.61% <100.00%> | (ø) |
| cleanlab/noise_generation.py | 97.18% <100.00%> | (-2.12%) ⬇️ |
| cleanlab/rank.py | 100.00% <100.00%> | (ø) |
| ... and 6 more | | |


Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update b1ea583...71a79d7.

Review threads (outdated, resolved): cleanlab/rank.py, cleanlab/filter.py
Method to order label error indices (instead of a bool mask), either:
'normalized_margin' := normalized margin (p(s = k) - max(p(s != k)))
'prob_given_label' := [psx[i][labels[i]] for i in label_errors_idx]
psx : np.array (shape (N, K))
Contributor


I prefer pred_probs. Most intuitive name for me.

cleanlab/rank.py Outdated Show resolved Hide resolved
@cgnorthcutt
Member Author

@jwmueller @calebchiam @JohnsonKuan - Note that sometimes prob_given_label is used and other times self_confidence is used throughout the repo. Changing to self_confidence everywhere.

Review threads (outdated, resolved): cleanlab/util.py, README.md
Member

@anishathalye anishathalye left a comment


In some functions, we are switching parameter order in this release. Related to that: can we switch to using kw-only arguments for a lot of functionality where we don't expect people to use positional arguments? E.g. the signature of find_label_issues can be def find_label_issues(labels, pred_probs, *, confident_joint=None, ...). This way, we'll have freedom to move things around / insert parameters as we please, and we won't break any client code.
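The keyword-only idea is easy to demonstrate; here is a minimal sketch of such a signature (illustrative parameter set and defaults, not the final API):

```python
# Parameters after the bare `*` must be passed by name, so later releases can
# reorder or insert parameters without breaking any client code.
def find_label_issues(labels, pred_probs, *, confident_joint=None, filter_by="prune_by_noise_rate"):
    return {"confident_joint": confident_joint, "filter_by": filter_by}

# Positional use of the two primary arguments still works:
find_label_issues([0, 1], [[0.9, 0.1], [0.2, 0.8]])

# But passing confident_joint positionally raises a TypeError:
try:
    find_label_issues([0, 1], [[0.9, 0.1], [0.2, 0.8]], None)
except TypeError as e:
    print("rejected:", e)
```

This makes the (labels, pred_probs) re-ordering in this release the last positional breakage clients would ever see for these functions.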

There are a number of weird issues in the diff, probably from a search-and-replace or from an imprecise refactoring tool. I highlighted a couple of them. As we get closer to finalizing this PR, we should take a close line-by-line look at the diff to make sure we haven't missed any.


```python
# Compute psx (n x m matrix of predicted probabilities) on your own, with any classifier.
# Here is an example that shows in detail how to compute psx on CIFAR-10:
# Compute pred_probs (n x m matrix of predicted probabilities) on your own, with any classifier.
```
Member


Should we be careful to always mention "out-of-sample predicted probabilities" every time we say pred_probs? Or alternatively, come up with a different name for pred_probs that makes it more obvious that these should be out-of-sample? We've seen a couple examples of people doing the wrong thing in the wild, training on the dataset and then just evaluating the trained model on that same dataset to compute psx to feed into cleanlab.
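Whatever the parameter ends up being called, the out-of-sample requirement is straightforward to satisfy with cross-validation. A hedged sketch using scikit-learn (not part of this PR; dataset and model are placeholders):

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import cross_val_predict

X, labels = make_classification(n_samples=200, n_classes=3, n_informative=4, random_state=0)

# cross_val_predict guarantees each row of pred_probs comes from a model that
# never saw that example during training (held-out / out-of-sample), which is
# exactly what cleanlab expects -- NOT predictions from a model fit on all of X.
pred_probs = cross_val_predict(
    LogisticRegression(max_iter=1000), X, labels, cv=5, method="predict_proba"
)
print(pred_probs.shape)  # one probability per class, per example
```

Training on the full dataset and scoring it with the same model inflates self-confidence on mislabeled points, which is the in-the-wild mistake described above.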

Member Author


@anishathalye oos_pred_probs? or pred_probs_cv? or heldout_pred_probs

Member Author


I might want to skip this change. The comment just below that line already says:
# Be sure you compute probs in a holdout/out-of-sample manner (e.g. via cross-validation)

ordered_label_issues = find_label_issues(
    labels=numpy_array_of_noisy_labels,
    pred_probs=numpy_array_of_predicted_probabilities,
    return_indices_ranked_by='normalized_margin',  # Orders label issues
)
Member


Minor point, but I prefer the imperative rank_by=.

Member Author


In our previous discussion, we agreed we should signal that this changes the format of the return value from a bool mask to indices. Hence return_indices_ranked_by was a good choice over just rank_by, which doesn't clearly indicate that the return type is changing (and will be a different length as well).

Member Author


Note that all the methods in rank.py use rank_by. Only this method which changes the return type uses a different parameter name, to make that clear.
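The distinction being drawn here can be shown in a few lines of NumPy (illustrative sketch with a made-up threshold, not cleanlab's code): a bool mask only flags issues, while return-indices-ranked-by also orders the flagged examples by a quality score, so both the type and the length of the result change.

```python
import numpy as np

labels = np.array([0, 1, 2, 1])
pred_probs = np.array([
    [0.9, 0.05, 0.05],
    [0.6, 0.3, 0.1],    # label 1 looks wrong (self-confidence 0.3)
    [0.1, 0.2, 0.7],
    [0.45, 0.5, 0.05],  # label 1 plausible but low-confidence
])

# A bool mask flags *which* examples are issues (same length as the dataset).
self_confidence = pred_probs[np.arange(len(labels)), labels]
label_issues_mask = self_confidence < 0.6
print(label_issues_mask)  # [False  True False  True]

# Ranked indices additionally order the issues from worst to best
# (different type, different length than the mask).
issue_indices = np.flatnonzero(label_issues_mask)
ranked = issue_indices[np.argsort(self_confidence[issue_indices])]
print(ranked)  # [1 3] -- example 1 (0.3) ranks before example 3 (0.5)
```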

Review threads (outdated, resolved): cleanlab/coteaching.py, cleanlab/example_models/README.md, cleanlab/filter.py
@cgnorthcutt cgnorthcutt reopened this Mar 16, 2022
@cgnorthcutt cgnorthcutt merged commit 8f9f3f5 into cleanlab:master Mar 16, 2022
6 participants