
[Metrics] Confusion matrix class interface #4348

Merged
merged 32 commits into Lightning-AI:master from metrics/confusion_matrix on Oct 30, 2020

Conversation

SkafteNicki
Member

What does this PR do?

  • Adds back the confusion matrix class interface
  • Unifies it with the functional interface
  • Updates the old tests to the new interface

Before submitting

  • Was this discussed/approved via a GitHub issue? (not needed for typos and docs improvements)
  • Did you read the contributor guideline, Pull Request section?
  • Did you make sure your PR does only one thing, instead of bundling different changes together? Otherwise, we ask you to create a separate PR for every change.
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?
  • Did you verify new and existing tests pass locally with your changes?
  • If you made a notable change (that affects users), did you update the CHANGELOG?

PR review

  • Is this pull request ready for review? (if not, please submit in draft mode)

Anyone in the community is free to review the PR once the tests have passed.
If we didn't discuss your PR in GitHub issues, there's a high chance it will not be merged.

Did you have fun?

Make sure you had fun coding 🙃

@SkafteNicki SkafteNicki added the feature and Metrics labels Oct 25, 2020
@SkafteNicki SkafteNicki added this to the 1.1 milestone Oct 25, 2020
@SkafteNicki SkafteNicki merged commit e0b856c into Lightning-AI:master Oct 30, 2020
Metrics package automation moved this from In Progress to Done Oct 30, 2020
@SkafteNicki SkafteNicki deleted the metrics/confusion_matrix branch October 30, 2020 10:44
@ydcjeff ydcjeff modified the milestones: 1.1, 1.0.x Oct 30, 2020
@Vichoko
Contributor

Vichoko commented Oct 30, 2020

In which PTL version will this be released?

@ydcjeff
Contributor

ydcjeff commented Oct 30, 2020

In which PTL version will this be released?

It could be in the coming point release on Tuesday.

@NumesSanguis
Contributor

According to the release changelog, ConfusionMatrix should be in PL v1.0.5.
However, trying to import it fails:

>>> from pytorch_lightning.metrics import ConfusionMatrix
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
ImportError: cannot import name 'ConfusionMatrix' from 'pytorch_lightning.metrics' (*path*/python3.8/site-packages/pytorch_lightning/metrics/__init__.py)

Colab: https://colab.research.google.com/drive/1QHFCBtGEv215DuLw6Suqi9A1WHiExlZR?usp=sharing
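
A quick sanity check before importing (a minimal REPL sketch; the printed version string is just illustrative):

>>> import pytorch_lightning as pl
>>> pl.__version__
'1.0.5'

If the installed version predates the release that actually includes this PR, the import will raise exactly the ImportError above.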

@ydcjeff
Contributor

ydcjeff commented Nov 4, 2020

@NumesSanguis I think they cherry-picked the commits for the 1.0.5 release, so this one wasn't included. Maybe in 1.1.

@NumesSanguis
Contributor

@ydcjeff Then the patch notes are wrong:

Added ConfusionMatrix class interface (#4348)
Added multiclass AUROC metric (#4236)

This metric also already appears in the docs:
https://pytorch-lightning.readthedocs.io/en/latest/metrics.html#confusionmatrix

@ydcjeff
Contributor

ydcjeff commented Nov 4, 2020

Yep, we are in the transition between a point release and a minor release; we will edit the changelog.

@s-rog
Contributor

s-rog commented Nov 4, 2020

#4505

@NumesSanguis
Contributor

I don't see an associated Feature Request, so I hope bug reports here are ok.

@SkafteNicki I tried this ConfusionMatrix using the master branch (pip install git+https://github.com/PytorchLightning/pytorch-lightning.git@master --upgrade), but there is an error when the last batch size does not match earlier batches:

     25                              threshold: float = 0.5) -> torch.Tensor:
     26     preds, target = _input_format_classification(preds, target, threshold)
---> 27     unique_mapping = (target.view(-1) * num_classes + preds.view(-1)).to(torch.long)
     28     bins = torch.bincount(unique_mapping, minlength=num_classes ** 2)
     29     confmat = bins.reshape(num_classes, num_classes)

RuntimeError: The size of tensor a (32) must match the size of tensor b (16) at non-singleton dimension 0

Colab: https://colab.research.google.com/drive/1QHFCBtGEv215DuLw6Suqi9A1WHiExlZR?usp=sharing

@NumesSanguis
Contributor

@SkafteNicki There is also an issue (once the batch-size mismatch above is avoided) when trying to log the Confusion Matrix with the v1.0.x logging approach:

self.confmat(output.softmax(dim=1), labels)
self.log(f"confmat/val", self.confmat)

Gives:

/usr/local/lib/python3.6/dist-packages/pytorch_lightning/trainer/logging.py in metrics_to_scalars(self, metrics)
     46         for k, v in metrics.items():
     47             if isinstance(v, torch.Tensor):
---> 48                 v = v.item()
     49 
     50             if isinstance(v, dict):

ValueError: only one element tensors can be converted to Python scalars

Colab: https://colab.research.google.com/drive/1GRi0VhRTbDGdj4D1HUMo49Pe7__tWe31?usp=sharing

@s-rog
Contributor

s-rog commented Nov 4, 2020

@NumesSanguis Mind opening a separate issue?

@SkafteNicki
Member Author

(quoting the batch-size mismatch report above)

This is not a realistic example. When you create artificial labels with batch[1].sigmoid(), you are building the label tensor along the feature dimension; you should instead build it along the batch dimension, i.e. batch[:, 0].sigmoid().
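
For illustration, a minimal sketch of the shape problem (the sizes 32 and 10 are placeholders, not taken from the Colab):

import torch

batch = torch.randn(32, 10)  # 32 samples, 10 features

# batch[1] selects a single sample, so the "labels" run along the
# feature dimension and have length 10 instead of 32:
wrong_labels = (batch[1].sigmoid() > 0.5).long()     # shape (10,)

# batch[:, 0] selects one value per sample, so the labels follow the
# batch dimension and line up with the 32 predictions:
right_labels = (batch[:, 0].sigmoid() > 0.5).long()  # shape (32,)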

(quoting the self.log report above)

self.log does not allow logging anything other than scalar tensors. This is a general limitation of self.log (not specifically related to the confusion matrix).
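
If you still want something in the scalar logs, one workaround (a sketch, not an official recipe; the log key is illustrative) is to log a scalar summary derived from the matrix:

confmat = self.confmat(output.softmax(dim=1), labels)  # (num_classes, num_classes)

# the diagonal counts correct predictions, so overall accuracy is a
# scalar that self.log can handle:
accuracy = confmat.diag().sum().float() / confmat.sum().float()
self.log('val/acc_from_confmat', accuracy)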

@Vichoko
Contributor

Vichoko commented Nov 4, 2020

If there is no logging support through self.log, how are we supposed to aggregate the confusion matrix across batches and GPUs (in a multi-GPU setup)?

@SkafteNicki
Member Author

@Vichoko depending on what logger you are using, you can create the confusion matrix as a figure and log it. For example using the default tensorboard logger:

import matplotlib.pyplot as plt
...
def training_step(self, batch, batch_idx):
    ...
    # calling the metric returns the matrix for this batch and updates its internal state
    step_confmat = self.metric(pred, target)
    fig = plt.figure()
    plt.imshow(step_confmat)
    self.logger.experiment.add_figure('step_confmat', fig, global_step=self.global_step)

def training_epoch_end(self, outputs):
    # compute() returns the matrix accumulated over the whole epoch
    epoch_confmat = self.metric.compute()
    fig = plt.figure()
    plt.imshow(epoch_confmat)
    self.logger.experiment.add_figure('epoch_confmat', fig, global_step=self.global_step)

@NumesSanguis
Contributor

(quoting the batch-dimension explanation above)

Sorry, somehow I had my own classification problem in mind where I assumed that batch was a tuple of (inputs, labels), hence I used batch[1].


Thank you for the confusion matrix example; that was actually what I was searching for!

@NumesSanguis
Contributor

I assume the x-axis (bottom) represents the predicted label and the y-axis (left) the true label, but could this be added to the documentation?

@SkafteNicki
Member Author

We follow scikit-learn's way of representing confusion matrices (https://scikit-learn.org/stable/modules/generated/sklearn.metrics.confusion_matrix.html#sklearn.metrics.confusion_matrix), which is indeed predicted along the x-axis/columns and true along the y-axis/rows.
@NumesSanguis please feel free to open a PR with the recommended doc update.
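
Until the docs are updated, a small self-contained sketch that makes the orientation explicit when plotting (the matrix values are made up):

import matplotlib.pyplot as plt
import torch

confmat = torch.tensor([[5, 1, 0],
                        [2, 7, 1],
                        [0, 1, 9]])  # placeholder 3-class matrix

fig, ax = plt.subplots()
ax.imshow(confmat)
ax.set_xlabel('Predicted label')  # columns, per the sklearn convention
ax.set_ylabel('True label')       # rows
plt.show()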

@Borda
Member

Borda commented Nov 6, 2020

@SkafteNicki @NumesSanguis the changelog will be fixed in #4505

SeanNaren pushed a commit that referenced this pull request Nov 10, 2020
* docs + precision + recall + f_beta + refactor

Co-authored-by: Teddy Koker <teddy.koker@gmail.com>

* rebase

Co-authored-by: Teddy Koker <teddy.koker@gmail.com>

* fixes

Co-authored-by: Teddy Koker <teddy.koker@gmail.com>

* added missing file

* docs

* docs

* extra import

* add confusion matrix

* add to docs

* add test

* pep8 + isort

* update tests

* move util function

* unify functional and class

* add to init

* remove old implementation

* update tests

* pep8

* add duplicate

* fix doctest

* Update pytorch_lightning/metrics/classification/confusion_matrix.py

Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>

* changelog

* bullet point args

* bullet docs

* bullet docs

Co-authored-by: ananyahjha93 <ananya@pytorchlightning.ai>
Co-authored-by: Teddy Koker <teddy.koker@gmail.com>
Co-authored-by: Justus Schock <12886177+justusschock@users.noreply.github.com>
Co-authored-by: chaton <thomas@grid.ai>
Co-authored-by: Roger Shieh <55400948+s-rog@users.noreply.github.com>
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>

(cherry picked from commit e0b856c)
@Vichoko
Contributor

Vichoko commented Nov 11, 2020

The changelog implied that this would be released in 1.0.6, but it wasn't :c. Should I be worried about using this feature before 1.1?
