Error in training example #23

Closed
robmarkcole opened this issue Aug 11, 2021 · 3 comments

@robmarkcole commented Aug 11, 2021

Following the example from the README:

ValueError                                Traceback (most recent call last)
<ipython-input-11-f854c515c2ab> in <module>()
----> 1 trainer.fit(model, datamodule=dm)

23 frames
/usr/local/lib/python3.7/dist-packages/torchmetrics/functional/classification/stat_scores.py in _stat_scores_update(preds, target, reduce, mdmc_reduce, num_classes, top_k, threshold, multiclass, ignore_index)
    123         if not mdmc_reduce:
    124             raise ValueError(
--> 125                 "When your inputs are multi-dimensional multi-class, you have to set the `mdmc_reduce` parameter"
    126             )
    127         if mdmc_reduce == "global":

ValueError: When your inputs are multi-dimensional multi-class, you have to set the `mdmc_reduce` parameter
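
For context, the error comes from torchmetrics rather than the training loop: the stat-score based metrics refuse to handle multi-dimensional multi-class inputs (e.g. per-pixel segmentation masks) unless told how to reduce the extra dimensions. A minimal sketch that reproduces the same ValueError with torchmetrics 0.5.0, using made-up tensor shapes rather than the repo's actual data:

import torch
import torchmetrics

# Hypothetical segmentation-style batch: 4 images, 32x32 masks, 2 classes.
preds = torch.randint(0, 2, (4, 32, 32))   # predicted class index per pixel
target = torch.randint(0, 2, (4, 32, 32))  # ground-truth class index per pixel

# With mdmc_average left unset, multi-dimensional multi-class inputs
# raise the ValueError shown in the traceback above.
metric = torchmetrics.Precision(num_classes=2)
metric(preds, target)  # ValueError: ... set the `mdmc_reduce` parameter
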
@isaaccorley (Owner) commented Aug 11, 2021

That's odd. I recently made an example under examples/levircd+.ipynb that uses the same setup and didn't run into that error.

When I get a chance I'll try running it locally.

@robmarkcole (Author) commented Aug 12, 2021

I launched examples/levircd+.ipynb directly in Colab and hit exactly the same issue there!

1.4 M     Trainable params
0         Non-trainable params
1.4 M     Total params
5.402     Total estimated model params size (MB)
Validation sanity check: 0%
0/2 [00:00<?, ?it/s]
/usr/local/lib/python3.7/dist-packages/pytorch_lightning/trainer/data_loading.py:106: UserWarning: The dataloader, val dataloader 0, does not have many workers which may be a bottleneck. Consider increasing the value of the `num_workers` argument` (try 4 which is the number of cpus on this machine) in the `DataLoader` init to improve performance.
  f"The dataloader, {name}, does not have many workers which may be a bottleneck."
---------------------------------------------------------------------------
ValueError                                Traceback (most recent call last)
<ipython-input-17-f854c515c2ab> in <module>()
----> 1 trainer.fit(model, datamodule=dm)

23 frames
/usr/local/lib/python3.7/dist-packages/torchmetrics/functional/classification/stat_scores.py in _stat_scores_update(preds, target, reduce, mdmc_reduce, num_classes, top_k, threshold, multiclass, ignore_index)
    123         if not mdmc_reduce:
    124             raise ValueError(
--> 125                 "When your inputs are multi-dimensional multi-class, you have to set the `mdmc_reduce` parameter"
    126             )
    127         if mdmc_reduce == "global":

ValueError: When your inputs are multi-dimensional multi-class, you have to set the `mdmc_reduce` parameter

However, the good news is that I don't have any issue with examples/probav.ipynb.

@isaaccorley (Owner)

Looks like the latest torchmetrics==0.5.0 release requires a new parameter when computing Accuracy, Precision, and Recall on 2D segmentation mask outputs. I just had to add mdmc_average="global" to each of the metrics in the Lightning modules. I'm able to train now, so it seems to be fixed. See #25
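
For reference, the change boils down to passing mdmc_average="global" when the metrics are constructed. A rough sketch of the metric setup after the fix (the MetricCollection layout and num_classes=2 are illustrative, not necessarily the repo's exact code):

import torch
import torchmetrics

# mdmc_average="global" tells each metric to flatten the extra spatial
# dimensions and compute the statistics over all pixels at once.
metrics = torchmetrics.MetricCollection({
    "Accuracy": torchmetrics.Accuracy(num_classes=2, mdmc_average="global"),
    "Precision": torchmetrics.Precision(num_classes=2, mdmc_average="global"),
    "Recall": torchmetrics.Recall(num_classes=2, mdmc_average="global"),
})

preds = torch.randint(0, 2, (4, 32, 32))
target = torch.randint(0, 2, (4, 32, 32))
print(metrics(preds, target))  # no ValueError: stats are reduced globally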
