[nnUNet/PyTorch] PyTorch Libary Import Error with most recent release #1113

tjhendrickson · 2022-04-18T21:48:47Z

Related to nnUNet/PyTorch(s)
(e.g. GNMT/PyTorch or FasterTransformer/All)

Describe the bug

Within Docker container, typing python main.py --help produces a traceback error.

Traceback (most recent call last):
  File "main.py", line 19, in <module>
    from pytorch_lightning import Trainer, seed_everything
  File "/opt/conda/lib/python3.8/site-packages/pytorch_lightning/__init__.py", line 20, in <module>
    from pytorch_lightning import metrics  # noqa: E402
  File "/opt/conda/lib/python3.8/site-packages/pytorch_lightning/metrics/__init__.py", line 15, in <module>
    from pytorch_lightning.metrics.classification import (  # noqa: F401
  File "/opt/conda/lib/python3.8/site-packages/pytorch_lightning/metrics/classification/__init__.py", line 14, in <module>
    from pytorch_lightning.metrics.classification.accuracy import Accuracy  # noqa: F401
  File "/opt/conda/lib/python3.8/site-packages/pytorch_lightning/metrics/classification/accuracy.py", line 18, in <module>
    from pytorch_lightning.metrics.utils import deprecated_metrics
  File "/opt/conda/lib/python3.8/site-packages/pytorch_lightning/metrics/utils.py", line 22, in <module>
    from torchmetrics.utilities.data import get_num_classes as _get_num_classes
ImportError: cannot import name 'get_num_classes' from 'torchmetrics.utilities.data' (/opt/conda/lib/python3.8/site-packages/torchmetrics/utilities/data.py)

To Reproduce
Steps to reproduce the behavior:

Create Docker image by following quick start guide on nnUNet for PyTorch
"Shell" into container with sudo docker run -it nnunet:latest /bin/bash
Execute main.py python main.py --help

The text was updated successfully, but these errors were encountered:

tjhendrickson · 2022-04-19T18:41:54Z

Downgrading torchmetrics to v0.6.0 seems to resolve the issue.

tjhendrickson · 2022-04-19T22:01:22Z

Unfortunately after modifying the torchmetrics version I am now running into a different traceback error:

  File "main.py", line 34, in <module>
    set_affinity(int(os.getenv("LOCAL_RANK", "0")), args.gpus, mode=args.affinity)
  File "/workspace/nnunet_pyt/utils/gpu_affinity.py", line 376, in set_affinity
    set_socket_unique_affinity(gpu_id, nproc_per_node, cores, "contiguous", balanced)
  File "/workspace/nnunet_pyt/utils/gpu_affinity.py", line 263, in set_socket_unique_affinity
    os.sched_setaffinity(0, ungrouped_affinities[gpu_id])
OSError: [Errno 22] Invalid argument

This error seems to persist no matter what text I enter following the --affinity flag

michal2409 · 2022-04-20T09:44:00Z

Have you tried running with --affinity disabled or commenting the L32-33 in the main.py? (https://github.com/NVIDIA/DeepLearningExamples/blob/master/PyTorch/Segmentation/nnUNet/main.py#L32).

Another fix for torchmetrics is to upgrade pytorch lightning to 1.5.10 (there are issues with 1.6.0 at the moment)

michal2409 · 2022-04-20T09:44:30Z

Have you tried running with --affinity disabled or commenting the L32-33 in the main.py? (https://github.com/NVIDIA/DeepLearningExamples/blob/master/PyTorch/Segmentation/nnUNet/main.py#L32).

Another fix for torchmetrics is to upgrade pytorch lightning to 1.5.10 (there are issues with 1.6.0 at the moment)

`cannot import name 'get_num_classes' from 'torchmetrics.utilities.data'` solved in this comment: NVIDIA/DeepLearningExamples#1113 (comment)

tjhendrickson added the bug Something isn't working label Apr 18, 2022

anhquancao mentioned this issue Apr 21, 2022

ImportError: cannot import name 'get_num_classes' from 'torchmetrics.utilities.data' astra-vision/MonoScene#18

Closed

kapsner mentioned this issue Apr 27, 2022

[Bug] ImportError: cannot import name 'get_num_classes' from 'torchmetrics.utilities.data' MIC-DKFZ/nnDetection#73

Closed

mdmanurung mentioned this issue Apr 30, 2022

ImportError due to torchmetrics theislab/cpa#2

Closed

2003100127 mentioned this issue May 18, 2022

scvi installing errors on get_num_classes and configure formatter 'console' scverse/scvi-tools#1540

Closed

JonnoFTW mentioned this issue Jul 27, 2022

Won't run out of the box pesser/stable-diffusion#1

Open

haouarihk added a commit to haouarihk/image-to-latex that referenced this issue Sep 11, 2022

fixing error

76c9cf1

`cannot import name 'get_num_classes' from 'torchmetrics.utilities.data'` solved in this comment: NVIDIA/DeepLearningExamples#1113 (comment)

zehongs mentioned this issue Oct 21, 2022

ImportError: cannot import name 'get_num_classes' from 'torchmetrics.utilities.data' zju3dv/LoFTR#216

Closed

rallen10 mentioned this issue Nov 10, 2022

ImportError due to torchmetrics version MIT-REALM/neural_clbf#7

Closed

Logicino mentioned this issue Nov 17, 2022

__init__() got an unexpected keyword argument 'truncated_bptt_steps' Lightning-AI/pytorch-lightning#15707

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[nnUNet/PyTorch] PyTorch Libary Import Error with most recent release #1113

[nnUNet/PyTorch] PyTorch Libary Import Error with most recent release #1113

tjhendrickson commented Apr 18, 2022

tjhendrickson commented Apr 19, 2022

tjhendrickson commented Apr 19, 2022

michal2409 commented Apr 20, 2022

michal2409 commented Apr 20, 2022

[nnUNet/PyTorch] PyTorch Libary Import Error with most recent release #1113

[nnUNet/PyTorch] PyTorch Libary Import Error with most recent release #1113

Comments

tjhendrickson commented Apr 18, 2022

tjhendrickson commented Apr 19, 2022

tjhendrickson commented Apr 19, 2022

michal2409 commented Apr 20, 2022

michal2409 commented Apr 20, 2022