cannot import accelerate when torch>=2.0.1 and torch.distributed is disabled #1787

Closed

natsukium opened this issue Jul 28, 2023 · 1 comment · Fixed by #1800 or #2121
natsukium commented Jul 28, 2023

System Info

I can't run `accelerate env` because of an import error.

accelerate: 0.21.0
OS: macOS
python: 3.10.12
numpy: 1.24.2
torch: 2.0.1

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • One of the scripts in the examples/ folder of Accelerate or an officially supported no_trainer script in the examples folder of the transformers repo (such as run_no_trainer_glue.py)
  • My own task or dataset (give details below)

Reproduction

  1. Build torch >= 2.0.1 with USE_DISTRIBUTED=0
  2. Install accelerate == 0.21.0
  3. Run python -c "import accelerate"
  4. The import raises ModuleNotFoundError: No module named 'torch._C._distributed_c10d'; 'torch._C' is not a package
Traceback (most recent call last):
  File "<string>", line 1, in <module>
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/accelerate/__init__.py", line 3, in <module>
    from .accelerator import Accelerator
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/accelerate/accelerator.py", line 35, in <module>
    from .checkpointing import load_accelerator_state, load_custom_state, save_accelerator_state, save_custom_state
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/accelerate/checkpointing.py", line 24, in <module>
    from .utils import (
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/accelerate/utils/__init__.py", line 132, in <module>
    from .fsdp_utils import load_fsdp_model, load_fsdp_optimizer, save_fsdp_model, save_fsdp_optimizer
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/accelerate/utils/fsdp_utils.py", line 24, in <module>
    import torch.distributed.checkpoint as dist_cp
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/torch/distributed/checkpoint/__init__.py", line 1, in <module>
    from .metadata import (
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/torch/distributed/checkpoint/metadata.py", line 3, in <module>
    from torch.distributed._shard.sharded_tensor.metadata import TensorProperties
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/torch/distributed/_shard/__init__.py", line 1, in <module>
    from .api import (
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/torch/distributed/_shard/api.py", line 5, in <module>
    from torch.distributed import distributed_c10d
  File "/nix/store/v9h5iiawvw6y0j03840qxjpqc9nbk4c2-python3-3.10.12-env/lib/python3.10/site-packages/torch/distributed/distributed_c10d.py", line 16, in <module>
    from torch._C._distributed_c10d import (
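
For reference, a quick way to confirm whether the installed torch build ships distributed support (a diagnostic sketch, not part of the repro steps):

```python
import torch

# On a build compiled with USE_DISTRIBUTED=0 this prints False;
# on a standard PyPI wheel it prints True.
print(torch.distributed.is_available())
```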

Expected behavior

This is the line that triggers the error, in accelerate/utils/fsdp_utils.py:

if is_torch_version(">=", FSDP_PYTORCH_VERSION):
    import torch.distributed.checkpoint as dist_cp

I think it would be better to decide whether to import torch.distributed based on torch.distributed.is_available() in addition to the torch version.
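
A minimal standalone sketch of such a guard; the FSDP_PYTORCH_VERSION value and the packaging-based version comparison are stand-ins for accelerate's own constant and is_torch_version helper:

```python
import torch
from packaging import version

# Stand-in for accelerate's FSDP_PYTORCH_VERSION constant (exact value assumed here).
FSDP_PYTORCH_VERSION = "2.0.0"

# Import the distributed checkpoint API only when the torch version is new enough
# AND the build actually ships torch.distributed (i.e. USE_DISTRIBUTED=1).
if (
    version.parse(torch.__version__) >= version.parse(FSDP_PYTORCH_VERSION)
    and torch.distributed.is_available()
):
    import torch.distributed.checkpoint as dist_cp
```

With a guard like this, `import accelerate` would no longer fail on distributed-free builds; FSDP-related features would simply be unavailable.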

sgugger commented Jul 28, 2023

Yes, we need a torch.distributed.is_available() check in that test in case PyTorch was built without distributed support, cc @pacman100
