Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Compatibility Issue with NVIDIA RTX 4060 and PyTorch: CUDA Capability sm_89 Not Supported and TypeError in MMCV Config #1

Open
Qingbolan opened this issue Dec 1, 2023 · 0 comments

Comments

@Qingbolan
Copy link

Qingbolan commented Dec 1, 2023

(ACFormer) silan@Qingbolan:~/MUST/2309/ACFormer$ CUDA_VISIBLE_DEVICES=0 bash tools/dist_train.sh configs/ACFormer_Lizard.py 1 --work-dir=./1.output/
/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/torch/distributed/launch.py:178: FutureWarning: The module torch.distributed.launch is deprecated
and will be removed in future. Use torchrun.
Note that --use_env is set by default in torchrun.
If your script expects `--local_rank` argument to be set, please
change it to read from `os.environ['LOCAL_RANK']` instead. See
https://pytorch.org/docs/stable/distributed.html#launch-utility for
further instructions

  warnings.warn(
/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/torch/cuda/__init__.py:146: UserWarning:
NVIDIA GeForce RTX 4060 Laptop GPU with CUDA capability sm_89 is not compatible with the current PyTorch installation.
The current PyTorch install supports CUDA capabilities sm_37 sm_50 sm_60 sm_61 sm_70 sm_75 compute_37.
If you want to use the NVIDIA GeForce RTX 4060 Laptop GPU GPU with PyTorch, please check the instructions at https://pytorch.org/get-started/locally/

  warnings.warn(incompatible_device_warn.format(device_name, capability, " ".join(arch_list), device_name))
Traceback (most recent call last):
  File "tools/train.py", line 198, in <module>
    main()
  File "tools/train.py", line 137, in main
    cfg.dump(osp.join(cfg.work_dir, osp.basename(args.config)))
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/mmcv/utils/config.py", line 596, in dump
    f.write(self.pretty_text)
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/mmcv/utils/config.py", line 508, in pretty_text
    text, _ = FormatCode(text, style_config=yapf_style, verify=True)
TypeError: FormatCode() got an unexpected keyword argument 'verify'
ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 0 (pid: 26049) of binary: /home/silan/develop/python/anaconda3/envs/ACFormer/bin/python
Traceback (most recent call last):
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/runpy.py", line 192, in _run_module_as_main
    return _run_code(code, main_globals, None,
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/runpy.py", line 85, in _run_code
    exec(code, run_globals)
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/torch/distributed/launch.py", line 193, in <module>
    main()
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/torch/distributed/launch.py", line 189, in main
    launch(args)
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/torch/distributed/launch.py", line 174, in launch
    run(args)
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/torch/distributed/run.py", line 752, in run
    elastic_launch(
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 131, in __call__
    return launch_agent(self._config, self._entrypoint, list(args))
  File "/home/silan/develop/python/anaconda3/envs/ACFormer/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 245, in launch_agent
    raise ChildFailedError(
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
============================================================
tools/train.py FAILED
------------------------------------------------------------
Failures:
  <NO_OTHER_FAILURES>
------------------------------------------------------------
Root Cause (first observed failure):
[0]:
  time      : 2023-12-01_18:12:41
  host      : Qingbolan.
  rank      : 0 (local_rank: 0)
  exitcode  : 1 (pid: 26049)
  error_file: <N/A>
  traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html
============================================================
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant