cuDNN error when installing the develop version of Paddle #51190

Closed
zhangzhiqiangccm opened this issue Mar 4, 2023 · 5 comments
Labels: status/following-up 跟进中, type/build 编译/安装问题

Comments

@zhangzhiqiangccm

Issue Description

cudnn: 8.6, cuda: 11.7
I have already modified the environment variables, but the following check still fails:

 paddle.utils.run_check()

Running verify PaddlePaddle program ...
I0304 17:44:36.816897 3122901 interpretercore.cc:273] New Executor is Running.
W0304 17:44:36.817149 3122901 gpu_resources.cc:85] Please NOTE: device: 0, GPU Compute Capability: 8.6, Driver API Version: 11.7, Runtime API Version: 11.7
W0304 17:44:36.817360 3122901 dynamic_loader.cc:307] The third-party dynamic library (libcudnn.so) that Paddle depends on is not configured correctly. (error code is /usr/local/cuda/lib64/libcudnn.so: cannot open shared object file: No such file or directory)
Suggestions:

  1. Check if the third-party dynamic library (e.g. CUDA, CUDNN) is installed correctly and its version is matched with paddlepaddle you installed.
  2. Configure third-party dynamic library environment variables as follows:
  • Linux: set LD_LIBRARY_PATH by export LD_LIBRARY_PATH=...
  • Windows: set PATH by `set PATH=XXX;

Traceback (most recent call last):
  File "", line 1, in
  File "/opt/app/anaconda3/envs/paddle/lib/python3.9/site-packages/paddle/utils/install_check.py", line 278, in run_check
    _run_static_single(use_cuda, use_xpu, use_npu)
  File "/opt/app/anaconda3/envs/paddle/lib/python3.9/site-packages/paddle/utils/install_check.py", line 168, in _run_static_single
    exe.run(startup_prog)
  File "/opt/app/anaconda3/envs/paddle/lib/python3.9/site-packages/paddle/fluid/executor.py", line 1593, in run
    res = self._run_impl(
  File "/opt/app/anaconda3/envs/paddle/lib/python3.9/site-packages/paddle/fluid/executor.py", line 1862, in _run_impl
    return new_exe.run(
  File "/opt/app/anaconda3/envs/paddle/lib/python3.9/site-packages/paddle/fluid/executor.py", line 682, in run
    tensors = self._new_exe.run(
RuntimeError: In user code:

PreconditionNotMetError: Cannot load cudnn shared library. Cannot invoke method cudnnGetVersion.
[Hint: cudnn_dso_handle should not be null.] (at /paddle/paddle/phi/backends/dynload/cudnn.cc:64)
[operator < fill_constant > error]
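The log above reports that the dynamic loader could not open `/usr/local/cuda/lib64/libcudnn.so`. A minimal sketch for reproducing that check outside Paddle with the standard-library `ctypes` module is shown here. The helper name `probe_shared_library` is hypothetical, and the candidate names are assumptions for illustration, not the list Paddle actually searches.

```python
import ctypes


def probe_shared_library(candidates):
    """Try to load each candidate in order.

    Returns (name, handle) for the first candidate that loads,
    or (None, None) if none of them can be opened.
    """
    for name in candidates:
        try:
            return name, ctypes.CDLL(name)
        except OSError:
            # The loader could not find or open this candidate; try the next.
            continue
    return None, None


if __name__ == "__main__":
    # Candidate names are assumptions; the first is the path from the log.
    name, lib = probe_shared_library([
        "/usr/local/cuda/lib64/libcudnn.so",
        "libcudnn.so",
        "libcudnn.so.8",
    ])
    if lib is None:
        print("libcudnn could not be loaded by the dynamic loader")
    else:
        # cudnnGetVersion takes no arguments and returns a size_t.
        lib.cudnnGetVersion.restype = ctypes.c_size_t
        print(f"loaded {name}, cudnnGetVersion() = {lib.cudnnGetVersion()}")
```

If a bare name such as `libcudnn.so.8` loads but the full path from the log does not, the library is probably installed somewhere other than where Paddle is looking.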

Version & Environment Information

paddle:0.0.0
cuda:11.7
cudnn:8.6
I modified the environment variable as follows:
conda env config vars set LD_LIBRARY_PATH=$LD_LIBRARY_PATH:/opt/app/anaconda3/envs/paddle/include:/opt/app/anaconda3/envs/paddle/lib
The error persists.
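One way to see whether the directories added to `LD_LIBRARY_PATH` actually contain a cuDNN shared object is to scan each entry. The following is a rough sketch using only the standard library; the helper name `find_library_in_path` and the `libcudnn.so*` pattern are assumptions for illustration.

```python
import glob
import os


def find_library_in_path(path_value, pattern="libcudnn.so*"):
    """Map each directory in a colon-separated path string to its matching files."""
    hits = {}
    for directory in path_value.split(":"):
        if not directory:
            continue  # skip empty entries such as a trailing colon
        hits[directory] = sorted(glob.glob(os.path.join(directory, pattern)))
    return hits


if __name__ == "__main__":
    current = os.environ.get("LD_LIBRARY_PATH", "")
    for directory, matches in find_library_in_path(current).items():
        print(directory, "->", matches or "no libcudnn.so* here")
```

Note that an `include` directory normally holds headers rather than shared objects, so only the `lib` entry is likely to matter here. Variables set with `conda env config vars set` take effect only after the environment is reactivated.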

@paddle-bot

paddle-bot bot commented Mar 4, 2023

Hi! We've received your issue; please be patient while waiting for a response. We will arrange for technicians to answer your question as soon as possible. Please check again that you have provided a clear problem description, reproduction code, environment and version information, and error messages. You may also consult the official API documentation, FAQ, past GitHub Issues, and the AI community for an answer. Have a nice day!

@yt605155624
Contributor

If you are unable to configure the CUDA environment yourself, you can try running in the official Docker image.

@yt605155624 yt605155624 added status/following-up 跟进中 and removed status/new-issue 新建 labels Mar 5, 2023
@zhangzhiqiangccm
Author

> If you are unable to configure the CUDA environment yourself, you can try running in the official Docker image.

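Solved. The develop version of Paddle has some problems matching cuDNN: changing the environment variables may have no effect, and the library needs to be re-linked. In addition, the develop version is missing libnccl.so, which has to be downloaded again from the NVIDIA website and linked; otherwise multi-GPU training cannot be used.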
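The comment does not say which files were linked or where. As a rough sketch of what re-linking could look like, the helper below points a link path at an existing shared object with a symbolic link. The function name `link_library` and the example paths in the trailing comment are hypothetical, not the author's actual steps.

```python
import os


def link_library(source, link_path):
    """Create a symlink at link_path pointing to source.

    An existing symlink is replaced; a regular file is never overwritten.
    """
    if not os.path.isfile(source):
        raise FileNotFoundError(f"source library not found: {source}")
    if os.path.islink(link_path):
        os.remove(link_path)  # drop a stale or broken link first
    elif os.path.exists(link_path):
        raise FileExistsError(f"refusing to overwrite a regular file: {link_path}")
    os.symlink(source, link_path)
    return link_path


# Hypothetical usage; both paths are assumptions, and writing under
# /usr/local/cuda usually requires root privileges:
# link_library("/opt/app/anaconda3/envs/paddle/lib/libcudnn.so.8",
#              "/usr/local/cuda/lib64/libcudnn.so")
```

The same approach would apply to `libnccl.so` once it has been downloaded, assuming the link is placed in a directory that the dynamic loader searches.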

@lywang76

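> If you are unable to configure the CUDA environment yourself, you can try running in the official Docker image.
>
> Solved. The develop version of Paddle has some problems matching cuDNN: changing the environment variables may have no effect, and the library needs to be re-linked. In addition, the develop version is missing libnccl.so, which has to be downloaded again from the NVIDIA website and linked; otherwise multi-GPU training cannot be used.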

Can you show me how you solved the problem?
I met the same issue.


paddle-bot bot commented Apr 2, 2024

Since you haven't replied for more than a year, we have closed this issue/pr.
If the problem is not solved or there is a follow-up one, please reopen it at any time and we will continue to follow up.
