FAIL: ubuntu 22.04 failed spectacularly #60

Open
jdgh000 opened this issue Apr 15, 2023 · 3 comments

@jdgh000

jdgh000 commented Apr 15, 2023

Installed CUDA, but the container toolkit setup appears to be installing Ubuntu 18.04 packages.
Followed the instructions, which are obviously faulty:
https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#installation-guide


jd@jd-MS-7B22:~/gg/git/codelab/gpu/cuda/cuda-by-example$ nvidia-ctk --version
NVIDIA Container Toolkit CLI version 1.13.0
commit: b7079454b5b8fed1390ce78ca5a3343748f62657
jd@jd-MS-7B22:~/gg/git/codelab/gpu/cuda/cuda-by-example$ sudo nvidia-ctk cdi generate --output=/etc/cdi/nvidia.yaml
INFO[0000] Auto-detected mode as "nvml"
ERRO[0000] failed to generate CDI spec: failed to create device CDI specs: failed to initalize NVML: ERROR_LIB_RM_VERSION_MISMATCH
jd@jd-MS-7B22:~/gg/git/codelab/gpu/cuda/cuda-by-example$


jd@jd-MS-7B22:~/gg/git/codelab/gpu/cuda/cuda-by-example$ cat /etc/apt/sources.list.d/nvidia-container-toolkit.list
deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://nvidia.github.io/libnvidia-container/stable/ubuntu18.04/$(ARCH) /
#deb [signed-by=/usr/share/keyrings/nvidia-container-toolkit-keyring.gpg] https://nvidia.github.io/libnvidia-container/experimental/ubuntu18.04/$(ARCH) /


jd@jd-MS-7B22:~/gg/git/codelab/gpu/cuda/cuda-by-example$ apt list --installed | grep cuda | grep toolkit

WARNING: apt does not have a stable CLI interface. Use with caution in scripts.

cuda-toolkit-12-1-config-common/unknown,now 12.1.55-1 all [installed,auto-removable]
cuda-toolkit-12-1/unknown,now 12.1.0-1 amd64 [installed,auto-removable]
cuda-toolkit-12-config-common/unknown,now 12.1.55-1 all [installed,auto-removable]
cuda-toolkit-config-common/unknown,now 12.1.55-1 all [installed,auto-removable]
nvidia-cuda-toolkit-doc/jammy,jammy,now 11.5.1-1ubuntu1 all [installed,automatic]
nvidia-cuda-toolkit/jammy,now 11.5.1-1ubuntu1 amd64 [installed]
jd@jd-MS-7B22:~/gg/git/codelab/gpu/cuda/cuda-by-example$ cat /etc/os-release
PRETTY_NAME="Ubuntu 22.04.2 LTS"
NAME="Ubuntu"
VERSION_ID="22.04"
VERSION="22.04.2 LTS (Jammy Jellyfish)"
VERSION_CODENAME=jammy
ID=ubuntu
ID_LIKE=debian
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
UBUNTU_CODENAME=jammy



@elezar
Member

elezar commented Apr 17, 2023

The use of the ubuntu18.04 packages is expected.

What does running nvidia-smi on your machine show?

@pcgeek86

The use of the ubuntu18.04 packages is expected.

That doesn't make any sense whatsoever. Why would Ubuntu 18.04 packages be installed on Ubuntu 22.04?

@elezar
Member

elezar commented Jul 20, 2023

We only produce one flavour of ubuntu packages (built on Ubuntu18.04) which are forward-compatible. We also do not mirror the entire ubuntu18.04 repository for newer ubuntu versions. We do, however, replicate the repository list file to simplify the installation process on newer Ubuntu versions.
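A rough way to see this on a given machine is to compare the distribution string from the os-release file with the suite that the apt list file actually points at. The sketch below assumes the list file lives where the issue shows it (/etc/apt/sources.list.d/nvidia-container-toolkit.list); the function names and the optional path arguments are hypothetical helpers, not part of any NVIDIA tooling.

```shell
# Print ID + VERSION_ID (e.g. ubuntu22.04) from an os-release file.
# The optional path argument is a hypothetical convenience for testing.
distribution_id() {
    os_release="${1:-/etc/os-release}"
    # Source in a subshell so ID and VERSION_ID do not leak into the caller.
    ( . "$os_release" && printf '%s%s\n' "$ID" "$VERSION_ID" )
}

# Print the distinct ubuntuXX.YY suites referenced by active (uncommented)
# entries in an apt list file.
repo_suites() {
    list_file="${1:-/etc/apt/sources.list.d/nvidia-container-toolkit.list}"
    grep -v '^[[:space:]]*#' "$list_file" \
        | grep -o 'ubuntu[0-9][0-9.]*' \
        | sort -u
}
```

Calling `distribution_id` and then `repo_suites` on the reporter's machine would be expected to print different values, given the os-release and list file contents quoted in the issue. Per the comment above, that difference is by design rather than a sign of a broken install.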

@pcgeek86 does this answer your question?

@jdgh000 looking at your output again, it seems the CUDA drivers may not be installed correctly, which is why I requested the nvidia-smi output.
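ERROR_LIB_RM_VERSION_MISMATCH generally means the user-space NVML library and the loaded NVIDIA kernel module come from different driver versions. As a rough complement to nvidia-smi, the sketch below compares the two. Both default paths (/sys/module/nvidia/version and /usr/lib/x86_64-linux-gnu) are assumptions for a typical Ubuntu amd64 install, and the function names are hypothetical.

```shell
# Version of the currently loaded NVIDIA kernel module (empty if not loaded).
kernel_module_version() {
    cat "${1:-/sys/module/nvidia/version}" 2>/dev/null
}

# Highest version suffix among libnvidia-ml.so.<version> files in a directory.
# The glob needs two dots after ".so", so the libnvidia-ml.so.1 link is skipped.
nvml_library_version() {
    lib_dir="${1:-/usr/lib/x86_64-linux-gnu}"
    for f in "$lib_dir"/libnvidia-ml.so.*.*; do
        [ -e "$f" ] || continue
        basename "$f" | sed 's/^libnvidia-ml\.so\.//'
    done | sort -V | tail -n 1
}

# Report whether the two versions agree.
# Returns 0 on match, 1 on mismatch, 2 if either version cannot be found.
check_driver_match() {
    kmod=$(kernel_module_version "$1")
    lib=$(nvml_library_version "$2")
    if [ -z "$kmod" ] || [ -z "$lib" ]; then
        echo "unknown: kernel module='$kmod' library='$lib'"
        return 2
    fi
    if [ "$kmod" = "$lib" ]; then
        echo "match: $kmod"
    else
        echo "mismatch: kernel module $kmod, library $lib"
        return 1
    fi
}
```

Running `check_driver_match` with no arguments uses the default paths. A mismatch often appears after a driver package upgrade without a reboot or module reload, though that is only one possible cause and has not been confirmed for this report.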
