vulkan is broken #140
Comments
Also tested on a PopOS 18.04 (Ubuntu 18.04 based) system with a Pascal GeForce, nvidia driver 460.67, nvidia-container 1.3.3, and docker CE 20.10.5... same error
Could this possibly be related to:
Thanks; that is unlikely the same issue, since
@qhaas Did you figure out a workaround?
While granting access to the XAUTHORITY file and XSOCKET as you describe works for OpenGL, it somehow does not work for Vulkan. You have to activate the So instead of
@elezar Can you please add some information on how this issue was solved?
@flxai I closed this issue due to inactivity. We are consolidating our repos and this includes where issues are handled. Since this has not been addressed, I will reopen and transfer to the https://github.com/NVIDIA/nvidia-container-toolkit repo instead.
@flxai I have had a chance to confirm the instructions for running a vulkan container image. Assuming the following Dockerfile:
Building it:
And run the image:
This should output the same information as
This was tested with v1.14.4 of the NVIDIA Container Toolkit. If this does not work as expected, please create a new issue against this repo.
1. Issue or feature description
Vulkan appears to be broken on Enterprise Linux 8.3 x86-64 hosts. It has worked for us before; we are not sure what changed or when. OpenGL appears to be working fine, as the X window for such applications appears and renders their GUIs as expected. Vulkan applications launch, the window briefly appears, then they segfault. I doubt it is a bug in the nvidia vulkan container image, since I can run OpenGL/Vulkan applications after converting it into a Singularity container.
2. Steps to reproduce the issue
`docker run --net=host --rm -it $(xForwardDockerRunArgs) --gpus=all nvidia/vulkan:1.1.121-cuda-10.1-beta.1-ubuntu18.04`
Install `glxgears` and `vulkan-smoketest` inside the container with `apt-get update && apt-get install -y vulkan-utils mesa-utils`.
`glxgears` launches an X window with spinning gears and uses the GPU; this implies 'x-forwarding' to the host is working and OpenGL is using the GPU. Run `glxgears` in the container and, after the window appears, run `nvidia-smi` on the host.
Run `vulkan-smoketest` and watch a black window briefly appear/disappear with `Segmentation fault (core dumped)` in the container's terminal.
`dmesg | tail` on the host reports something like `segfault at 0 ip ... sp ... error 4 in vulkan-smoketest...`
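The `xForwardDockerRunArgs` helper used in the `docker run` command is not defined anywhere in this issue. Below is a minimal sketch of what such a helper might print, assuming the common approach of passing `DISPLAY`, bind-mounting the X11 socket directory, and sharing the Xauthority file. The function body, the `/tmp/.X11-unix` path, and the fallback defaults are assumptions, not the reporter's actual helper.

```shell
# Hypothetical stand-in for the reporter's xForwardDockerRunArgs helper.
# It only prints docker arguments; it does not run docker itself.
xForwardDockerRunArgs() {
    # Assumed defaults; adjust for your X server setup.
    xsock="/tmp/.X11-unix"
    xauth="${XAUTHORITY:-$HOME/.Xauthority}"
    display="${DISPLAY:-:0}"

    # Pass the display and Xauthority location into the container and
    # bind-mount the X11 socket directory and the Xauthority file.
    printf '%s\n' "-e DISPLAY=${display} -e XAUTHORITY=${xauth} -v ${xsock}:${xsock}:rw -v ${xauth}:${xauth}:ro"
}

# Usage, as in the command from the issue:
#   docker run --net=host --rm -it $(xForwardDockerRunArgs) --gpus=all \
#       nvidia/vulkan:1.1.121-cuda-10.1-beta.1-ubuntu18.04
```

The unquoted `$(xForwardDockerRunArgs)` relies on shell word splitting, so this sketch would break if the Xauthority path contained spaces.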
As a sanity check with Singularity 3.7, convert the same image to a Singularity image and run it; Vulkan works fine:
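The reporter's exact Singularity commands are not preserved here. A rough sketch of such a conversion, assuming the image is pulled straight from Docker Hub and that the test tools are already present in it, might look like the following. The function name and the `vulkan.sif` file name are placeholders. Because the steps above install `vulkan-utils` at runtime, a real conversion would probably need those packages baked into the image first.

```shell
# Image tag taken from the issue; SIF name and function name are hypothetical.
IMAGE="nvidia/vulkan:1.1.121-cuda-10.1-beta.1-ubuntu18.04"
SIF="vulkan.sif"

# convert_and_run CMD [ARGS...]: build a SIF from the Docker image, then run CMD in it.
convert_and_run() {
    # Convert the Docker Hub image into a Singularity image file.
    singularity build "$SIF" "docker://$IMAGE" || return 1
    # --nv makes the host's NVIDIA driver libraries and devices available.
    singularity exec --nv "$SIF" "$@"
}

# Usage:
#   convert_and_run vulkan-smoketest
```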
3. Information to attach (optional if deemed irrelevant)
Some nvidia-container information from `nvidia-container-cli -k -d /dev/tty info`: nvidia-container-cli.txt
Kernel version from `uname -a`: Linux fedorarouge 4.18.0-240.15.1.el8_3.x86_64 #1 SMP Wed Feb 3 03:12:15 EST 2021 x86_64 x86_64 x86_64 GNU/Linux
Any relevant kernel output lines from `dmesg`
Driver information from `nvidia-smi -a`: nvidia-smi.txt
Docker version from `docker version`: docker.txt
NVIDIA packages version from `dpkg -l '*nvidia*'` or `rpm -qa '*nvidia*'`: nvidia_rpm.txt
NVIDIA container library version from `nvidia-container-cli -V`: nvidia_container_cli.txt
NVIDIA container library logs (see troubleshooting): nvidia-container-toolkit.txt
Docker command, image and tag used: `docker run --net=host --rm -it $(xForwardDockerRunArgs) --gpus=all nvidia/vulkan:1.1.121-cuda-10.1-beta.1-ubuntu18.04`
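The attachments listed above can be gathered in one pass. The following is a minimal sketch, not an official NVIDIA script. The commands and most file names come from the list above, while the helper names `collect` and `collect_all` and the file names `uname.txt`, `dmesg.txt`, and `nvidia_dpkg.txt` are made up for illustration. Tools that are not installed are skipped.

```shell
# collect OUTFILE CMD [ARGS...]: run CMD and save its stdout and stderr to OUTFILE.
# Skips the command (and creates no file) when CMD is not installed.
collect() {
    _out="$1"; shift
    if command -v "$1" >/dev/null 2>&1; then
        "$@" >"$_out" 2>&1 || echo "warning: '$*' exited non-zero" >&2
    else
        echo "skipping '$1': not installed" >&2
    fi
}

# collect_all [DIR]: write every diagnostic file into DIR (default: current directory).
collect_all() {
    _dir="${1:-.}"
    collect "$_dir/nvidia-container-cli.txt" nvidia-container-cli -k -d /dev/tty info
    collect "$_dir/uname.txt"                uname -a
    collect "$_dir/dmesg.txt"                dmesg    # may require root on some hosts
    collect "$_dir/nvidia-smi.txt"           nvidia-smi -a
    collect "$_dir/docker.txt"               docker version
    collect "$_dir/nvidia_rpm.txt"           rpm -qa '*nvidia*'
    collect "$_dir/nvidia_dpkg.txt"          dpkg -l '*nvidia*'
    collect "$_dir/nvidia_container_cli.txt" nvidia-container-cli -V
}

# Usage:
#   mkdir -p report && collect_all report
```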