stderr: nvidia-container-cli: initialization error: driver error: failed to process request #183

@chunniunai220ml

I have configured Docker 19.03.6 and nvidia-docker successfully, but when I test with:

docker run --gpus all nvidia/cuda:10.0-base nvidia-smi
I get this error:

docker: Error response from daemon: OCI runtime create failed: container_linux.go:345: starting container process caused "process_linux.go:430: container init caused "process_linux.go:413: running prestart hook 0 caused \"error running hook: exit status 1, stdout: , stderr: nvidia-container-cli: initialization error: driver error: failed to process request\\n\""": unknown.

Then I checked nvidia-container-cli, and it seems to report no errors:
sudo nvidia-container-cli -k -d /dev/tty info

-- WARNING, the following logs are for debugging purposes only --

I0226 06:26:25.224982 78809 nvc.c:281] initializing library context (version=1.0.2, build=ff40da533db929bf515aca59ba4c701a65a35e6b)
I0226 06:26:25.225050 78809 nvc.c:255] using root /
I0226 06:26:25.225061 78809 nvc.c:256] using ldcache /etc/ld.so.cache
I0226 06:26:25.225071 78809 nvc.c:257] using unprivileged user 65534:65534
I0226 06:26:25.230611 78810 nvc.c:191] loading kernel module nvidia
I0226 06:26:25.230931 78810 nvc.c:203] loading kernel module nvidia_uvm
I0226 06:26:25.231053 78810 nvc.c:211] loading kernel module nvidia_modeset
I0226 06:26:25.231436 78811 driver.c:133] starting driver service
I0226 06:26:25.356687 78809 nvc_info.c:434] requesting driver information with ''
I0226 06:26:25.356983 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/vdpau/libvdpau_nvidia.so.418.87.00
I0226 06:26:25.357280 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-tls.so.418.87.00
I0226 06:26:25.357333 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ptxjitcompiler.so.418.87.00
I0226 06:26:25.357441 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-opticalflow.so.418.87.00
I0226 06:26:25.357512 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-opencl.so.418.87.00
I0226 06:26:25.357559 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ml.so.418.87.00
I0226 06:26:25.357629 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-ifr.so.418.87.00
I0226 06:26:25.357711 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-glsi.so.418.87.00
I0226 06:26:25.357760 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-glcore.so.418.87.00
I0226 06:26:25.357806 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-fbc.so.418.87.00
I0226 06:26:25.357868 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-fatbinaryloader.so.418.87.00
I0226 06:26:25.357928 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-encode.so.418.87.00
I0226 06:26:25.358002 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-eglcore.so.418.87.00
I0226 06:26:25.358053 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-compiler.so.418.87.00
I0226 06:26:25.358108 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvidia-cfg.so.418.87.00
I0226 06:26:25.358179 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libnvcuvid.so.418.87.00
I0226 06:26:25.358606 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libcuda.so.418.87.00
I0226 06:26:25.358847 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libGLX_nvidia.so.418.87.00
I0226 06:26:25.358902 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libGLESv2_nvidia.so.418.87.00
I0226 06:26:25.358951 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libGLESv1_CM_nvidia.so.418.87.00
I0226 06:26:25.359001 78809 nvc_info.c:148] selecting /usr/lib/x86_64-linux-gnu/libEGL_nvidia.so.418.87.00
W0226 06:26:25.359039 78809 nvc_info.c:303] missing compat32 library libnvidia-ml.so
W0226 06:26:25.359047 78809 nvc_info.c:303] missing compat32 library libnvidia-cfg.so
W0226 06:26:25.359056 78809 nvc_info.c:303] missing compat32 library libcuda.so
W0226 06:26:25.359066 78809 nvc_info.c:303] missing compat32 library libnvidia-opencl.so
W0226 06:26:25.359076 78809 nvc_info.c:303] missing compat32 library libnvidia-ptxjitcompiler.so
W0226 06:26:25.359086 78809 nvc_info.c:303] missing compat32 library libnvidia-fatbinaryloader.so
W0226 06:26:25.359097 78809 nvc_info.c:303] missing compat32 library libnvidia-compiler.so
W0226 06:26:25.359107 78809 nvc_info.c:303] missing compat32 library libvdpau_nvidia.so
W0226 06:26:25.359117 78809 nvc_info.c:303] missing compat32 library libnvidia-encode.so
W0226 06:26:25.359128 78809 nvc_info.c:303] missing compat32 library libnvidia-opticalflow.so
W0226 06:26:25.359138 78809 nvc_info.c:303] missing compat32 library libnvcuvid.so
W0226 06:26:25.359149 78809 nvc_info.c:303] missing compat32 library libnvidia-eglcore.so
W0226 06:26:25.359159 78809 nvc_info.c:303] missing compat32 library libnvidia-glcore.so
W0226 06:26:25.359169 78809 nvc_info.c:303] missing compat32 library libnvidia-tls.so
W0226 06:26:25.359177 78809 nvc_info.c:303] missing compat32 library libnvidia-glsi.so
W0226 06:26:25.359186 78809 nvc_info.c:303] missing compat32 library libnvidia-fbc.so
W0226 06:26:25.359194 78809 nvc_info.c:303] missing compat32 library libnvidia-ifr.so
W0226 06:26:25.359203 78809 nvc_info.c:303] missing compat32 library libGLX_nvidia.so
W0226 06:26:25.359212 78809 nvc_info.c:303] missing compat32 library libEGL_nvidia.so
W0226 06:26:25.359220 78809 nvc_info.c:303] missing compat32 library libGLESv2_nvidia.so
W0226 06:26:25.359253 78809 nvc_info.c:303] missing compat32 library libGLESv1_CM_nvidia.so
I0226 06:26:25.359527 78809 nvc_info.c:229] selecting /usr/bin/nvidia-smi
I0226 06:26:25.359560 78809 nvc_info.c:229] selecting /usr/bin/nvidia-debugdump
I0226 06:26:25.359585 78809 nvc_info.c:229] selecting /usr/bin/nvidia-persistenced
I0226 06:26:25.359608 78809 nvc_info.c:229] selecting /usr/bin/nvidia-cuda-mps-control
I0226 06:26:25.359632 78809 nvc_info.c:229] selecting /usr/bin/nvidia-cuda-mps-server
I0226 06:26:25.359667 78809 nvc_info.c:366] listing device /dev/nvidiactl
I0226 06:26:25.359676 78809 nvc_info.c:366] listing device /dev/nvidia-uvm
I0226 06:26:25.359687 78809 nvc_info.c:366] listing device /dev/nvidia-uvm-tools
I0226 06:26:25.359697 78809 nvc_info.c:366] listing device /dev/nvidia-modeset
W0226 06:26:25.359731 78809 nvc_info.c:274] missing ipc /var/run/nvidia-persistenced/socket
W0226 06:26:25.359753 78809 nvc_info.c:274] missing ipc /tmp/nvidia-mps
I0226 06:26:25.359763 78809 nvc_info.c:490] requesting device information with ''
I0226 06:26:25.366457 78809 nvc_info.c:520] listing device /dev/nvidia0 (GPU-03bb5927-ceaa-4166-ff1e-1d58a8cbf883 at 00000000:05:00.0)
I0226 06:26:25.373129 78809 nvc_info.c:520] listing device /dev/nvidia1 (GPU-26602c4d-2069-84f3-3bc9-5d943fb3bdb4 at 00000000:06:00.0)
I0226 06:26:25.380167 78809 nvc_info.c:520] listing device /dev/nvidia2 (GPU-0687efee-81a2-537e-d7fe-3a5694aceb29 at 00000000:85:00.0)
I0226 06:26:25.387215 78809 nvc_info.c:520] listing device /dev/nvidia3 (GPU-4c95eb5b-8940-562c-742f-2078cb3a02eb at 00000000:86:00.0)
NVRM version: 418.87.00
CUDA version: 10.1

Device Index: 0
Device Minor: 0
Model: Tesla K80
Brand: Tesla
GPU UUID: GPU-03bb5927-ceaa-4166-ff1e-1d58a8cbf883
Bus Location: 00000000:05:00.0
Architecture: 3.7

Device Index: 1
Device Minor: 1
Model: Tesla K80
Brand: Tesla
GPU UUID: GPU-26602c4d-2069-84f3-3bc9-5d943fb3bdb4
Bus Location: 00000000:06:00.0
Architecture: 3.7

Device Index: 2
Device Minor: 2
Model: Tesla K80
Brand: Tesla
GPU UUID: GPU-0687efee-81a2-537e-d7fe-3a5694aceb29
Bus Location: 00000000:85:00.0
Architecture: 3.7

Device Index: 3
Device Minor: 3
Model: Tesla K80
Brand: Tesla
GPU UUID: GPU-4c95eb5b-8940-562c-742f-2078cb3a02eb
Bus Location: 00000000:86:00.0
Architecture: 3.7
I0226 06:26:25.387330 78809 nvc.c:318] shutting down library context
I0226 06:26:25.388428 78811 driver.c:192] terminating driver service
I0226 06:26:25.440777 78809 driver.c:233] driver service terminated successfully

Is the NVIDIA driver version too low? In fact, 418.87.00 is the version recommended on NVIDIA's official website. Also, how can I update the driver with apt instead of manually with the driver .run file?
I do not know how to make it work. Can anyone help me?
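
In case it helps anyone comparing setups, here is a small read-only sketch that collects the versions relevant to the questions above: the driver version reported by the loaded kernel module, whether any NVIDIA packages were installed through dpkg/apt (a .run install would normally not show up there), and the libnvidia-container version. The function name `report_nvidia_stack` is made up for illustration, and the script only gathers information; it does not assume any particular cause for the error.

```shell
#!/bin/sh
# Hypothetical helper: prints version information for the NVIDIA stack.
# Every step is read-only and is skipped when the tool or file is absent.
report_nvidia_stack() {
    echo "== kernel module =="
    # Driver version as reported by the currently loaded kernel module.
    if [ -r /proc/driver/nvidia/version ]; then
        cat /proc/driver/nvidia/version
    else
        echo "/proc/driver/nvidia/version is not readable"
    fi

    echo "== dpkg packages =="
    # Packages installed through apt/dpkg whose listing mentions nvidia.
    if command -v dpkg >/dev/null 2>&1; then
        dpkg -l | grep -i nvidia || echo "no nvidia packages listed by dpkg"
    else
        echo "dpkg not available"
    fi

    echo "== libnvidia-container =="
    # Version of the CLI that the Docker prestart hook invokes.
    if command -v nvidia-container-cli >/dev/null 2>&1; then
        nvidia-container-cli --version
    else
        echo "nvidia-container-cli not found in PATH"
    fi
}

report_nvidia_stack
```

Running it both as a normal user and with sudo may show whether the two environments see the same driver and library versions.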
