Error response from daemon: OCI runtime create failed #614

sconeyard · 2018-01-22T01:44:55Z

1. Issue or feature description

Nvidia-Docker stopped working.
I had a jupyterhub running with nvidia-docker supported and it worked quite well.
Today I logged into the host system and ran sudo apt-get update/upgrade, and somehow, suddenly Nvidia-Docker does not work anymore. That said I can't recall if the upgrade actually did something so that might not be the root of the issue.
System runs debian.

2. Steps to reproduce the issue

sudo docker run --rm nvidia/cuda:8.0-devel nvidia-smi

docker: Error response from daemon: OCI runtime create failed: container_linux.go:296: starting container process caused "process_linux.go:398: container init caused \"process_linux.go:381: running prestart hook 1 caused \\\"error running hook: exit status 1, stdout: , stderr: exec command: [/usr/bin/nvidia-container-cli --load-kmods configure --ldconfig=@/sbin/ldconfig.real --device=all --compute --utility --require=cuda>=8.0 --pid=25807 /var/lib/docker/overlay2/8127e7486398ec495fc98de2cee1f18e769ee97f43211ccbc455a058d3b3923a/merged]\\\\nnvidia-container-cli: ldcache error: open failed: /sbin/ldconfig.real: no such file or directory\\\\n\\\"\"": unknown.

3. Information to attach (optional if deemed irrelevant)

$uname -a Linux donna 4.9.0-5-amd64 #1 SMP Debian 4.9.65-3+deb9u2 (2018-01-04) x86_64 GNU/Linux

 $ docker version
Client:
 Version:	17.12.0-ce
 API version:	1.35
 Go version:	go1.9.2
 Git commit:	c97c6d6
 Built:	Wed Dec 27 20:11:19 2017
 OS/Arch:	linux/amd64

Server:
 Engine:
  Version:	17.12.0-ce
  API version:	1.35 (minimum version 1.12)
  Go version:	go1.9.2
  Git commit:	c97c6d6
  Built:	Wed Dec 27 20:09:54 2017
  OS/Arch:	linux/amd64
  Experimental:	false

$ nvidia-smi
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 375.82                 Driver Version: 375.82                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 108...  Off  | 0000:41:00.0     Off |                  N/A |
|  0%   23C    P0    55W / 250W |      0MiB / 11170MiB |      3%      Default |
+-------------------------------+----------------------+----------------------+
                                                                               
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID  Type  Process name                               Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+

$ nvidia-container-cli -V
version: 1.0.0
build date: 2018-01-11T00:29+00:00
build revision: 4a618459e8ba522d834bb2b4c665847fae8ce0ad
build compiler: x86_64-linux-gnu-gcc-6 6.3.0 20170516
build flags: -D_GNU_SOURCE -D_FORTIFY_SOURCE=2 -DNDEBUG -std=gnu11 -O2 -g -fdata-sections -ffunction-sections -fstack-protector -fno-strict-aliasing -fvisibility=hidden -Wall -Wextra -Wcast-align -Wpointer-arith -Wmissing-prototypes -Wnonnull -Wwrite-strings -Wlogical-op -Wformat=2 -Wmissing-format-attribute -Winit-self -Wshadow -Wstrict-prototypes -Wunreachable-code -Wconversion -Wsign-conversion -Wno-unknown-warning-option -Wno-format-extra-args -Wno-gnu-alignof-expression -Wl,-zrelro -Wl,-znow -Wl,-zdefs -Wl,--gc-sections

The text was updated successfully, but these errors were encountered:

sconeyard · 2018-01-22T02:09:39Z

Sorry for causing the trouble, it seems that I had the wrong sources list installed. To everyone running Debian and having this issue: Make sure you get your stuff from here: https://nvidia.github.io/nvidia-docker/

flx42 · 2018-01-22T02:33:37Z

You were probably using the Ubuntu packages instead of the Debian ones.

khallaghi · 2018-09-04T13:39:28Z

Is it possible to have other causation?
I have exactly the same issue on the same platform(debian stretch) but I installed from the right repository.

protopyte · 2018-09-24T14:20:36Z

@khallaghi I believe so. I first got hit by #677, then this one.
This is however not a Debian stretch, but a mix of testing and unstable.

My workaround was to symlink /sbin/ldconfig to /sbin/ldconfig.real

undcloud · 2020-01-02T08:36:31Z

met the same problem,thanks @sleveque
sudo docker run --gpus all nvidia/cuda:9.0-base nvidia-smi

docker: Error response from daemon: OCI runtime create failed: container_linux.go:346: starting container process caused "process_linux.go:449: container init caused "process_linux.go:432: running prestart hook 0 caused \"error running hook: exit status 1, stdout: , stderr: nvidia-container-cli: ldcache error: open failed: /sbin/ldconfig.real: no such file or directory\\n\""": unknown.
ERRO[0000] error waiting for container: context canceled
Solution：
ln -s /sbin/ldconfig /sbin/ldconfig.real

davidshen84 · 2021-04-04T03:32:51Z

Hi,

Sorry to post on a closed ticket. Could someone help me understand why we need to create this symlink? It seems neither nvidia nor glibc intended to create this link. But it is consumed in the application. Is it some legacy naming issue?

Thanks.

klueska · 2021-04-06T12:02:17Z

It just depends on what the real binary (not any wrapper shell script is on your host). You don't have to create a symlink, you can also change the path to it in /etc/nvidia-container-runtime/config.toml

shahriar8866 · 2023-11-30T19:21:35Z

I have a google compute vm with Debian 11.
for install GPU Tesla/T4 and activate gpu for microk8s node follow these steps:
1- make sure remove all nvidia and cuda:

sudo apt-get remove --purge '^nvidia-.*'
sudo apt-get remove --purge '^libnvidia-.*'
sudo apt-get remove --purge '^cuda-.*'
sudo apt autoremove
sudo apt autoclean
2- sudo apt-get install linux-headers-$(uname -r)
3- Make sure you have python3 installed on your VM.
4- Download gpu installation python script:
curl https://raw.githubusercontent.com/GoogleCloudPlatform/compute-gpu-installation/main/linux/install_gpu_driver.py --output install_gpu_driver.py
5- sudo python3 install_gpu_driver.py
6- Test gpu by nvidia-smi

7- enable gpu on microk8s by microk8s enable gpu
8- Test GPU on microk8s by microk8s kubectl run gpu-test --rm -t -i --restart=Never --image=nvcr.io/nvidia/cuda:10.1-base-ubuntu18.04 nvidia-smi
if the error nvidia-container-cli.real: ldcache error: open failed: /sbin/ldconfig.real: no such file or directory: unknown happen, this a work around:
sudo cp -r /sbin/ldconfig /sbin/ldconfig.real
now try microk8s kubectl run gpu-test --rm -t -i --restart=Never --image=nvcr.io/nvidia/cuda:10.1-base-ubuntu18.04 nvidia-smi

flx42 closed this as completed Jan 22, 2018

erikbeebe mentioned this issue Aug 21, 2020

Nothing on window mdouchement/docker-zoom-us#30

Closed

samos123 mentioned this issue Aug 22, 2023

Add GPU support kubernetes-sigs/kind#3257

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Error response from daemon: OCI runtime create failed #614

Error response from daemon: OCI runtime create failed #614

sconeyard commented Jan 22, 2018 •

edited

sconeyard commented Jan 22, 2018

flx42 commented Jan 22, 2018

khallaghi commented Sep 4, 2018

protopyte commented Sep 24, 2018

undcloud commented Jan 2, 2020

davidshen84 commented Apr 4, 2021

klueska commented Apr 6, 2021

shahriar8866 commented Nov 30, 2023

Error response from daemon: OCI runtime create failed #614

Error response from daemon: OCI runtime create failed #614

Comments

sconeyard commented Jan 22, 2018 • edited

1. Issue or feature description

2. Steps to reproduce the issue

3. Information to attach (optional if deemed irrelevant)

sconeyard commented Jan 22, 2018

flx42 commented Jan 22, 2018

khallaghi commented Sep 4, 2018

protopyte commented Sep 24, 2018

undcloud commented Jan 2, 2020

davidshen84 commented Apr 4, 2021

klueska commented Apr 6, 2021

shahriar8866 commented Nov 30, 2023

sconeyard commented Jan 22, 2018 •

edited