-
Notifications
You must be signed in to change notification settings - Fork 218
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Overriding nvidia-container-runtime/config.toml with XDG_CONFIG_HOME #56
Comments
The only way I can get this to work is to specify a custom [omnicoder@goblin bin]$ cat nvidia-container-cli-wrapper
#!/bin/sh
if [ -n "$HOME" ] # HOME is not set for dockerd as system service
then
set -- nvidia-container-cli "$@" --no-cgroups
else
set -- nvidia-container-cli "$@"
fi
echo "$@" >> /tmp/nvidia-container-cli-wrapper.log
exec "$@"
[omnicoder@goblin bin]$ |
Hi @hholst80 looking at the code for handling the
That is to say the environment variable represents the We use this envvar in our GPU Operator to apply a specific config file and as such it should work as expected. Note that using CDI to request NVIDIA Devices instead of the traditional injection mechanisms would support your use case (both rootfull and rootless) out of the box since the components of the NVIDIA container stack would no longer be responsible for setting up cgroups for device access in the container -- instead relying on the low-level runtime such as We are in the process of adding support to the Docker daemon (moby/moby#45134) as well as the Docker CLI (docker/cli#4084) and these should be available as an experimental feature soon. If you want to try this out for your usecase beforehand, the An alternative would be to use |
Per documentation:
Where is this supposed to be injected? I supposed that
nvidia-container-runtime
itself reads this file upon being spawned by dockerd. But it does not seem to honor its promise. I copied /etc/nvidia-container-runtime to /root/.config and changedno-cgroups = true
. This does not work for root and it will fail if I have the file /etc/nvidia-container-runtime/config.toml and make the changes there. However, nvidia-container-runtime will gladly ignore my /root/.config file and use the default (no-groups = false).The text was updated successfully, but these errors were encountered: