Troubleshooting

Jonathan Calmels edited this page Nov 16, 2017 · 1 revision

Generating debugging logs

For most common issues, debugging logs can be generated and can help us root cause the problem.
In order to generate these:

  • Edit your runtime configuration under /etc/nvidia-container-runtime/config.toml and uncomment the debug=... line.
  • Run your container again, thus reproducing the issue and generating the logs.

Generating core dumps

In the event of a critical failure, core dumps can be automatically generated and can help us troubleshoot issues.
Refer to core(5) in order to generate these, in particular make sure that:

  • /proc/sys/kernel/core_pattern is correctly set and points somewhere with write access
  • ulimit -c is set to a sensible default

In case the nvidia-container-cli process becomes unresponsive, gcore(1) can also be used.

Sharing your debugging information

You can attach a particular output to your issue with a drag and drop into the comment section.

You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session.
Press h to open a hovercard with more details.