Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Monitor / include progress on CDI based nvidia container setup #813

Closed
dtrudg opened this issue May 18, 2022 · 5 comments
Closed

Monitor / include progress on CDI based nvidia container setup #813

dtrudg opened this issue May 18, 2022 · 5 comments
Labels
enhancement New feature or request roadmap Features / changes that are scheduled to be implemented

Comments

@dtrudg
Copy link
Member

dtrudg commented May 18, 2022

The nvidia-container-toolkit mediated setup of GPU containers for CUDA and other work will be transitioning toware an approach that leverages the CDI (container device interface) standard.

https://github.com/container-orchestrated-devices/container-device-interface

Monitor this work and consider adoption.

@dtrudg dtrudg added the enhancement New feature or request label May 18, 2022
@dtrudg dtrudg added this to the SingularityCE 3.11 milestone May 18, 2022
@dtrudg dtrudg added the roadmap Features / changes that are scheduled to be implemented label May 18, 2022
@dtrudg
Copy link
Member Author

dtrudg commented Nov 16, 2022

Continuing to monitor this. No action is necessary for 3.11 at this time.

@elezar
Copy link
Contributor

elezar commented Feb 20, 2023

@dtrudg with the NVIDIA Container Toolkit v1.12.0 release we have included tooling to generate CDI specifications. We are also updating our documentation to recommend this as the way to access NVIDIA devices when using Podman which offers CDI support as of v4.1.0.

What is required to get CDI as the recommended way to inject NVIDIA devices when using OCI mode?

@dtrudg
Copy link
Member Author

dtrudg commented Feb 20, 2023

Hi @elezar - thanks for noting this issue and getting in touch!

We had postponed work on CDI until such time as there was tooling generating a CDI specification, as your v1.12.0 does. Consequently, there's nothing in SingularityCE at present that is handling CDI specifications... but as the issue notes, it is a roadmap item for the 4.0 release.

To integrate CDI support in SingularityCE's --oci mode shouldn't be too difficult. We have launcher code that constructs an OCI runtime config for the container that is executed. I don't believe it'll be all that much different from implementation in podman.

Per out-of-band comms by email, it'd be great to have a chat about this, and we can sort out when the work will be able to take place. Thanks!

@ArangoGutierrez
Copy link
Contributor

++

@dtrudg
Copy link
Member Author

dtrudg commented Mar 1, 2023

Going to close this in favor of #1394 and #1395 ... now we are moving forward with CDI in OCI mode, and wish to consider it somewhat separately for the singularity native runtime.

Thanks for your input @elezar et al.

@dtrudg dtrudg closed this as completed Mar 1, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request roadmap Features / changes that are scheduled to be implemented
Projects
None yet
Development

No branches or pull requests

3 participants