Skip to content
This repository has been archived by the owner on May 12, 2021. It is now read-only.

How to run kata gpu containers with k8s cluster #764

Closed
lizhixin3016 opened this issue Nov 24, 2020 · 4 comments
Closed

How to run kata gpu containers with k8s cluster #764

lizhixin3016 opened this issue Nov 24, 2020 · 4 comments
Labels
enhancement Improvement to an existing feature needs-review Needs to be assessed by the team.

Comments

@lizhixin3016
Copy link

To run gpu containers with kata-runtime, we need to enable vfio driver for gpu devices,
but the k8s-device-plugin need to install nvidia driver on k8s node to detect the gpu
devices, causing we can not passthrough gpu devices by vfio-pci driver into the kata
containers.

I didn't find any introduction or document about this, anyone knows how to deal with it?

@lizhixin3016 lizhixin3016 added enhancement Improvement to an existing feature needs-review Needs to be assessed by the team. labels Nov 24, 2020
@jodh-intel jodh-intel added this to To do in Issue backlog Nov 24, 2020
@Jimmy-Xu
Copy link
Contributor

@lizhixin3016
Have you tried this device plugin https://github.com/NVIDIA/kubevirt-gpu-device-plugin?
It does not need to install nvidia driver on k8s node, and you can passthrough the VFIO device of Nvidia to the kata container.
You can install and load the Nvidia driver into the guest kernel rootfs image or kata container.

@lizhixin3016
Copy link
Author

@Jimmy-Xu
This plugin looks useful to me, I will try deploy and run it with kata in k8s cluster.

Thank you very much!

@liubin
Copy link
Member

liubin commented Dec 11, 2020

@lizhixin3016 , if Jimmy's comment helped you, consider closing this issue?

@ariel-adam
Copy link
Contributor

Closing this issue.

Issue backlog automation moved this from To do to Done Feb 9, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement Improvement to an existing feature needs-review Needs to be assessed by the team.
Projects
Issue backlog
  
Done
Development

No branches or pull requests

4 participants