-
Notifications
You must be signed in to change notification settings - Fork 661
NVidia cuda support #1637
Comments
+1 |
1 similar comment
+1 |
Working on the support of Nvidia cuda drivers implementation in Rancher OS.But now there is a problem that hardware devices cannot be identified in the os kernel,which leads to the fact that driver cannot be installed. The following are the contrast between RancherOS and Ubuntu16.04 Ubuntu16.04: |
Drivers identify the hardware by their PCI IDs; |
@vincent99 Yes, drivers identify hardware by PCI IDs.The error of installation caused by other reasons. Thank you very much. |
Tested with rancheros v1.4.0-rc1.
|
The nvidia-docker cannot work with our new kernel, we have asked for help to that community. |
Does this work now? |
We have fixed the kernel issue, I think we can add this nvidia-docker support on next release. |
Awesome. Do you know when the next release might be for RancherOS?
…On Aug 15 2018, at 8:56 pm, niusmallnan ***@***.***> wrote:
We have fixed the kernel issue, I think we can add this nvidia-docker support on next release.
—
You are receiving this because you commented.
Reply to this email directly, view it on GitHub (#1637 (comment)), or mute the thread (https://github.com/notifications/unsubscribe-auth/ABu0acPTYAmES5dv9H0-UlFyoB_owcRyks5uRMNXgaJpZM4MFled).
|
Hi there! Any update on this? |
It can work in the Ubuntu console, but we want to support it in the default console. |
@niusmallnan What is the current status for the nVidia CUDA integration? Is it possible to deploy it in some way? |
@niusmallnan At which point in time will the version 1.5.1 be released? |
@tech98321469320842 At the end of Feb. |
I want to integrate nvidia-docker2, currently mainly related to these projects.
They almost only provide deb and rpm packages, and it seems difficult to install from binary. So at this stage I don't plan to support it in the default console. I will give priority to supporting it in the ubuntu console. Usually we just need to add the apt source and install the corresponding deb, but there will be a problem in ROS. The nvidia-docker2 relies on docker deb files, ROS does not use the deb to manage docker.
So I can customize nvidia-docker2, just remove this dependency. Boot a vm(Ubuntu 18.04), and build the package after this patch
Just run Boot a ROS(v1.5.0) instance, and add nvidia-docker repo, but we need to use the ubuntu console
Install pakages
|
Tesla K80 installed lspci | grep NVIDIA Is there a special way of installing NVIDIA driver on RancherOS 1.5.0? docker run --runtime=nvidia --rm nvidia/cuda nvidia-smi reports an error: |
I got a similar error
although the error is different at the end EDIT: |
@niusmallnan So it's fixed now? |
I went through the steps outlined by @niusmallnan above and kept running into the following error:
After a little digging, I found that it's failing when trying to I'm not sure why it's failing there or how to fix it, but thought I'd share my findings in case it helps someone else narrow down the issue. |
@davidhyman did you get this to work on rancher OS using the patched package? |
Is this issue still being worked on and if so, any update on the status? |
+1 interested in the status of this issue |
3 similar comments
+1 interested in the status of this issue |
+1 interested in the status of this issue |
+1 interested in the status of this issue |
Looking through the rancher docs I found this page that talks about scheduling pods to nodes with gpus for what it's worth. |
+1 interested in the status of this issue |
1 similar comment
+1 interested in the status of this issue |
Could you please consider supporting NVidia cuda drivers implementation in Rancher OS?
NVidia is already providing docker support here https://github.com/NVIDIA/nvidia-docker
The text was updated successfully, but these errors were encountered: