This repository has been archived by the owner on May 27, 2024. It is now read-only.

cannot generate nvidia.com/cuda.xxx labels on node #75

Closed
FLM210 opened this issue May 14, 2024 · 1 comment

FLM210 commented May 14, 2024

I have installed gpu-operator in my cluster, and all components appear to be running normally, but the nvidia.com/cuda.xxx labels are missing on one node.

image
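
To narrow down which nodes are affected, the node list can be scanned for the label prefix mentioned above. The following is only a minimal sketch: the function name `nodes_missing_cuda_labels` is hypothetical, and it assumes the input has the shape of `kubectl get nodes -o json` output. It reports every node without such a label, including nodes that have no GPU at all, so the result still needs to be compared against the nodes that are expected to carry GPUs.

```python
from typing import List

# Label prefix taken from the issue title; gpu-feature-discovery is expected
# to publish labels under it once it can query the driver.
CUDA_LABEL_PREFIX = "nvidia.com/cuda."


def nodes_missing_cuda_labels(node_list: dict) -> List[str]:
    """Return names of nodes that carry no label starting with the CUDA prefix.

    `node_list` is assumed to be the parsed JSON of a Kubernetes NodeList.
    """
    missing = []
    for node in node_list.get("items", []):
        metadata = node.get("metadata", {})
        # A node without any labels has "labels" absent or null.
        labels = metadata.get("labels") or {}
        if not any(key.startswith(CUDA_LABEL_PREFIX) for key in labels):
            missing.append(metadata.get("name", "<unnamed>"))
    return missing
```

One way to use it would be to save the output of `kubectl get nodes -o json` to a file, parse it with `json.load`, and pass the resulting dictionary to the function.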

FLM210 commented May 14, 2024

I have found the cause: gpu-feature-discovery started before the driver installation had completed, so it could not load the NVML library.
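
A quick way to confirm this kind of failure is to test whether the NVML shared library can be loaded at all in the environment where discovery runs. The sketch below is an illustration under assumptions, not part of gpu-feature-discovery: the helper name `nvml_loadable` is hypothetical, and `libnvidia-ml.so.1` is assumed to be the library name the dynamic loader should resolve on a Linux node with the driver installed.

```python
import ctypes


def nvml_loadable(lib_name: str = "libnvidia-ml.so.1") -> bool:
    """Return True if the dynamic loader can open the given shared library.

    The default name is an assumption about how the NVIDIA driver exposes
    NVML on Linux; pass a different name or full path if your setup differs.
    """
    try:
        # CDLL raises OSError when the library cannot be found or opened.
        ctypes.CDLL(lib_name)
    except OSError:
        return False
    return True
```

If this returns `False` on the affected node while the driver is still being installed, that is consistent with the ordering problem described above. What to do once the driver is ready (for example, whether the discovery pod needs a restart) is not stated in this thread.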

FLM210 closed this as completed May 14, 2024