gpu-operator fails to start due to deletion of nonexistent resources #484
Comments
@shivamerla I tested this with the latest release (22.9.2). Is that codepath also for deleting resources, or only for adding them? I'm asking because I didn't receive any other panics around creating resources. I'm happy to make a stack trace or test a patch if it helps. Anecdotally, the operator appears to choose OpenShift codepaths automatically even when setting |
(I closed the issue, then reopened it after realizing that the problem still persists.)
@xknight somehow the k8s version check seems to be failing in your case, so we end up adding the PSP manifests for creation/deletion. Can you double-check whether you see this message in the operator logs?
@shivamerla no, that's not in the log.
The log reads:
This is the version provided by OKD 4.12.0-0.okd-2023-01-21-055900; there is one newer version available now, but I doubt that the "structure" of the semantic version will be different.
@xknight that explains it, we need to fix the constraint as |
1. Quick Debug Checklist
- i2c_core and ipmi_msghandler loaded on the nodes?
- kubectl describe clusterpolicies --all-namespaces
1. Issue or feature description
Cleanup of PSP resources fails in k8s 1.25.4 / OKD 4.12 (tested with gpu-operator 22.9.2):
It seems the code here should treat a nonexistent resource definition as "not found" and pass the condition, but it fails instead.
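To illustrate the behavior suggested above, here is a minimal, self-contained sketch of a cleanup step that treats "already absent" outcomes as success. It is not the operator's actual code: `cleanup`, `isIgnorableDeleteError`, and the three sentinel errors are hypothetical names used only for this example, and the sketch assumes the failure surfaces as an error that can be classified. In a real controller, the equivalent checks would likely be `apierrors.IsNotFound` (from `k8s.io/apimachinery/pkg/api/errors`) and `meta.IsNoMatchError` (from `k8s.io/apimachinery/pkg/api/meta`).

```go
package main

import (
	"errors"
	"fmt"
)

// Hypothetical stand-ins for errors a Kubernetes client might return.
var (
	errNotFound    = errors.New("resource not found")  // the object does not exist
	errNoKindMatch = errors.New("no matches for kind") // the resource type itself is not served
	errForbidden   = errors.New("forbidden")           // a genuine failure
)

// isIgnorableDeleteError reports whether a delete failure only means that the
// object, or its whole resource definition, is already absent.
func isIgnorableDeleteError(err error) bool {
	return errors.Is(err, errNotFound) || errors.Is(err, errNoKindMatch)
}

// cleanup calls del and treats "already gone" outcomes as success.
// Any other error is wrapped and returned to the caller.
func cleanup(name string, del func(string) error) error {
	if err := del(name); err != nil && !isIgnorableDeleteError(err) {
		return fmt.Errorf("deleting %s: %w", name, err)
	}
	return nil
}

func main() {
	// Simulates deleting a PSP object on a cluster that no longer serves the kind.
	missingKind := func(string) error { return fmt.Errorf("delete failed: %w", errNoKindMatch) }
	// Simulates a real failure that must still be reported.
	denied := func(string) error { return errForbidden }

	fmt.Println(cleanup("example-psp", missingKind) == nil)
	fmt.Println(cleanup("example-psp", denied) != nil)
}
```

The design choice here is to classify errors in one place, so that a missing object and a missing resource definition both let the cleanup continue while any other failure is still reported.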
2. Steps to reproduce the issue
- gpu-operator in an OKD 4.12 cluster
- gpu-operator pod
3. Other information
Commenting out the code block above allows the operator to start normally.