-
Notifications
You must be signed in to change notification settings - Fork 38.8k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Retries for kubelet plugin registration should be at a lower layer #73371
Comments
Thanks for creating this. Background The Kubelet Plugin Registration Mechanism watches a directory on the kubelet host for new socket files. When a new socket file is discovered, the plugin mechanism will attempt to connect to the socket and probe it using the protocol defined here. If this registration step fails, the kubelet never attempts to register the socket again, unless the socket for the driver is deleted & recreated to trigger a new registration. Problem If the registration error was transient, the registration is not retried and the plugin is simply not registered. Requirements
Suggested Solution Implement a controller that manages registration/unregistration, similar to the volumemanager.
CC @msau42 as FYI. /sig node |
@saad-ali I can take up this if none has started already. |
@vikaschoudhary16 I was planning to work on this issue this weekend. @saad-ali what's the priority for this issue? If this is urgent, @vikaschoudhary16 you should probably work on this instead |
@taragu go ahead please. I can help with reviews :) |
@vikaschoudhary16 sounds good. Thanks!! 😄 |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
/remove-lifecycle stale |
/milestone v1.15 |
What would you like to be added:
The kubelet registration mechanism should be more robust (e.g. have a full controller that maintains desired/actual and reconciles that state periodically). The CSI layer is not right layer to implement retries. Retries should not be implemented by every consumer.
Why is this needed: For a more robust kubelet registration mechanism
Related issue: #71487
/cc @saad-ali
/sig storage
/assign
The text was updated successfully, but these errors were encountered: