-
Notifications
You must be signed in to change notification settings - Fork 39.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Record a volume processing failure event for pod while volume mgr start to process volumes. #58272
Comments
This is because by this fixed interval that DSW populator will loop all the pods and check the unprocessed volumes. In terms of log, I think it should be ok, as log should record the actual tracking of the code execution, but for event, I'm thinking it is too often as the event should report only 1 msg to people. |
I agree with 1 msg to people.
Well, the DSW populator checks the unprocessed volumes several times per second. Would it be better to check them less frequently? An error that may occur can be basically permanent, e.g. in case PVC Protection is enabled and a pod is using PVC that is being deleted. So the log may be flooded with error messages for volumes that weren't processed successfully. |
Issues go stale after 90d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Stale issues rot after 30d of inactivity. If this issue is safe to close now please do so with Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Rotten issues close after 30d of inactivity. Send feedback to sig-testing, kubernetes/test-infra and/or fejta. |
Happened to see this issue when using FlexVolume.
If we configured pod with a wrong name of FlexVolume, the volume manager will only report a time out event with no error details like the following:
So user needs to check the kubelet log and see a lot of errors like:
At present, this single time out event is not good enough for user to detect what had happened:
User have to wait until volume attach timeout so that he/she can know there is an error with the volume.
Time out event does not show any details of the volume attach errors
Need to record volume processing events like “no match volume plugin” when volume manager reports the error.
/sig storage
/assign
The text was updated successfully, but these errors were encountered: