CI: ConformanceGKE: Timeout on cilium-test deletion #22368
Comments
This should help to work around an issue we're seeing in cilium/cilium CI where deleting the cilium-test namespace times out, see cilium/cilium#22368 Suggested-by: Paul Chaignon <paul@cilium.io> Signed-off-by: Tobias Klauser <tobias@cilium.io>
cilium/cilium-cli#1237 should fix/work around this in cilium-cli. Once this is merged and a new cilium-cli version is released, bumping
Fixed by #22441
It seems cilium/cilium-cli#1237 wasn't enough to fix it. It's still happening. E.g.:
Looking at a recent failure on
Looking at the
The first condition looks suspicious:
We seem to have hit this before, e.g. in cilium/cilium-cli#255 (comment). Googling for the
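For anyone picking this up: a stuck namespace reports its deletion progress under `status.conditions`, so a small script can list the conditions that are currently set. Below is a minimal sketch, assuming the JSON comes from `kubectl get namespace cilium-test -o json`. The helper name `blocking_conditions` and the sample data are hypothetical and are not taken from the failing CI runs.

```python
import json


def blocking_conditions(namespace_json: str) -> list[tuple[str, str]]:
    """Return (type, message) for each namespace condition whose status is "True"."""
    ns = json.loads(namespace_json)
    conditions = ns.get("status", {}).get("conditions", [])
    return [
        (c.get("type", ""), c.get("message", ""))
        for c in conditions
        if c.get("status") == "True"
    ]


# Made-up sample for illustration only; NOT copied from an actual sysdump.
SAMPLE = json.dumps(
    {
        "kind": "Namespace",
        "metadata": {"name": "cilium-test"},
        "status": {
            "phase": "Terminating",
            "conditions": [
                {
                    "type": "NamespaceDeletionDiscoveryFailure",
                    "status": "True",
                    "message": "made-up message for illustration",
                },
                {
                    "type": "NamespaceContentRemaining",
                    "status": "False",
                    "message": "",
                },
            ],
        },
    }
)

if __name__ == "__main__":
    for cond_type, message in blocking_conditions(SAMPLE):
        print(f"{cond_type}: {message}")
```

In practice the input would be the namespace object from the sysdump or from a live cluster rather than `SAMPLE`.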
This looks oddly similar to the issues you get on EKS when you try to run with tunnel mode enabled. cc @bmcustodio
Signed-off-by: Tobias Klauser <tobias@cilium.io>
Should hopefully help to debug #22368 Signed-off-by: Tobias Klauser <tobias@cilium.io>
Signed-off-by: Tobias Klauser <tobias@cilium.io>
Another instance: https://github.com/cilium/cilium/actions/runs/4200699587/jobs/7286990604
@tklauser Are you still looking into this?
Currently lacking cycles and ideas on how to proceed, so I'm not actively looking into this. I'm going to unassign myself for now.
For whoever looks into this next, this is probably a good way to mitigate (without fixing):
We would also need to not block on the namespace deletion attempt. cc @brlbil |
This commit mitigates the workflow flake on GKE with tunnel installation until issue #22368 is fixed. For the test with tunnel, a test namespace is added, and the --wait option is removed from uninstall. Signed-off-by: Birol Bilgin <birol@cilium.io>
#24755 fixed this.
ConformanceGKE runs are failing on master with a 1h15min timeout on:
I don't know the root cause, but this can probably be worked around by deleting the pods in the namespace instead of the namespace itself (the sysdump shows that the namespace's pods are already gone) and by using a different test namespace for each run of the connectivity tests.
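The workaround could be sketched roughly as follows: derive a per-run namespace name and build a `kubectl` command that deletes only the pods. This is a sketch under assumptions, not what the CI workflow actually does. The helper names `namespace_for_run` and `delete_pods_command` are hypothetical, the run ID is only an example value, and `--wait=false` is an added assumption so that the cleanup step does not block.

```python
import re


def namespace_for_run(run_id: str, prefix: str = "cilium-test") -> str:
    """Build a DNS-label-safe namespace name that is unique per CI run."""
    # Namespace names must be lowercase alphanumerics or '-', at most 63 chars.
    suffix = re.sub(r"[^a-z0-9-]", "-", run_id.lower()).strip("-")
    name = f"{prefix}-{suffix}" if suffix else prefix
    return name[:63].rstrip("-")


def delete_pods_command(namespace: str) -> list[str]:
    """Return a kubectl invocation that removes the pods but keeps the namespace."""
    return [
        "kubectl", "delete", "pods", "--all",
        "--namespace", namespace,
        "--wait=false",  # assumption: return immediately instead of blocking
    ]


if __name__ == "__main__":
    ns = namespace_for_run("4200699587")  # example run ID, any unique value works
    print(ns)
    print(" ".join(delete_pods_command(ns)))
```

The generated name would then have to be passed to the connectivity tests, presumably via the cilium-cli option for selecting the test namespace. The commands are only built here, not executed, so the sketch can be tried without a cluster.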