Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

With Tigera operator, applicative pod lost network after windows nodes reboot #3193

Closed
amapi opened this issue Feb 23, 2024 · 2 comments
Closed

Comments

@amapi
Copy link

amapi commented Feb 23, 2024

We have installed Calico with Tigera operator with WINDOWS nodes

  • Operator version: 1.32.5
  • Calico version: 3.27.2
  • k8s: 1.27.6

Calico is running ok.

When we reboot a windows node. Calico pods start AFTER applicative pod.s

Result:

Applicative pods sandbox do not have any network. Pod loop in restart. Take care, POD loop in restart, but NOT the sandbox. So, until you delete the pod (and so, delete the sandbox), the pod is stuck in restart loop.

When we delete the pod (and the sandbox), as calico is already started this time, pod(and sandbox) si recreated with network this time.

We do not have this issue when calico was a windows service as calico service start before applicative pod.

Does someone have any idea.

Please: do not ask for usage of Readiness et Liveness Probes. We could not ask project hosted on the cluster to modify their deployment to add Readiness et Liveness Probes depending of calico node.

@tmjd
Copy link
Member

tmjd commented Mar 12, 2024

@coutinhop could you take a look at this issue?

@coutinhop
Copy link
Member

Just to close on this, I don't think there's much that can be done on the Calico (or Tigera operator) side, as this issue happens due to the way Windows handles the containers. There's no way to guarantee Calico pods start up before application pods, and if one does start up before Calico it won't have networking until it is restarted (after Calico is set up and running)...

@coutinhop coutinhop closed this as not planned Won't fix, can't repro, duplicate, stale Jul 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants