Need to support rolling update in the ingress controller. #75

Closed
emilverwoerd opened this issue Nov 26, 2018 · 17 comments
Labels
feature New feature or request

Comments

@emilverwoerd

Describe the bug
When performing a rolling update of any kind of service you want the site or service to stay online. But during an update a 502 Bad Gateway is returned. The problem occurs due to the fact that Application Gateway is using the internal IP addresses of the nodes in the backend pool instead of the Cluster IP of the specified service.

So what happens is that Kubernetes spins up different nodes with new IP addresses, depending on the replica count, and the original IP addresses that are used by the Application Gateway are removed. A couple of minutes later the backend pool is updated with the new IP addresses. But we want the Cluster IP address to be used in the backend pool so Kubernetes can perform correct load balancing.

To Reproduce
Redeploy a service and check if it is online

@asridharan
Contributor

@emilverwoerd are you using the latest ingress controller helm chart? There was a bug in AKS because of which the ingress controller would stop receiving pod update events for minutes at a time, resulting in a delay in updating the backend pools in the application gateway.

To comment on your solution, using the cluster IP instead of the pod IP for the backend IP address is a very bad idea. The cluster IP is a VIP (virtual IP address) that is used for layer 4 (TCP) load balancing. Adding the cluster IP as the backend to the application gateway instead of the actual pod IPs would break session affinity if it is enabled in the application gateway. The correct solution is to observe the deployments associated with a service, not just the endpoints, and update the backend pools accordingly.
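
To make the distinction concrete, here is a hypothetical Service and its Endpoints object (names and addresses are made up): the ClusterIP is a virtual IP load-balanced by kube-proxy, while the Endpoints object lists the actual pod IPs, and it is the pod IPs that end up in the application gateway backend pool.

apiVersion: v1
kind: Service
metadata:
  name: my-service            # hypothetical service name
spec:
  clusterIP: 10.0.113.25      # layer 4 VIP handled by kube-proxy
  selector:
    app: my-app
  ports:
  - port: 80
---
apiVersion: v1
kind: Endpoints
metadata:
  name: my-service
subsets:
- addresses:
  - ip: 10.240.0.15           # actual pod IPs; these change on every rolling update
  - ip: 10.240.0.23
  ports:
  - port: 80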

Meanwhile, if you haven't updated to the latest helm chart, could you kindly update to the latest and retry the rolling update to observe the behavior?

@asridharan
Contributor

This is a classic case of supporting blue-green deployments. We should try and support this if it is not already working.

@asridharan asridharan added the feature New feature or request label Nov 26, 2018
@emilverwoerd
Author

emilverwoerd commented Nov 26, 2018

@emilverwoerd are you using the latest ingress controller helm chart? There was a bug in AKS because of which the ingress controller would stop receiving pod update events for minutes at a time, resulting in a delay in updating the backend pools in the application gateway.

To comment on your solution, using the cluster IP instead of the pod IP for the backend IP address is a very bad idea. The cluster IP is a VIP (virtual IP address) that is used for layer 4 (TCP) load balancing. Adding the cluster IP as the backend to the application gateway instead of the actual pod IPs would break session affinity if it is enabled in the application gateway. The correct solution is to observe the deployments associated with a service, not just the endpoints, and update the backend pools accordingly.

Meanwhile, if you haven't updated to the latest helm chart, could you kindly update to the latest and retry the rolling update to observe the behavior?

We are currently using version 0.1.4 with the latest helm chart, so that should be the latest. What we are experiencing is that the site is temporarily unavailable because the old pods are terminated and the new pods are running, but it takes some time for Application Gateway to update the backend pools. So for a brief amount of time Application Gateway is still configured with the IP addresses of the old containers, which are not running anymore; when Kubernetes is done upgrading the containers, Application Gateway is not ready yet, and this can take a couple of moments.

So the old containers should only be terminated once Azure Application Gateway is done updating the backend pools.

@asridharan
Contributor

@emilverwoerd just checked the commits, and the fix for the AKS event subscription was already present in ingress controller helm chart 0.1.4, so you should already have that fix:
#66

So I think the issue will still persist for you even after upgrading to 0.1.5. Could you kindly share your deployment spec here (the relevant parts such as "rolling update strategy", readiness probes and any preStop life cycle hooks you might have added)? Please read below for possible solutions that others have tried.

I dug around a bit as to how other ingress controllers deal with zero downtime. I found these two blog posts:
https://blog.sebastian-daschner.com/entries/zero-downtime-updates-kubernetes
and
https://techblog.topdesk.com/continuous-integration/rolling-updates-kubernetes/

The problem you are facing seems to be a common issue with other ingress controllers as well (nginx controllers are the ones cited in the articles above). If you follow the articles, to achieve zero downtime upgrades "with ingress" there are two components that are required in your deployment spec. The first is the readinessProbe and the second is the preStop check in the container lifeCycle spec. The readinessProbe will allow Kubernetes to know when your pods are ready (assuming your pods handle SIGTERM well), and the preStop will give a grace period for the ingress controller to update the application gateway once the Endpoints object is updated.

NOTE: An update from the ingress controller to the application gateway takes at least 40 seconds to take effect, so setting a sleep of 40 seconds in the preStop stage should help reduce your outages during upgrades.
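
For reference, a minimal sketch of those two pieces in a deployment spec (the probe path, port, timings, and names here are assumptions; adjust them to your application):

containers:
- name: my-app                          # hypothetical container name
  image: myregistry/my-app:latest       # hypothetical image
  readinessProbe:                       # lets Kubernetes know when a new pod can receive traffic
    httpGet:
      path: /healthz                    # assumed health endpoint
      port: 80
    initialDelaySeconds: 5
    periodSeconds: 10
  lifecycle:
    preStop:                            # delays SIGTERM so the application gateway can catch up
      exec:
        command: ["sleep", "40"]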

@emilverwoerd
Author

Okay, I will add my deployment spec here, but I don't understand how you can perform a rolling update correctly if AG isn't done updating before the pods are recreated. That was also the reason I thought it would be better to use the Cluster IP of the service, since it isn't changing and the IPs of the pods are. But I also understand that AG isn't part of the Kubernetes platform, so that is not possible. But then, to perform a correct update, you should wait with container termination until AG is ready. I will try the preStop as a workaround and will let you know if it works.

service.txt

@asridharan
Contributor

@emilverwoerd the AG would be updated only after the new pods are created, since Kubernetes will update the Endpoints object for a service only after the new pods have been created. The problem here is not that we are not updating the AG with the new pods; the problem is that there is a delay between the point when we update the backend pool in the AG with the new pods and the time when the update actually takes effect (~40 sec). During this time, if the old pods are taken down by Kubernetes while the backend pool is not yet updated, it will lead to an intermittent outage as you are seeing. This is a problem with any ingress controller, nothing specific to AG.

The preStop hook tells Kubernetes to execute a command before sending the SIGTERM to the old pod. Adding a sleep in this preStop will tell Kubernetes to wait for x seconds before sending the SIGTERM, allowing the update in the application gateway to go through before the old pods are taken down. This hopefully will make the upgrades smoother.

Hope that explains the proposed solution?

@vramakrishnan
Contributor

vramakrishnan commented Nov 28, 2018

@emilverwoerd Can you please share your findings with this config in your deployment spec?
We will work on reducing the 40 sec update time subsequently.

lifecycle:
  preStop:
    exec:
      command: ["sleep", "40"]

Also, can you please share your config for these settings in your deployment spec?

rollingUpdate:
    maxSurge: 1
    maxUnavailable: 1

This article also has good insights.
https://freecontent.manning.com/handling-client-requests-properly-with-kubernetes/

@emilverwoerd
Author

@emilverwoerd Can you please share your findings with this config in your deployment spec?
We will work on reducing the 40 sec update time subsequently.

lifecycle:
  preStop:
    exec:
      command: ["sleep", "40"]

Also, can you please share your config for these settings in your deployment spec?

rollingUpdate:
    maxSurge: 1
    maxUnavailable: 1

This article also has good insights.
https://freecontent.manning.com/handling-client-requests-properly-with-kubernetes/

I tried to add the sleep command, but when doing that it gives issues with the readinessProbe and the pod is terminated. Also, updating the backend pool still takes quite some time, so it has no different effect than without the sleep command.

We use the following spec for our rollingUpdate:

rollingUpdate:
    maxSurge: 1
    maxUnavailable: 0

@asridharan
Contributor

asridharan commented Nov 30, 2018

@emilverwoerd could you provide your subscription ID? I want to make sure you are using Application Gateway v2 and not v1. Did you create the AKS cluster and application gateway through the templates?

We will try reproducing this problem at our end as well, but without the readiness probe Kubernetes wouldn't know when to update the Endpoints object with the new pods, so the whole rolling update process would be flaky. So I would think we need to get the readiness probes working for rolling updates.

@emilverwoerd
Author

@asridharan we are on the following subscription '282d71e4-f66b-4e8f-8e49-4faea8667362' but we are using Gateway v2. And we created the cluster through our ARM templates so I could send you those templates if you want.

thx in advance for checking it out

@PandaXass

@emilverwoerd @asridharan I think the terminationGracePeriodSeconds needs to be specified in the deployment yaml file to make sleeping 40s work, since by default K8s will send the TERM signal anyway after 30s.
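
For example, a sketch combining the two settings (the 60 second value is an assumption; anything comfortably larger than the preStop sleep should work):

spec:
  terminationGracePeriodSeconds: 60     # default is 30s, so raise it above the preStop sleep
  containers:
  - name: my-app                        # hypothetical container name
    lifecycle:
      preStop:
        exec:
          command: ["sleep", "40"]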

@kernelv5

Our environment is connected to Azure by VPN. My nginx-ingress external IP is in the same range as my local network. When I hit nginx-ingress I can easily access my application, but through Application Gateway I am getting a 502. I tested Application Gateway to the nginx-ingress IP with Network Troubleshoot from Azure, and that also works fine. Just curious to know: in order to route traffic from Azure Application Gateway to the ingress, is application-gateway-kubernetes-ingress mandatory, or can I go with nginx-ingress as well?

@asridharan asridharan changed the title Backend pool uses IP-address of node instead of Cluster IP address Need to support rolling update in the ingress controller. Apr 22, 2019
@asridharan
Contributor

@kernelv5 sorry for the late response, but one thing you want to check is whether the AG subnet is able to route to the subnets that your pods are connected to. If AG is not able to connect to your pods then you might be getting a 502 error.

@Baklap4

Baklap4 commented Nov 15, 2019

I think this is currently still an issue. I 'fixed' it somehow by using the preStop hook in combination with terminationGracePeriodSeconds.

The sleep I have is around 45 seconds and the termination grace period around 90 seconds, which seems to work for our case.

Would be nice if this gets implemented...

@hugoderene

hugoderene commented Nov 25, 2019

We are dealing with the same issues. The workaround suggested by @Baklap4 functions, but is far from ideal.

The ingress controller pod (image: mcr.microsoft.com/azure-application-gateway/kubernetes-ingress:tag) is initiating a reconfiguration of the appgw as soon as any of the ‘connected’ resources changes (= expected behaviour). This reconfiguration process is first fully completed, before a new reconfiguration process is initiated.

During a rolling redeployment several changes (stopping pods, creating pods, deleting pods) are happening relatively shortly after each other. A redeployment therefore causes a discrepancy between the configuration of the appgw and the actual situation in the cluster. This discrepancy is resulting in 502s and/or 503s which is not very ‘rolling’.

An example to illustrate:

  • The first step of a redeployment procedure is to stop one or multiple existing pod(s).

  • Immediately after this first step, a reconfiguration process is initiated by the ingress controller pod. This process takes about 45-60 seconds.

  • During these 45-60 seconds other existing pods might be stopped and new pods might be created. The addresses of the pods that have been stopped during the ‘second wave’ are not accessible anymore. However they are still being served via the application gateway.

  • Only when the first reconfiguration process has completed and a second reconfiguration process is initiated will the unavailable addresses be updated in the appgw.

Is there a possibility to somehow supersede the reconfiguration process initiated by the ingress controller pod as soon as there is another change happening within the connected resources in the cluster?

@akshaysngupta
Member

Closing this issue.
Please follow this document to reduce the 502s during rolling updates.

@Baklap4

Baklap4 commented Feb 24, 2020

@akshaysngupta Where are the tracking issues for the long-term solution? Making the backend pool pick up changes faster?
