Bug 1982766: [on-prem] Make ingress VIP more tolerant to failures
We've noticed that the ingress VIP can flap in some environments where
the keepalived check intermittently fails due to a timeout. Make the
ingress checks more resilient to transient errors by using `fall` and
`rise` to mitigate flapping, as we did for the API checks.
mandre committed Jul 19, 2021
1 parent 182cb2e commit 881f3b0
Showing 2 changed files with 8 additions and 0 deletions.
@@ -41,12 +41,16 @@ contents:
script "/usr/bin/timeout 0.9 /usr/bin/curl -o /dev/null -Lfs http://localhost:1936/healthz/ready"
interval 1
weight 20
rise 3
fall 2
}
vrrp_script chk_default_ingress {
script "/usr/bin/timeout 4.9 /host/bin/oc --kubeconfig /var/lib/kubelet/kubeconfig get ep -n openshift-ingress router-internal-default -o yaml  | grep 'ip:' | grep {{`{{.NonVirtualIP}}`}} "
interval 5
weight 50
rise 3
fall 2
}
{{`{{$nonVirtualIP := .NonVirtualIP}}`}}
@@ -14,12 +14,16 @@ contents:
script "/usr/bin/timeout 0.9 /usr/bin/curl -o /dev/null -Lfs http://localhost:1936/healthz/ready"
interval 1
weight 20
rise 3
fall 2
}
vrrp_script chk_default_ingress {
script "/usr/bin/timeout 4.9 /host/bin/oc --kubeconfig /var/lib/kubelet/kubeconfig get ep -n openshift-ingress router-internal-default -o yaml  | grep 'ip:' | grep {{`{{.NonVirtualIP}}`}} "
interval 5
weight 50
rise 3
fall 2
}
{{`{{$nonVirtualIP := .NonVirtualIP}}`}}
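
To illustrate why `rise` and `fall` reduce flapping, here is a minimal Python sketch of the hysteresis they introduce. It is not keepalived's implementation: the `ScriptTracker` class, the `count_transitions` helper, and the sample sequence of check results are all hypothetical, and the "before" case is modeled as `rise 1` / `fall 1` (reacting to every single check result), which is an assumption.

```python
from dataclasses import dataclass


@dataclass
class ScriptTracker:
    """Toy model of a health check with rise/fall hysteresis (hypothetical)."""
    rise: int = 1          # consecutive successes needed to become healthy
    fall: int = 1          # consecutive failures needed to become unhealthy
    healthy: bool = True   # assumed starting state for this illustration
    _streak: int = 0       # consecutive results contradicting the current state

    def observe(self, success: bool) -> bool:
        """Record one check result and return the resulting state."""
        if success == self.healthy:
            # The result agrees with the current state, so the streak resets.
            self._streak = 0
        else:
            self._streak += 1
            needed = self.rise if success else self.fall
            if self._streak >= needed:
                self.healthy = success
                self._streak = 0
        return self.healthy


def count_transitions(results, rise, fall):
    """Count how often the tracked state changes over a run of results."""
    tracker = ScriptTracker(rise=rise, fall=fall)
    transitions = 0
    for ok in results:
        before = tracker.healthy
        if tracker.observe(ok) != before:
            transitions += 1
    return transitions


# A check that mostly succeeds but occasionally times out (made-up data).
results = [True, True, False, True, True, False, True, True]

print(count_transitions(results, rise=1, fall=1))  # 4
print(count_transitions(results, rise=3, fall=2))  # 0
```

In this model, an isolated timeout no longer changes the check state once `rise 3` and `fall 2` are set, so the `weight` bonus is not withdrawn and the VIP has no reason to move. The trade-off is that a genuine failure needs two consecutive failed checks to be noticed, and recovery needs three consecutive successes.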