Unable to set Loadbalancer Service Health Probe Port #1505

Closed
vsabella opened this issue Apr 15, 2022 · 7 comments · Fixed by #2452
Labels
kind/bug Categorizes issue or PR as related to a bug.

Comments

@vsabella
Contributor

vsabella commented Apr 15, 2022

Reopen from #1499

When configuring health checks for an Azure Load Balancer you can specify the probe path and protocol, but it is not possible to specify the port of the health probe.

Istio's ingress gateway and similar services do not expose health checks on their actual traffic endpoints (since they can operate across multiple protocols); instead they expose a general health check on a separate status port for the gateway.

In AWS this is easily done with the service.beta.kubernetes.io/aws-load-balancer-healthcheck-port annotation, but it does not seem to be possible in Azure.
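
For comparison, a minimal sketch of how that AWS annotation is applied on a Service of type LoadBalancer (the Service name and port values here are illustrative only):

apiVersion: v1
kind: Service
metadata:
  name: istio-ingressgateway   # hypothetical name, for illustration
  annotations:
    # Point the load balancer health check at a dedicated status port
    # instead of the traffic ports.
    service.beta.kubernetes.io/aws-load-balancer-healthcheck-port: "15021"
spec:
  type: LoadBalancer
  ports:
    - name: http2
      protocol: TCP
      port: 80
      targetPort: 8080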

This prevents us from configuring health probes for Istio, which need to target the "status ready" port rather than the actual backend port.

Example

  • Istio is hosted on two ports, 80 and 443, which will not serve any traffic until a VirtualService is deployed.
  • The health check needs to be an HTTP check against the status port; let's say that's 15021.
  • The probe protocol and endpoint can be set, but the health check port cannot be.

For the following spec:

spec:
  ports:
    - name: http2
      protocol: TCP
      port: 80
      targetPort: 8080
    - name: https
      protocol: TCP
      port: 443
      targetPort: 8443

The generated probe needs to look like this:

[Image: the desired Azure load balancer health probe, pointing at the Istio status port 15021 instead of the backend ports]
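
A sketch of what that probe would need to contain, based on the description above (the request path /healthz/ready is assumed here as Istio's conventional readiness endpoint; it is not specified in the issue):

# Desired Azure load balancer health probe, shared by both frontend ports
protocol: Http
port: 15021                   # Istio status port, not the backend ports 8080/8443
requestPath: /healthz/ready   # assumed Istio readiness path, for illustration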

@vsabella
Contributor Author

@MartinForReal I've started down this path:
vsabella@5303352

If that looks like it makes sense as a fix, I'll also complete it for the MixedLB mode /port_{num}-... annotations.

@vsabella
Contributor Author

It also looks like, since priority is given to port.AppProtocol, we are unable to:

  1. set appProtocol to the correct value (https)
  2. expect the health probe to be a different protocol (say http)

The ability to decouple the health check from the actual service entry is key to getting Istio ingressgateway to have correct health checks / drain behavior / etc...

	// AppProtocol set on the service port always wins; the health probe
	// protocol annotation is only consulted when AppProtocol is nil, and
	// the final fallback is a plain TCP probe.
	if port.AppProtocol == nil {
		if port.AppProtocol, err = consts.GetAttributeValueInSvcAnnotation(annotations, consts.ServiceAnnotationLoadBalancerHealthProbeProtocol); err != nil {
			return nil, fmt.Errorf("failed to parse annotation %s: %w", consts.ServiceAnnotationLoadBalancerHealthProbeProtocol, err)
		}
		if port.AppProtocol == nil {
			port.AppProtocol = to.StringPtr(string(network.ProtocolTCP))
		}
	}

@MartinForReal
Contributor

Well, this is a pleasant surprise. :-)

That seems like a reasonable fix. If we add two additional annotations, will that work for your scenario?

port_{num}-probe-protocol
port_{num}-probe-port

Priority would then go to the two new annotations.

@feiskyer @nilo19 @lzhecheng Any advice is appreciated!

@vsabella Thanks for the contribution!

@vsabella
Contributor Author

That's exactly what I was thinking, and it sounds reasonable. I'll have a PR for you later this week.
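
For reference, a sketch of how the proposed per-port annotations might be used on the Service from the issue description, using the names exactly as proposed above (the final annotation keys, including any service.beta.kubernetes.io/ style prefix, may differ in the eventual PR):

metadata:
  annotations:
    # Proposed: per-port override of the health probe protocol and port,
    # decoupling the probe from port.AppProtocol and the backend targetPort.
    port_80-probe-protocol: "Http"
    port_80-probe-port: "15021"
    port_443-probe-protocol: "Http"
    port_443-probe-port: "15021"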

@MartinForReal
Contributor

Hi @vsabella, do you still have bandwidth to work on this?

@k8s-triage-robot

The Kubernetes project currently lacks enough contributors to adequately respond to all issues and PRs.

This bot triages issues and PRs according to the following rules:

  • After 90d of inactivity, lifecycle/stale is applied
  • After 30d of inactivity since lifecycle/stale was applied, lifecycle/rotten is applied
  • After 30d of inactivity since lifecycle/rotten was applied, the issue is closed

You can:

  • Mark this issue or PR as fresh with /remove-lifecycle stale
  • Mark this issue or PR as rotten with /lifecycle rotten
  • Close this issue or PR with /close
  • Offer to help out with Issue Triage

Please send feedback to sig-contributor-experience at kubernetes/community.

/lifecycle stale

@k8s-ci-robot k8s-ci-robot added the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 18, 2022
@rainest
Contributor

rainest commented Sep 30, 2022

/remove-lifecycle stale

Checking on this as well; we recently encountered it when our users started migrating to 1.24. Is the fork fix still under active development, or is there any ongoing work from Microsoft to implement these annotations?

FWIW, in our case the affected HTTP services:

  • are not likely to serve 200s for GET /, as they're proxy instances that often only route requests that include some configured host header.
  • require a TLS client certificate before accepting any requests.

@k8s-ci-robot k8s-ci-robot removed the lifecycle/stale Denotes an issue or PR has remained open with no activity and has become stale. label Sep 30, 2022