OCPBUGS-24363: Full implementation of KEP-1669 ProxyTerminatingEndpoints #4072
Conversation
go-controller/pkg/util/kube.go
// GetEndpointAddressesWithPreCondition first applies the (optionally) provided function fn
// to filter endpoints from the given list of endpoint slices; from the resulting list,
// it returns the IP addresses of eligible endpoints.
func GetEndpointAddressesWithPreCondition(endpointSlices []*discovery.EndpointSlice, service *kapi.Service, fn func(discovery.Endpoint) bool) sets.Set[string] {
Maybe pedantic, but why the change from GetEndpointAddressesWithCondition()?
No, that's a good question :)
In the previous implementation, the selection of endpoints was simpler and had no fallback case: I would just take all ready endpoints plus all serving terminating endpoints. GetEndpointAddressesWithCondition first selected endpoints that way and only afterwards applied the provided filter function (in our case, to keep only local endpoints).
With the new selection logic in this PR, the order between the two steps becomes important: I need to first apply the filter function (e.g. keep all local endpoints) and only then apply the new selection logic (take all ready endpoints, but if none are ready take serving terminating endpoints). The ordering here changes the result.
Imagine you have three nodes and you need to get the "eligible" endpoints local to node1:
- node1: endpoint1_serving&terminating, endpoint2_serving&terminating
- node2: endpoint3_serving&terminating, endpoint4_ready
- node3: endpoint5_ready, endpoint6_ready
If I first apply the new selection logic and only then keep local endpoints, I end up with no endpoints:
- [endpoint selection] all ready endpoints in the cluster: endpoint4, endpoint5, endpoint6
- [filter function] none, since the ready endpoints are not on node1
If I reverse the steps I get the result I want:
- [filter function] endpoint1, endpoint2 --> they are local to node1
- [endpoint selection] endpoint1, endpoint2 --> they are serving & terminating
So I replaced Condition with PreCondition in the function name to highlight that it gets applied first.
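The ordering argument above can be sketched with a minimal, self-contained model. The `endpoint` struct and the function names here are illustrative only (the real code operates on `discovery.Endpoint` from `k8s.io/api/discovery/v1`), but the two compositions reproduce the node1 example:

```go
package main

import "fmt"

// endpoint is a hypothetical, simplified stand-in for discovery.Endpoint.
type endpoint struct {
	name     string
	nodeName string
	ready    bool
	serving  bool // serving && terminating in the real API
}

// selectEligible sketches the KEP-1669 fallback: take all ready endpoints;
// only if none are ready, fall back to serving (terminating) endpoints.
func selectEligible(eps []endpoint) []endpoint {
	var ready, serving []endpoint
	for _, ep := range eps {
		if ep.ready {
			ready = append(ready, ep)
		} else if ep.serving {
			serving = append(serving, ep)
		}
	}
	if len(ready) > 0 {
		return ready
	}
	return serving
}

// filterLocal is the "precondition": keep only endpoints on the given node.
func filterLocal(eps []endpoint, node string) []endpoint {
	var out []endpoint
	for _, ep := range eps {
		if ep.nodeName == node {
			out = append(out, ep)
		}
	}
	return out
}

func main() {
	eps := []endpoint{
		{"endpoint1", "node1", false, true},
		{"endpoint2", "node1", false, true},
		{"endpoint3", "node2", false, true},
		{"endpoint4", "node2", true, true},
		{"endpoint5", "node3", true, true},
		{"endpoint6", "node3", true, true},
	}
	// Selection first, filter second: the ready endpoints 4-6 survive
	// selection but none are on node1, so the result is empty.
	fmt.Println(len(filterLocal(selectEligible(eps), "node1"))) // 0
	// Filter first, selection second: endpoints 1-2 are local, none are
	// ready, so the serving-terminating fallback keeps both.
	fmt.Println(len(selectEligible(filterLocal(eps, "node1")))) // 2
}
```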
I've just added a test case to cover the scenario I described above.
https://github.com/ovn-org/ovn-kubernetes/pull/4072/files#diff-4c82fea05fd74e2aaa346d3d3caaa27442ed190cf4057e8e1a810982eafdd286R1807
/retest
/retest-failed
	serviceStr = fmt.Sprintf(" for service %s/%s", service.Namespace, service.Name)
}
// separate IPv4 from IPv6 addresses for eligible endpoints
for _, endpoint := range getEligibleEndpoints(validSlices, service) {
So the logic above this line, could it be a condFn for getEligibleEndpoints / getSelectedEligibleEndpoints?
Here we're separating IPv4 from IPv6 endpoint addresses. Do I need a condFn for that?
Actually, here I should use the existing getEligibleEndpointAddresses, but at the cost of adding one more iteration through the endpoints. I'm a little hesitant to do that because it gets executed by ovnkube-controller, which configures /all/ the endpoints for a given service: https://github.com/ovn-org/ovn-kubernetes/blob/master/go-controller/pkg/ovn/controller/services/services_controller.go#L392-L402
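The IPv4/IPv6 separation being discussed can be sketched in isolation with the standard library; `splitByIPFamily` is a hypothetical helper name, not the controller's actual function:

```go
package main

import (
	"fmt"
	"net"
)

// splitByIPFamily separates endpoint addresses into IPv4 and IPv6 groups,
// mirroring the per-family bookkeeping done for dual-stack services.
// Illustrative only: the real controller uses its own helpers.
func splitByIPFamily(addrs []string) (v4, v6 []string) {
	for _, a := range addrs {
		ip := net.ParseIP(a)
		if ip == nil {
			continue // skip unparsable addresses
		}
		if ip.To4() != nil {
			v4 = append(v4, a)
		} else {
			v6 = append(v6, a)
		}
	}
	return v4, v6
}

func main() {
	v4, v6 := splitByIPFamily([]string{"10.244.0.5", "fd00::5", "10.244.1.7"})
	fmt.Println(v4) // [10.244.0.5 10.244.1.7]
	fmt.Println(v6) // [fd00::5]
}
```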
I mean the logic above this line, not the logic below. So the one where you are matching the port name & protocol of the service.
Oh OK, I see. condFn as I defined it is applied to single endpoints, because we need to check their node name, while in GetLbEndpoints we're filtering out entire slices. I don't know if rewriting it once again for this particular usage would improve the overall readability :)
The previous implementation was an approximation of KEP-1669 ProxyTerminatingEndpoints: we simply included terminating serving endpoints (ready=false, serving=true, terminating=true) along with ready ones in the endpoint selection logic. Let's fully implement KEP-1669 and only include terminating endpoints if none are ready. The selection follows two simple steps:
1) Take all ready endpoints.
2) If no ready endpoints were found, take all serving terminating endpoints.
This should also help with an issue found in a production cluster (https://issues.redhat.com/browse/OCPBUGS-24363) where, due to infrequent readiness probes, terminating endpoints were declared as non-serving (that is, their readiness probe failed) only quite late and were included as valid endpoints for quite a while, even though the existing ready endpoints should have been preferred.
Extended the test cases to include testing against multiple slices and dual-stack scenarios.
Signed-off-by: Riccardo Ravaioli <rravaiol@redhat.com>
Fix included in accepted release 4.16.0-0.nightly-2024-01-31-073538
The previous implementation was an approximation of KEP-1669 ProxyTerminatingEndpoints: we simply included terminating serving endpoints (ready=false, serving=true, terminating=true) along with ready ones in the endpoint selection logic. Let's fully implement KEP-1669 and only include terminating endpoints if none are ready. The selection follows two simple steps:
1) Take all ready endpoints.
2) If no ready endpoints were found, take all serving terminating endpoints.
This is done for all traffic policies (kubernetes/kubernetes#108691).
This should also help with an issue found in a production cluster (https://issues.redhat.com/browse/OCPBUGS-24363) where, due to infrequent readiness probes, terminating endpoints were declared as non-serving (that is, their readiness probe failed) only quite late and were included as valid endpoints for quite a while, even though the existing ready endpoints should have been preferred.
Extended the test cases to include testing against multiple slices and dual-stack scenarios.
Successfully tested downstream on 4.16 (openshift/ovn-kubernetes#2008) and 4.14 (openshift/ovn-kubernetes#2009, which is relevant because of the OVNK architecture change)
Fixes #OCPBUGS-24363