TLS handshake error from: EOF #2142

ritazh · 2022-07-01T23:12:43Z

What steps did you take and what happened:
Getting the following intermittent errors in the gatekeeper-system logs:

http: TLS handshake error from 172.16.0.3:42672: EOF

kube-apiserver logs during the same time range do not have equivalent errors.
Everything is functioning. No impact on functionality.

NOTE:
There aren't any actual functional issues related to these error messages, and the policies are working as expected. Many other webhook projects have reported the same issue. The error occurs when the kube-apiserver drops the connection prematurely and retries afterwards.

Please provide feedback in the following issues:

The EOF errors seem to be related to a Go bug (golang/go#50984) and appear on Kubernetes 1.23, 1.24, and later. See kubernetes/kubernetes#109022.

What did you expect to happen:
No TLS error in pod logs

Environment:

Gatekeeper version: v3.8.1 and v3.7.1
Kubernetes version: (use kubectl version): 1.23.5

ritazh · 2022-07-01T23:13:47Z

The EOF errors seem to be related to a Go bug (golang/go#50984) and appear on Kubernetes 1.23 and 1.24. See kubernetes/kubernetes#109022.

From the issue description, it does not seem like there are any actual functional issues related to these error messages (as the policies are working as expected). At the moment, there is nothing we can do to fix this, as the error is coming from Kubernetes core. We can continue to monitor this after the linked issue has been fixed and released as part of a future Kubernetes patch release.

ritazh · 2022-07-01T23:19:19Z

xref: #866 (comment)

stale · 2022-08-30T23:28:29Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

punnarpulusu · 2022-09-08T19:40:25Z

This is not limited to Kubernetes 1.23 and 1.24; it is also happening on Kubernetes (AWS EKS) version 1.21.

ritazh · 2022-09-09T00:17:09Z

@punnarpulusu Can you share the exact error in the log and kubernetes and gatekeeper version?

punnarpulusu · 2022-09-09T20:01:31Z

@ritazh Here is the error log, with some information redacted for security purposes.

Gatekeeper version is 3.8.1.

  k logs -n gatekeeper deploy/gatekeeper-controller-manager -f
  Found 3 pods, using pod/gatekeeper-controller-manager-xxxxxxx-ldsc7
2022/09/08 01:14:32 http: TLS handshake error from x.x.x.x:49070: EOF
2022/09/08 01:46:37 http: TLS handshake error from x.x.x.x:35184: EOF
2022/09/08 02:47:46 http: TLS handshake error from x.x.x.x:39938: EOF
2022/09/08 06:47:20 http: TLS handshake error from x.x.x.x:38652: EOF
2022/09/08 12:37:59 http: TLS handshake error from x.x.x.x:49956: EOF
2022/09/08 13:16:45 http: TLS handshake error from x.x.x.x:56032: EOF
2022/09/08 13:41:48 http: TLS handshake error from x.x.x.x:56232: EOF
2022/09/08 16:38:13 http: TLS handshake error from x.x.x.x:60828: EOF
2022/09/08 19:02:34 http: TLS handshake error from x.x.x.x:36744: EOF

Sorry about the delayed response.

punnarpulusu · 2022-09-28T16:48:17Z

@ritazh I am getting the same error on gatekeeper 3.9.0 as well

image: artifactory.dev.earnin.net/docker-remote/openpolicyagent/gatekeeper:v3.9.0

Here is the log

k logs -n gatekeeper deploy/gatekeeper-controller-manager -f
Found 3 pods, using pod/gatekeeper-controller-manager-69b88d77ff-v6fn8
2022/09/28 16:39:44 maxprocs: Updating GOMAXPROCS=5: determined from CPU quota
2022/09/28 16:40:01 http: TLS handshake error from x.x.x.x:48490: EOF

Any idea what's causing this issue and how I can get it fixed?

meons · 2022-10-10T08:27:10Z

Same here on GKE 1.22 + Gatekeeper 3.9.0:

kubectl -n gatekeeper logs deployment/gatekeeper-controller-manager | grep error
Found 3 pods, using pod/gatekeeper-controller-manager-888b9f574-h4vjz
2022/10/09 10:27:38 http: TLS handshake error from x.x.x.x:50782: EOF
2022/10/09 10:36:56 http: TLS handshake error from x.x.x.x:52720: EOF
2022/10/09 10:56:14 http: TLS handshake error from x.x.x.x:55364: EOF
2022/10/09 11:05:39 http: TLS handshake error from x.x.x.x:58102: EOF
2022/10/09 11:34:01 http: TLS handshake error from x.x.x.x:49868: EOF
2022/10/09 13:30:59 http: TLS handshake error from x.x.x.x:46064: EOF
2022/10/09 14:45:18 http: TLS handshake error from x.x.x.x:55056: EOF
2022/10/09 15:06:19 http: TLS handshake error from x.x.x.x:54452: EOF
2022/10/09 16:07:31 http: TLS handshake error from x.x.x.x:54824: EOF
2022/10/09 16:16:03 http: TLS handshake error from x.x.x.x:37644: EOF
2022/10/09 16:43:38 http: TLS handshake error from x.x.x.x:46590: EOF
2022/10/09 21:27:03 http: TLS handshake error from x.x.x.x:54706: EOF
2022/10/09 21:40:59 http: TLS handshake error from x.x.x.x:47458: EOF
2022/10/09 22:47:10 http: TLS handshake error from x.x.x.x:50688: EOF
2022/10/10 00:18:11 http: TLS handshake error from x.x.x.x:44814: EOF
2022/10/10 01:13:12 http: TLS handshake error from x.x.x.x:42378: EOF
2022/10/10 02:45:59 http: TLS handshake error from x.x.x.x:47150: EOF
2022/10/10 02:56:03 http: TLS handshake error from x.x.x.x:32800: EOF
2022/10/10 03:33:21 http: TLS handshake error from x.x.x.x:52332: EOF
2022/10/10 03:58:46 http: TLS handshake error from x.x.x.x:45042: EOF
2022/10/10 04:59:54 http: TLS handshake error from x.x.x.x:48482: EOF
2022/10/10 05:49:48 http: TLS handshake error from x.x.x.x:42376: EOF
2022/10/10 05:57:33 http: TLS handshake error from x.x.x.x:41712: EOF
2022/10/10 07:11:39 http: TLS handshake error from x.x.x.x:60302: EOF

The x.x.x.x addresses are actually GKE control plane IPs.

ZiaUrRehman-GBI · 2022-10-31T12:10:38Z

Kubernetes version : 1.24.3-gke.2100

textPayload: "2022/10/31 11:20:06 http: TLS handshake error from x.x.x.x:41398: EOF"

kfox1111 · 2022-11-03T22:06:04Z

Seen on k8s 1.21.11:
2022/11/03 19:17:10 http: TLS handshake error from 10.17.0.0:52110: EOF

Not sure it's affecting anything.

tspearconquest · 2022-11-13T23:57:18Z

Hello, I've noticed these before but not had time to do some proper investigation until now.

I found that these messages are coming from an IP belonging to the konnectivity pods in my kube-system namespace in Azure.

This pod is facilitating the control plane to cluster communications as per https://kubernetes.io/docs/tasks/extend-kubernetes/setup-konnectivity/

Digging into the kube-system namespace labels, I see that there is control-plane: true on that namespace.

I believe the cause is that konnectivity-agent looks for all namespaces where the control-plane label exists (regardless of its value) and tries to connect to the Gatekeeper pods.

I found #1061 which covers the removal of the control-plane label, however it has only been partially implemented by removing the check from the validating webhook configuration in Gatekeeper (#758)

Is it safe to remove the control-plane: controller-manager label from the gatekeeper-system namespace currently, if we have already applied the admission.gatekeeper.sh/ignore: no-self-managing label?

In case it is safe, then we should push to have the control-plane label removed from the namespace as soon as possible, as this is really causing problems for teams with log monitoring agents like fluentd.

ritazh · 2022-11-14T06:12:40Z

NOTE

The EOF errors seem to be related to a Go bug (golang/go#50984) and appear on Kubernetes 1.23 and 1.24. See kubernetes/kubernetes#109022.

From the issue description, it does not seem like there are any actual functional issues related to these error messages (as the policies are working as expected). At the moment, there is nothing we can do to fix this, as the error is coming from Kubernetes core. We can continue to monitor this after the linked issue has been fixed and released as part of a future Kubernetes patch release.

tspearconquest · 2022-11-14T13:04:47Z

Hi @ritazh, I believe that is incorrect. We also see these errors on Kubernetes 1.22, and others have noted in this issue that they happen on Kubernetes 1.21. For example, from @punnarpulusu's comment:

"This is not limited to Kubernetes 1.23 and 1.24; it is also happening on Kubernetes (AWS EKS) version 1.21."

Furthermore, kubernetes/kubernetes#109022 clearly shows the errors as coming from 127.0.0.1.
The original post of this issue does not indicate 127.0.0.1, but rather has the IP addresses masked as x.x.x.x which leads me to believe that the OP is experiencing this from their 10.x.x.x/8 subnet, the same as myself.
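One way to check which kind of peer produces these lines is to extract the remote address from each log line and classify it. The Go sketch below is an illustration only: the regular expression is an assumption based on the log lines pasted in this thread, and classify is a hypothetical helper. Redacted addresses such as x.x.x.x cannot be parsed and are reported as "unparsed".

```go
package main

import (
	"fmt"
	"net/netip"
	"regexp"
)

// handshakeErr matches the log lines quoted in this thread and captures
// the remote "address:port" part.
var handshakeErr = regexp.MustCompile(`http: TLS handshake error from (\S+): EOF`)

// classify returns "loopback", "private", "other", or "unparsed" for a
// handshake-error log line, and "" when the line is not such an error.
func classify(line string) string {
	m := handshakeErr.FindStringSubmatch(line)
	if m == nil {
		return ""
	}
	ap, err := netip.ParseAddrPort(m[1])
	if err != nil {
		return "unparsed" // for example a redacted x.x.x.x address
	}
	switch addr := ap.Addr(); {
	case addr.IsLoopback():
		return "loopback"
	case addr.IsPrivate():
		return "private"
	default:
		return "other"
	}
}

func main() {
	lines := []string{
		"2022/11/03 19:17:10 http: TLS handshake error from 10.17.0.0:52110: EOF",
		"2022/09/08 01:14:32 http: TLS handshake error from x.x.x.x:49070: EOF",
		"2022/09/28 16:39:44 maxprocs: Updating GOMAXPROCS=5: determined from CPU quota",
	}
	for _, l := range lines {
		fmt.Printf("%q -> %q\n", l, classify(l))
	}
}
```

A "loopback" result would match the pattern described in kubernetes/kubernetes#109022, while "private" points at an in-cluster or control-plane peer such as the konnectivity-agent pods.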

ritazh · 2022-11-14T14:43:39Z

Thanks for the additional data @tspearconquest! If you remove the control-plane label from the gatekeeper-system namespace as you suggested, do you still see the error in the log?

tspearconquest · 2022-11-14T15:01:08Z

We're testing today and I will report back soon!

Murtaza-Solangi · 2022-11-23T14:14:04Z

Re: "We're testing today and I will report back soon!"

Was the test successful?

tspearconquest · 2022-11-23T15:26:15Z

Re: "Was the test successful?"

Hello, apologies as I put my update on the other issue: #1061

Hi @ritazh - It seems my suspicion was not correct, and removing the control-plane label did not help.

It's really interesting that this is only affecting Gatekeeper, as we do have other tools with mutating and validating webhooks (MWH and VWH) which do not see this problem, and the traffic causing the errors is 100% coming from the konnectivity-agent pods in kube-system.

I also took a look at the konnectivity configmap and deployment manifest in one of our clusters to see if I could find a log format option, but I'm afraid I couldn't find any. My main concern is that these messages are not in JSON format, so our fluentd instance produces a lot of spam trying to parse non-JSON log output as JSON.

stale · 2023-01-22T23:10:48Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

tspearconquest · 2023-01-22T23:14:21Z

Not stale.

wondywang · 2023-03-02T07:05:11Z

I also encountered these error logs and am looking forward to someone solving this.

maxsmythe · 2023-09-05T23:06:46Z

Re: "Gatekeeper works but this error happens periodically. Maybe because of cert rotation"

I'd expect the same errors during cert rotation, though I would think the cert rotation frequency is low enough (O(years)) for the error to never repeat in the standard lifecycle of a pod.

In any case, having multiple concurrent writers with one "winner" is probably the best model. This is essentially how leader election works anyway, and avoids needing to worry about any one pod becoming a SPOF or figuring out who is eligible to become a leader. There is an edge case where there is the possibility of a controller fight if there is an incompatible change. This can be mitigated by gradually introducing a change and leaning on our "upgrades are N - 1 compatible" policy.

stale · 2023-11-05T06:23:22Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

pankajmt · 2023-11-06T05:21:39Z

still an open issue

resnostyle · 2023-11-16T19:17:53Z

We're currently running Kubernetes 1.25.15 and OPA Gatekeeper v3.12.0 and are still seeing this error.

cccsss01 · 2023-12-02T17:00:20Z

Seeing this error on 1.27.1 with Gatekeeper v3.11.0. Not sure whether it is causing timeout issues for leader election.

stale · 2024-02-01T06:32:16Z

This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 14 days if no further activity occurs. Thank you for your contributions.

immae1 · 2024-02-01T08:30:17Z

Still a topic on AKS (1.27.7) with the latest node images:

2024/02/01 08:28:47 http: TLS handshake error from 10.2.0.1:32914: EOF

OrKarstoft · 2024-02-01T11:43:41Z

Same issue with Kubernetes v1.27.9-gke.1092000.

aimbot31 · 2024-02-14T13:14:32Z

Still happening.

rjbrown57 · 2024-02-14T15:12:40Z

Seeing this in 1.26 as well.

ritazh · 2024-02-23T05:23:29Z

Closing this issue, as there aren't any actual functional issues related to these error messages and the policies are working as expected. Many other webhook projects have reported the same issue. The error occurs when the kube-apiserver drops the connection prematurely and retries afterwards.

cbugneac-nex · 2024-02-23T09:48:25Z

But it does pollute the logs in DataDog (in our case) with these errors, creating unnecessary noise.

sozercan · 2024-02-23T17:38:01Z

@cbugneac-nex it would be good to provide that feedback in golang/go#50984 as this is not an issue Gatekeeper (or any Kubernetes webhooks) can address on its own.

pythonking6 · 2024-05-30T02:24:29Z

This may be separate, but I am seeing this in the load balancer controller on Kubernetes 1.29, terraformed from eks_blueprints:

2024/05/29 21:55:41 http: TLS handshake error from 10.0.119.240:46106: EOF

ritazh added the bug label Jul 1, 2022

ritazh pinned this issue Jul 1, 2022

stale bot added the stale label Aug 30, 2022

stale bot removed the stale label Sep 8, 2022

tspearconquest mentioned this issue Nov 14, 2022: Deprecate control-plane labels #1061 (Open)

ralgozino mentioned this issue Nov 30, 2022: Gatekeeper Controller Manager logs http: TLS handshake error sighupio/fury-kubernetes-opa#89 (Open)

mochizuki875 mentioned this issue Jan 5, 2023: read: connection reset by peer kubernetes-sigs/hierarchical-namespaces#236 (Closed)

stale bot added the stale label Jan 22, 2023

stale bot removed the stale label Jan 22, 2023

rigl1 mentioned this issue Feb 18, 2023: Install opensearch 2.5 with operator fails in kubernetes opensearch-project/opensearch-k8s-operator#439 (Closed)

zodd3131 mentioned this issue Oct 28, 2023: rke2 cis profile 1.23, api server can't contact gatekeeper (TLS ERROR) rancher/rke2#4910 (Closed)

sozercan mentioned this issue Oct 31, 2023: TSL handshake error not valid JSON scheme logs #3136 (Closed)

slavogiez mentioned this issue Nov 3, 2023: Recurrent error in zonal controlplane: TLS handshake error from xxx.xxx.xxx.xxx:yyy: EOF kumahq/kuma#8247 (Closed)

stale bot added the stale label Nov 5, 2023

stale bot removed the stale label Nov 6, 2023

moolen mentioned this issue Jan 18, 2024: Webhook Pod outputs TLS handshake errors external-secrets/external-secrets#2983 (Closed)

stale bot added the stale label Feb 1, 2024

stale bot removed the stale label Feb 14, 2024

salaxander added the wontfix label and removed the bug label Feb 21, 2024

ritazh added the wontfix label and removed the wontfix label Feb 23, 2024

ritazh closed this as completed Feb 23, 2024

ritazh closed this as not planned Feb 23, 2024

ritazh unpinned this issue Mar 19, 2024

hesamhamdarsi mentioned this issue Jun 5, 2024: Receiving ssl handshake error from kubernetes APIserver and opentelemetry webhook open-telemetry/opentelemetry-operator#2956 (Open)
