
[EKS] [request]: allow to configure metricsBindAddress for kube-proxy #657

Closed
chancez opened this issue Dec 17, 2019 · 59 comments
Labels: EKS (Amazon Elastic Kubernetes Service), Proposed (Community submitted issue)


@chancez commented Dec 17, 2019

Community Note

  • Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
  • Please do not leave "+1" or "me too" comments, they generate extra noise for issue followers and do not help prioritize the request
  • If you are interested in working on this issue or have submitted a pull request, please leave a comment

Tell us about your request
I want to be able to monitor kube-proxy with Prometheus, but cannot because, by default, metricsBindAddress is set to 127.0.0.1:10249, meaning the metrics endpoint isn't accessible from outside the pod.

Which service(s) is this request for?
EKS

Tell us about the problem you're trying to solve. What are you trying to do, and why is it hard?
I want to monitor kube-proxy. I cannot do this unless I can reconfigure kube-proxy in some way.

Are you currently working around this issue?
I am not.

Additional context
You can see the kube-proxy config created by EKS in the kube-system namespace via kubectl get cm -n kube-system kube-proxy-config -o yaml.
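
For reference, a quick way to confirm the current value (the command is the one above; the value shown is the default described in this issue):

    # Inspect the kube-proxy config shipped by EKS:
    kubectl -n kube-system get cm kube-proxy-config -o yaml
    # Inside the embedded KubeProxyConfiguration you should see:
    #   metricsBindAddress: 127.0.0.1:10249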

@chancez added the Proposed (Community submitted issue) label Dec 17, 2019
@TBeijen commented Dec 20, 2019

Additional context illustrating the problem: https://github.com/helm/charts/tree/master/stable/prometheus-operator#kubeproxy

@dawidmalina

Quick hotfix:

# Rewrite metricsBindAddress in the EKS-supplied ConfigMap and re-apply it:
kubectl --kubeconfig kubeconfig* -n kube-system get cm kube-proxy-config -o yaml |sed 's/metricsBindAddress: 127.0.0.1:10249/metricsBindAddress: 0.0.0.0:10249/' | kubectl --kubeconfig kubeconfig* apply -f -
# Bump a pod-template label so the DaemonSet rolls its pods and picks up the change:
kubectl --kubeconfig kubeconfig* -n kube-system patch ds kube-proxy -p "{\"spec\":{\"template\":{\"metadata\":{\"labels\":{\"updateTime\":\"`date +'%s'`\"}}}}}"

@alfredkrohmer

I cannot do this unless I can reconfigure kube-proxy in some way.

You can see the kube-proxy config created by EKS in the kube-system namespace via k get cm -n kube-system kube-proxy-config -o yaml.

So what exactly prevents you from changing this configmap?

@chancez (Author) commented Dec 23, 2019

Nothing, except that it's entirely possible it'll get overwritten at some point in the future. EKS has no documentation on what's supported and unsupported when it comes to this kind of modification, so I'd like to know what guarantees there are that it won't be stomped on in the future.

@alfredkrohmer commented Dec 23, 2019

The EKS cluster upgrade documentation explicitly instructs to manually update kube-proxy, coredns and aws-node after a control plane upgrade. I highly doubt that AWS will touch these components in an automated fashion. We have been using custom configuration for all three (kube-proxy, coredns and aws-node) since we started using EKS and never had a problem with this.

@chancez (Author) commented Dec 23, 2019

That's the daemonset, not the configuration. While I agree it's unlikely to be an issue, having previously worked on commercial Kubernetes distributions, I know that making assumptions about what's safe to modify isn't always the best choice. I suppose I can always open a ticket.

@kogeler commented Feb 19, 2020

I really need this feature!


@adamrbennett

Manually editing configurations is a workaround, not a solution. All configs should be coded and recorded in source control. Unless I'm missing something, EKS users have no control over the kube-proxy configuration except for manual edits.

@mikestef9 added the EKS (Amazon Elastic Kubernetes Service) label Apr 9, 2020
@damscott commented Apr 28, 2020

I think @devkid is correct, but we could use confirmation from the EKS team that automated changes will not be made to the kube-system namespace in a running cluster.

Here is the problem I encountered when setting metricsBindAddress:

When I tried to edit metricsBindAddress I found that I did not have a kube-proxy-config configmap. I have two EKS clusters: one is 565 days old, the other is 325 days old.
The newer cluster has the kube-proxy-config configmap; the older cluster does not. I have never touched the configuration of those resources beyond patching the DaemonSet image tag as documented in Updating an Amazon EKS cluster Kubernetes version.

Here is my proposed solution:

Have a git repo with all of the baseline EKS manifests (including kube-proxy) and document which EKS Platform Version corresponds with each revision

  • This already exists to some degree for the aws-node DaemonSet as part of the CNI patching process
  • It would provide guidance during upgrades while leaving full control in the hands of the user
  • It would help recover from accidental misconfigurations made to core EKS components
  • It would have helped me deal with the undocumented drift in the kube-proxy manifests

Update:

Copying the DaemonSet and ConfigMaps from the newer cluster to the old one works. It was necessary to change one line in the kube-proxy configmap:

        server: https://<foo>.<bar>.<region>.eks.amazonaws.com
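
A rough sketch of that copy (context names are hypothetical; review the exported manifests before applying them):

    # Export the kube-proxy ConfigMaps and DaemonSet from the newer cluster:
    kubectl --context new-cluster -n kube-system get cm kube-proxy kube-proxy-config -o yaml > kube-proxy-cm.yaml
    kubectl --context new-cluster -n kube-system get ds kube-proxy -o yaml > kube-proxy-ds.yaml
    # Edit the server: line in kube-proxy-cm.yaml to the old cluster's API endpoint, then:
    kubectl --context old-cluster apply -f kube-proxy-cm.yaml -f kube-proxy-ds.yaml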

@abangser

Just to add, I have a cluster that was last upgraded in late Jan to version 1.14 but initially created in October at 1.10. It does not seem to have the kube-proxy-config configmap. We are also running prometheus-operator and unable to use the kubeProxy exporter without an edit to the metricsBindAddress referenced in the docs here: #657 (comment).

@karlskewes

100% agree with @damscott. Good write-up.
kube-proxy-config ConfigMap and kube-proxy DaemonSet should be in a versioned manifest like other components (cni, coredns, calico).

There's a second ConfigMap called kube-proxy that, from the looks of it, is a cluster-specific kubeconfig, which I guess we wouldn't touch.

@mbarrien

Note that in my investigation of the clusters in my org (originally created on a wide range of EKS versions from 1.11 to 1.15), any EKS clusters created from 1.12 onwards use the kube-proxy-config configmap. There is still the issue of migrating any EKS clusters created before 1.12 to using a configmap; this might be possible to do manually.

Any EKS cluster created before 1.12 also has a slightly different kube-proxy configmap: the 1.12+ configmap contains the cluster URL, while on 1.11 and below the cluster URL (along with oom-score-adj and proxy-mode) is passed as a command-line argument to kube-proxy (within the pod spec inside the DaemonSet spec).

If you created your cluster pre-EKS 1.12, to be able to pass metricsBindAddress you either need to "upgrade" your daemonset to take in the kube-proxy-config configmap (which you'll need to create manually), or pass it as a command-line argument. Unfortunately this "upgrade" of kube-proxy to the new layout is not documented.
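
A minimal sketch of the command-line route (the flag itself is documented for kube-proxy; the rest is illustrative):

    # Open the DaemonSet for editing:
    kubectl -n kube-system edit ds kube-proxy
    # ...then append the flag to the kube-proxy container's command, e.g.:
    #   - --metrics-bind-address=0.0.0.0:10249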

@mbarrien

@damscott https://github.com/aws/eks-charts/tree/master/stable/aws-vpc-cni has some Helm charts for keeping aws-vpc-cni up to date. Would be great if they added similar charts for kube-proxy and coredns.
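
For anyone who hasn't used that repo, it can be added like so (per this comment, kube-proxy and coredns charts aren't in it yet):

    # Add the eks-charts Helm repository and find the CNI chart:
    helm repo add eks https://aws.github.io/eks-charts
    helm search repo eks/aws-vpc-cni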

@TBBle commented Jun 15, 2020

I haven't verified this, but CoreDNS already has a Helm chart, so it should just be a matter of getting the right parameters for that to match the default-installed version, then any changes you need.
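
A hedged sketch of that approach (the repo URL and chart name are assumptions to verify; the values must be matched to the EKS defaults first):

    # Community CoreDNS chart (verify repo and chart name before relying on this):
    helm repo add coredns https://coredns.github.io/helm
    helm install coredns coredns/coredns --namespace kube-system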

@OffCode commented Jun 25, 2020

I also have the same issue with the most recent EKS version (Kubernetes 1.16) and the prometheus-operator stable Helm chart. The root cause of the problem is that EKS likely uses an outdated default for metricsBindAddress.
The Kubernetes documentation describes:

--metrics-bind-address ipport     Default: 127.0.0.1:10249

The IP address with port for the metrics server to serve on (set to '0.0.0.0:10249' for all IPv4 interfaces and '[::]:10249' for all IPv6 interfaces). Set empty to disable.

See also
https://kubernetes.io/docs/reference/command-line-tools-reference/kube-proxy/

@boatmisser commented Aug 19, 2020

This is still an issue with EKS 1.17.

The kube-proxy-config has 0.0.0.0 for bindAddress and healthzBindAddress.
However, it still uses 127.0.0.1 for metricsBindAddress!

Would it be much of an issue to move metricsBindAddress to 0.0.0.0 to bring it in line with the other addresses, and allow us to monitor kube-proxy?

@chancez (Author) commented Aug 28, 2020

Original creator of the issue here. I thought I'd chime in on how I handle this these days.

At this point I just maintain a kustomize base for kube-proxy that I apply to all clusters immediately after deploying, and it's managed via ArgoCD. You could do the same with a Helm chart or similar.
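
For illustration, such a base might look like this (file names and layout are hypothetical):

    # kube-proxy/
    #   kustomization.yaml   # lists configmap.yaml and daemonset.yaml as resources
    #   configmap.yaml       # kube-proxy-config with metricsBindAddress: 0.0.0.0:10249
    #   daemonset.yaml       # copied from the EKS defaults, image tag pinned
    kubectl apply -k kube-proxy/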

At this point I think AWS has made it quite clear that it's unlikely they'll manage components inside the cluster, at least in the short to medium term. If anything, I'd bet that if they were to begin managing "core" components like kube-proxy, aws-node/cni, etc., you would opt in to EKS managing these components for you, and even then it's likely to apply only to new EKS clusters. This is just a guess.

Their documentation provides instructions on managing the versions of various components in the cluster, which makes me think it's unlikely there will be any automated management of this stuff for quite some time, so there's little risk in managing the versions and configuration of these resources yourself.

As others suggested, I think a more likely avenue for supporting this feature is aws/eks-charts#156, #923, and the other issues where you would fully manage kube-proxy yourself.

@aviau commented Jul 2, 2021

At this point I think AWS has made it quite clear, it's unlikely they'll manage components inside the cluster,

Good news! EKS now manages components inside the cluster via EKS cluster addons.

However, they have zero config.

Can we please have addon config that would allow for things like this?

@Oliniusz commented Jul 6, 2021

I made a dumb move and enabled the kube-proxy addon. It failed to install due to the modified metricsBindAddress in the configmap. Eh, whatever, I thought, and disabled it straight away. That actually completely deleted the kube-proxy resources from the cluster...

So to get the cluster back, I re-enabled the addon with the config-overwriting option checked. That installed kube-proxy again but restored metricsBindAddress to the original 127.0.0.1. After that I modified the configmap by hand, cursing at myself for how stupid that was and how unpredictable it is.

I hope AWS will soon add an option to keep the configmap untouched, or just change the default metricsBindAddress to 0.0.0.0 - that's the only thing I need here.

Otherwise I'll put the kube-proxy manifests into the gitops repository like I should have done already and forget about all those addons.

@aviau commented Jul 6, 2021

That actually completely deleted kube-proxy resources from the cluster...

Same experience here. Deploying an addon without overwrite enabled results in breaking everything.

@Pluies commented Jul 15, 2021

Same boat here - switched to the add-on, lost my monitoring :(

The add-ons are a great step forward for keeping these sorts of resources up to date, so I don't want to revert to maintaining them manually.

It would be really good to have metricsBindAddress: 0.0.0.0:10249 be the default on the kube-proxy add-on! Or at the very least have this configurable at the add-on level, similar to the way the vpc-cni add-on allows you to pick an IAM role for IRSA.

@pramodpawade commented Jul 27, 2021

I'll second @Pluies here. Recently I tried this on my project and lost my monitoring.

I see a couple of comments saying that EKS add-ons don't update the kube-proxy config on your cluster if you modify it, but unfortunately they do.
I tested this by enabling EKS add-ons and then manually modifying the kube-proxy config on my cluster to include metricsBindAddress: 0.0.0.0:10249, but after a couple of minutes, without my making any update to the cluster, the value was back to metricsBindAddress: 127.0.0.1:10249.
So EKS add-ons do update your cluster config, and keep checking and reverting it at periodic intervals.

It would be really good if AWS made metricsBindAddress: 0.0.0.0:10249 the default on the kube-proxy add-on, or provided some placeholder to modify this at the add-on level.

Update as of 03/08/2021: I was checking this with the AWS support team, and it seems this issue is now in the proposed stage, so I think we might get a solution from AWS soon.

@andrew-pickin-epi

I have to echo the above, especially as there is no way to roll back this change in production clusters without downtime and significant remedial action.

In the meantime, if AWS are reading this: please remove this addon as the default way to upgrade a cluster, or at least include a warning.

It's possible, as happened with us, that the upgrade appears fine on a test cluster only to break later on production, and rolling back isn't possible/practical.

@mikestef9 (Contributor)

Correct: kube-proxy on EKS 1.22 clusters (either unmanaged kube-proxy installed with a new 1.22 cluster, or the managed add-on) now exposes metrics by default. If you are upgrading an older cluster that still uses the unmanaged add-on, the manifest will not change.

containers-roadmap automation moved this from Coming Soon to Just Shipped Apr 4, 2022
@julianxhokaxhiu

What about kube-proxy on older Kubernetes versions on EKS, though? Why isn't a one-line patch applied to all versions 1.19 and above?

@ddmunhoz commented Apr 5, 2022

Thanks @stevehipwell and @mikestef9 for the update!

@stevehipwell

@mikestef9 I was going to say that the documentation for this was confusing as it called out "Amazon Managed Service for Prometheus" but you've already fixed that so thank you.

@TBBle commented Apr 5, 2022

If you are upgrading an older cluster that still used the unmanaged add-on, the manifest will not change.

@mikestef9 (and relevant to @julianxhokaxhiu 's question): If you manually update the configmap on an older cluster (and/or an older-but-upgraded-to-1.22 cluster) will it still revert the configmap change back to the localhost bind address?

@stevehipwell

@TBBle for unmanaged add-ons, unless you know differently, the manifests aren't modified once the cluster has been provisioned. Or are you asking about managed add-ons?

@TBBle commented Apr 5, 2022

I was referring to #657 (comment), #657 (comment), and #657 (comment). They're talking about managed addons.

Reading #657 (comment) again, perhaps I misparsed it, and the change applies to the kube-proxy managed addon on all clusters; it's only for the unmanaged add-on that the cluster version is relevant, because it chooses which config map to go with initially and doesn't touch it again.

@mikestef9 (Contributor)

That's right. If you upgrade to the 1.22 kube-proxy managed add-on on an existing cluster upgraded to 1.22, it will contain the change to expose metrics.

@zekena2 commented Jun 1, 2022

What about old versions? I'm on 1.21; do I have to upgrade in order to monitor kube-proxy?

@zekena2 commented Jun 2, 2022

I tried editing the configMap kube-proxy-config and it got reverted to 127.0.0.1; I also tried creating a new configMap with the fix and pointing the daemonSet at it, and the daemonSet still got reverted.
The docs claim that we can edit the configuration of add-ons from v1.18 onwards, but that seems false in the case of kube-proxy, at least for me.
https://docs.aws.amazon.com/eks/latest/userguide/add-ons-configuration.html

@stevehipwell

@zekena2 did you check if the field was being managed by EKS? AFAIK you can't use the managed kube-proxy add-on and change this value. You either need to manage it yourself or upgrade to v1.22.
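
One way to check field ownership (a sketch; the flag requires a reasonably recent kubectl):

    # Show server-side-apply field managers; look for an EKS add-on manager
    # owning metricsBindAddress:
    kubectl -n kube-system get cm kube-proxy-config -o yaml --show-managed-fields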

@zekena2 commented Jun 3, 2022

Yes, it seems all the config-related fields are managed by EKS, which basically means that nothing is configurable. Regardless of the metrics problem, I should be able to configure the add-on as I want.

@stevehipwell

@zekena2 you could install the managed add-on for v1.21 and then disable it before patching. When you upgrade to v1.22 you can just re-enable the managed add-on and it should just work.
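
A sketch of that with the AWS CLI (cluster name is hypothetical; --preserve keeps the add-on software running while removing the managed lifecycle):

    # Detach the managed add-on without deleting kube-proxy from the cluster:
    aws eks delete-addon --cluster-name my-cluster --addon-name kube-proxy --preserve
    # The ConfigMap is now unmanaged and can be patched as described above.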

@zekena2 commented Jun 3, 2022

Thanks @stevehipwell, I got it working that way. I removed the add-on with "preserve on cluster" checked, then edited the configMap, and the change persisted. I still think it's not an ideal way to configure the add-on.

@samox73 commented Aug 16, 2022

According to https://docs.aws.amazon.com/eks/latest/userguide/kubernetes-versions.html, starting from Kubernetes 1.22 on EKS, the metricsBindAddress should be set to expose metrics outside the pod:

Starting with Amazon EKS version 1.22, kube-proxy is configured by default to expose Prometheus metrics outside the pod. This behavior change addresses the request made in containers roadmap issue

However, in both our clusters (prod is on 1.22 while staging is on 1.23) the kube-proxy-config ConfigMap still reads:

metricsBindAddress: 127.0.0.1:10249

I tried installing the EKS add-on manually but that didn't change anything either.

@TBBle commented Aug 16, 2022

Were those clusters installed at an earlier version and upgraded? If they were upgraded from a version before the change, then they won't have changed the setting as part of the upgrade. See #657 (comment)

@samox73 commented Aug 17, 2022

Were those clusters installed at an earlier version and upgraded? If they were upgraded from a version before the change, then they won't have changed the setting as part of the upgrade. See #657 (comment)

Yes, they were upgraded from 1.21; however, we have only ever used the managed add-on.

@julianxhokaxhiu

When I upgraded my clusters, I removed the add-on first and then re-added it via the AWS EKS web console, and it worked perfectly. Prometheus is able to fetch metrics from kube-proxy now. If you just upgrade the add-on it probably won't touch the config, so try the remove-and-re-add approach.

@andrew-pickin-epi commented Nov 9, 2022

I have the managed add-on, and upgrading to 1.22 resulted in metricsBindAddress: 0.0.0.0:10249.
However, I used the --force option when upgrading.

@samox73 commented Nov 16, 2022

I have the managed add-on, and upgrading to 1.22 resulted in metricsBindAddress: 0.0.0.0:10249. However, I used the --force option when upgrading.

@andrew-pickin-epi see the reply just above yours.

@andrew-pickin-epi

I have the managed add-on, and upgrading to 1.22 resulted in metricsBindAddress: 0.0.0.0:10249. However, I used the --force option when upgrading.

@andrew-pickin-epi see the reply just above yours.

My point is that you don't need to remove and re-apply. Using --force overrides the existing settings and is the documented way to upgrade, per the AWS documentation: https://docs.aws.amazon.com/eks/latest/userguide/managing-add-ons.html
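
For reference, a sketch of the equivalent AWS CLI call (cluster name and add-on version are illustrative):

    # OVERWRITE resolves config conflicts in favor of the add-on, like --force:
    aws eks update-addon --cluster-name my-cluster --addon-name kube-proxy \
        --addon-version v1.22.6-eksbuild.1 --resolve-conflicts OVERWRITE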

@praveenchandran

Quick fix:
Edit the kube-proxy DaemonSet and pass an additional parameter to kube-proxy: --metrics-bind-address 0.0.0.0:10249.
The pods will restart and the port will be open.

@rfcku commented Jul 5, 2024

Quick fix: Edit the kube-proxy DaemonSet and pass an additional parameter to kube-proxy: --metrics-bind-address 0.0.0.0:10249. The pods will restart and the port will be open.

Indeed, this resolved my issue as well: when updating from EKS 1.22 to 1.23 the proxy config still pointed to 127.0.0.1:10249, so after updating the kube-proxy-config configmap to point to 0.0.0.0:10249 and restarting the kube-proxy daemonset, the issue was resolved.
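
If it helps anyone, the restart step can be a one-liner (assumes a kubectl new enough for rollout restart):

    # Roll the DaemonSet so the pods re-read the updated ConfigMap:
    kubectl -n kube-system rollout restart ds kube-proxy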
