cluster-autoscaler clusterapi provider performance degrades when there are a high number of node groups #6784

elmiko · 2024-04-30T13:32:51Z

Which component are you using?:

cluster-autoscaler

What version of the component are you using?:

Component version: all versions up to and including 1.30.0

What k8s version are you using (kubectl version)?:

this affects all kubernetes versions that are compatible with the cluster autoscaler

What environment is this in?:

clusterapi provider, with more than 50 node groups (eg. MachineDeployments, MachineSets, MachinePools)

What did you expect to happen?:

expect cluster autoscaler to operate as normal

What happened instead?:

as the number of node groups increases, the performance of the autoscaler appears to degrade. it takes longer and longer to process the scan interval and in some cases (when node groups are in the 100s) it can take more than 40 minutes to add a new node when pods are pending.

How to reproduce it (as minimally and precisely as possible):

setup a cluster with clusterapi and cluster autoscaler
create 100 machinedeployments
configure autoscaler to recognize all 100 machinedeployments as node groups
create a job whose pods stay pending in the cluster
observe the autoscaler behavior

Anything else we need to know?:

this problem appears related to how the clusterapi provider interacts with the api server. when assessing activity in the cluster, the provider will query the api server for all the node groups, then query again for scalable resources, and potentially another time for the infrastructure machine template. i have a feeling that this interaction is causing the issues.
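
To make the suspected request pattern concrete, here is a minimal sketch that only counts requests. Everything in it (`fakeAPI`, `listNodeGroups`, `getScalableResource`, `getInfraTemplate`, `scanOnce`) is a hypothetical stand-in, not code from the clusterapi provider, and it assumes the worst case where the infrastructure template is looked up for every node group.

```go
package main

import "fmt"

// fakeAPI is a hypothetical stand-in for the API server; it only counts requests.
type fakeAPI struct{ calls int }

// listNodeGroups models the single query that returns all node groups.
func (a *fakeAPI) listNodeGroups(n int) []string {
	a.calls++
	groups := make([]string, n)
	for i := range groups {
		groups[i] = fmt.Sprintf("md-%d", i)
	}
	return groups
}

// getScalableResource models the follow-up query for one scalable resource.
func (a *fakeAPI) getScalableResource(name string) string {
	a.calls++
	return name + "-scalable"
}

// getInfraTemplate models the (assumed always needed) template query.
func (a *fakeAPI) getInfraTemplate(name string) string {
	a.calls++
	return name + "-template"
}

// scanOnce models one pass: one list, then two lookups per node group.
func scanOnce(api *fakeAPI, n int) int {
	for _, g := range api.listNodeGroups(n) {
		_ = api.getScalableResource(g)
		_ = api.getInfraTemplate(g)
	}
	return api.calls
}

func main() {
	for _, n := range []int{10, 50, 100} {
		fmt.Printf("%d node groups -> %d requests per scan\n", n, scanOnce(&fakeAPI{}, n))
	}
}
```

Under these assumptions the request count grows linearly (1 + 2n per scan), which would match the observation that things get slower as node groups are added. It says nothing about how expensive each request actually is.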

i think it's possible that extending the scan interval time might alleviate some of the issues, but i have not confirmed anything yet.

enxebre · 2024-04-30T13:41:29Z

/area provider/cluster-api

elmiko · 2024-04-30T14:36:28Z

i've been hacking on a PR to add some timing metrics on the NodeGroups interface function. i believe we spend the most time in this function and have been trying to prove out how the number of node groups affects the time that this call takes.

elmiko@1a5d9cd

enxebre · 2024-05-07T11:48:28Z

I don't think kas calls are the main bottleneck, but rather the cloudprovider.NodeGroup function implementation. Currently it takes ~20 seconds with ~90 MachineSets. #6796 avoids an expensive loop and pointer copies, bringing each NodeGroups call down to ~5 seconds.
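
As a generic illustration of why copying inside a loop can be costly in Go (this is not the diff from #6796, and `machineSet`, `byValue`, and `byIndex` are hypothetical), compare ranging by value with indexing into the slice:

```go
package main

import "fmt"

// machineSet is a hypothetical, deliberately bulky record.
type machineSet struct {
	name    string
	payload [4096]byte
}

// byValue copies every element before taking its address.
func byValue(items []machineSet) []*machineSet {
	out := make([]*machineSet, 0, len(items))
	for _, ms := range items {
		ms := ms // explicit per-iteration copy of the whole struct
		out = append(out, &ms)
	}
	return out
}

// byIndex takes the address of each element in place, so nothing is copied.
func byIndex(items []machineSet) []*machineSet {
	out := make([]*machineSet, 0, len(items))
	for i := range items {
		out = append(out, &items[i])
	}
	return out
}

func main() {
	items := make([]machineSet, 3)
	copies, refs := byValue(items), byIndex(items)
	items[0].name = "renamed"
	// copies[0] is a detached copy; refs[0] points at items[0].
	fmt.Println(copies[0].name == "renamed", refs[0].name == "renamed") // false true
}
```

With large elements and many node groups, the by-value form allocates and copies on every iteration of every call, which is the kind of cost that adds up when the function runs on each scan.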

elmiko · 2024-05-07T13:33:02Z

it seems we might have multiple areas for improvement. when i observed behavior with 50 to 75 node groups, i could see the performance becoming worse over time. it appeared that we might have inefficiency in the way we handle all the various cluster-api CRs.

adrianmoisey · 2024-07-08T18:54:48Z

/area cluster-autoscaler

songminglong · 2024-08-22T08:09:45Z

I have encountered a situation where the startup process of the autoscaler (provider is alicloud) takes a very long time. The reason is that MixedTemplateNodeInfoProvider:Process() builds a nodeinfo for each nodegroup. During initialization, this function makes a DescribeScalingInstances request to the aliyun provider, and each request takes 2s. My cluster has 4k nodes, so the startup phase takes a very long time.

I don’t know if this issue is caused by the long startup process of ca, or the long loop phase after startup?

songminglong · 2024-08-22T08:47:10Z

The cluster-api provider could add a nodegroup cache to cache nodegroups, as the AWS provider does in AwsManager.

songminglong · 2024-08-22T09:02:17Z

/cc

elmiko added the kind/bug Categorizes issue or PR as related to a bug. label Apr 30, 2024

k8s-ci-robot added the area/provider/cluster-api Issues or PRs related to Cluster API provider label Apr 30, 2024

enxebre mentioned this issue May 7, 2024

Avoid expesive pointer copy in capi nodegroup #6796

Merged

k8s-ci-robot added the area/cluster-autoscaler label Jul 8, 2024
