[Bug]: `cluster-autoscaler` does not wait long enough for new server to become available #1278

UBaggeler · 2024-03-15T09:56:58Z

Description

Since midnight, our autoscaler has been having issues launching new nodes (scaling from 1 to 3). Creating the new servers seems to take longer than the default timeout of 5 min:

failed to create error: failed to start server hcloud-autoscaled-xyz error: timeout waiting for server hcloud-autoscaled-xyz

In this case cluster-autoscaler removes/deletes the nodes and repeatedly tries to spin up new servers, without success.

Manually increasing the timeout (for example to 15min) by setting the env variable HCLOUD_SERVER_CREATION_TIMEOUT on the autoscaler deployment resolves the issue.
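
For reference, the manual workaround looks roughly like the fragment below on the autoscaler Deployment. This is a minimal sketch: the container name and the surrounding manifest structure are assumptions, and the value is assumed to be read as whole minutes (consistent with the 5 min default mentioned above), so check it against the Hetzner provider documentation for your cluster-autoscaler version.

```yaml
# Fragment of the cluster-autoscaler Deployment spec.
# The container name "cluster-autoscaler" is an assumption; adjust to your manifest.
spec:
  template:
    spec:
      containers:
        - name: cluster-autoscaler
          env:
            - name: HCLOUD_SERVER_CREATION_TIMEOUT
              value: "15"  # assumed unit: minutes (default is 5)
```

Note that a change applied directly to the live Deployment may be overwritten the next time the manifest is rendered from the template.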

Unfortunately, the autoscaler.yaml.tpl file does not currently allow setting this environment variable.

Kube.tf file

n/a

Screenshots

No response

Platform

Linux

zarevavasyl · 2024-03-15T12:35:04Z

Thanks! :)

mysticaltech · 2024-03-16T06:22:58Z

@UBaggeler Your fix was just released as part of v2.13.4 🚀

UBaggeler added the `bug` label Mar 15, 2024

UBaggeler mentioned this issue Mar 15, 2024

autoscaler: Add support to set HCLOUD_SERVER_CREATION_TIMEOUT #1279

Merged

mysticaltech closed this as completed in #1279 Mar 16, 2024

mysticaltech mentioned this issue Mar 16, 2024

Tweak autoscaler creation timeout #1284

Merged
