Unregister node from RKE2 after agent deletion #12

Closed
zifeo opened this issue Oct 8, 2021 · 11 comments


zifeo commented Oct 8, 2021

Currently, after an agent is deleted, RKE2 keeps its node registered as NotReady:

NAME                     STATUS     ROLES                       AGE   VERSION
rke-cluster-blue-001     Ready      <none>                      65m   v1.21.5+rke2r2
rke-cluster-green-001    NotReady   <none>                      65m   v1.21.5+rke2r2

remche commented Oct 11, 2021

As stated in the documentation, you need to manually drain and remove the node before downscaling a node pool.
I did not find a clean way to automate it; using SSH on the server node led to chaotic behavior.
I will be really happy if a clean implementation is proposed though ;)
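
For reference, the manual procedure described in the documentation amounts to something like the following, assuming kubectl access to the cluster (the node name is taken from the listing above):

kubectl drain rke-cluster-green-001 --ignore-daemonsets --delete-emptydir-data
kubectl delete node rke-cluster-green-001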


zifeo commented Oct 11, 2021

@remche Yes, I've seen this. The issue is that even upgrades are not stable if there is no volume on the agent nodes. Is there a reason/use case for using a server or node without a volume?


remche commented Oct 14, 2021

@zifeo I did not manage to reproduce this issue. Can you provide a sanitized configuration?

Is there a reason/use case for using a server or node without a volume?

In my use case I only use ephemeral volumes for VMs, without any problem.


zifeo commented Oct 14, 2021

@remche I have been experimenting with a different setup here; I will update if I find something stable and portable.


remche commented Oct 14, 2021

@zifeo Nice, do not hesitate to contribute back ;)
Would be very happy to find a clean autoscaling method!


zifeo commented Oct 14, 2021

As stated in the documentation, you need to manually drain and remove the node before downscaling a node pool.

This seems related to k3s-io/k3s#1264.

As for autoscaling, I would suggest something like orchestration_stack_v1 to keep a coherent Terraform state. This should work for simple autoscaling behaviours, but a custom/vanilla cloud provider could be written for advanced use cases.
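
For illustration, a minimal sketch of that idea with the openstack_orchestration_stack_v1 resource from terraform-provider-openstack could look like this (stack name, group sizes, flavor and image are hypothetical; network and key pair wiring is omitted for brevity):

resource "openstack_orchestration_stack_v1" "agent_pool" {
  name             = "rke2-agent-pool" # hypothetical stack name
  disable_rollback = true

  # Inline Heat template defining an autoscaling group of agent VMs.
  template_opts = {
    Bin = <<-EOT
      heat_template_version: 2016-10-14
      resources:
        agents:
          type: OS::Heat::AutoScalingGroup
          properties:
            min_size: 1
            max_size: 5
            resource:
              type: OS::Nova::Server
              properties:
                flavor: m1.small    # hypothetical flavor
                image: ubuntu-20.04 # hypothetical image
    EOT
  }

  # Optional empty Heat environment, as in the provider's own example.
  environment_opts = {
    Bin = "\n"
  }
}

Heat then owns the group membership, so the Terraform state only tracks the stack itself while the group scales.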


remche commented Oct 15, 2021

As for autoscaling, I would suggest something like orchestration_stack_v1 to keep a coherent Terraform state. This should work for simple autoscaling behaviours, but a custom/vanilla cloud provider could be written for advanced use cases.

I came to the same conclusion. Using a Heat stack seems pretty hacky to me; I would prefer a custom cluster-autoscaler, but it's more work :)


zifeo commented Oct 15, 2021

@remche I will build a PoC later to see how stable it could be. The issue with a custom autoscaler is compatibility with the Terraform state. A remote & shared backend, maybe, but this seems even more hacky.


remche commented Oct 15, 2021

My first thought would be to use a remote state backend that supports locking. But I'm not sure there is a way to retrieve the current backend configuration from data sources...
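
For reference, a minimal sketch of reading such a state from an external component via the terraform_remote_state data source (backend type, bucket and key are hypothetical); as noted above, the backend configuration has to be repeated by hand rather than discovered:

data "terraform_remote_state" "rke2" {
  backend = "s3" # any backend that supports locking, e.g. S3 with a DynamoDB lock table

  config = {
    bucket = "tf-state"               # hypothetical bucket
    key    = "rke2/terraform.tfstate" # hypothetical key
    region = "eu-west-1"
  }
}

Root-level outputs of the RKE2 configuration are then reachable as data.terraform_remote_state.rke2.outputs.<name>.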


stale bot commented Dec 14, 2021

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

stale bot added the wontfix label Dec 14, 2021
stale bot closed this as completed Dec 28, 2021

zifeo commented Jan 28, 2022

@zifeo Nice, do not hesitate to contribute back ;)
Would be very happy to find a clean autoscaling method!

Node deletion seems stable so far. I am happy to bring https://github.com/zifeo/terraform-openstack-rke2 over (merge it all here), but the exposed module interface is rather different. What is your point of view on this? This is why I chose to start from scratch originally.
