
With a Cloud Provider enabled, Windows worker nodes cannot join cluster due to DNS suffixes being passed to kubelet as commands #29782

Closed
rosskirkpat opened this issue Oct 27, 2020 · 2 comments

rosskirkpat (Contributor) commented Oct 27, 2020

What kind of request is this (question/bug/enhancement/feature request):
bug

Steps to reproduce (least amount of steps as possible):

  • Set up multiple DNS suffixes in an AWS DHCP options set
  • Enable the AWS cloud provider in a new mixed cluster
  • Add a new Windows worker node

Result:

The kubelet is unable to start because the DNS suffixes end up space-separated after the --hostname-override value, and the kubelet interprets them as commands.

For example:

--hostname-override=ip-X-X-X-X.XXXXX.com gad.XXXXX.com apa.gad.XXXXX.com eur.gad.XXXXX.com gmea.gad.XXXXX.com nam.XXXXX.com us-east-1.compute.internal --feature-gates=HyperVContainer=true,WindowsGMSA=true

When adding Windows nodes, users will see the following in the kubelet logs:

10328 server.go:163] unknown command: apa.gad.XXXXX.com
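
To illustrate why the extra suffixes surface as stray tokens, here is a minimal Python sketch. It assumes the argument string is split on whitespace, roughly as a command-line parser would do; the kubelet's actual parsing is not reproduced here, and `split_kubelet_args` is a hypothetical helper name used only for this illustration.

```python
def split_kubelet_args(arg_string):
    """Split an argument string on whitespace and separate flags
    (tokens starting with "--") from stray positional tokens."""
    tokens = arg_string.split()
    flags = [t for t in tokens if t.startswith("--")]
    stray = [t for t in tokens if not t.startswith("--")]
    return flags, stray


# The argument string from the report above, with the same redactions.
ARGS = (
    "--hostname-override=ip-X-X-X-X.XXXXX.com gad.XXXXX.com "
    "apa.gad.XXXXX.com eur.gad.XXXXX.com gmea.gad.XXXXX.com "
    "nam.XXXXX.com us-east-1.compute.internal "
    "--feature-gates=HyperVContainer=true,WindowsGMSA=true"
)

flags, stray = split_kubelet_args(ARGS)
print("flags:", flags)
print("stray:", stray)
```

Only the first hostname stays attached to `--hostname-override`; every remaining suffix becomes a separate token that is not a flag.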

Other details that may be helpful:

The issue can be addressed in the following PowerShell script by truncating the returned value to the first hostname.

https://github.com/rancher/rke-tools/blob/v0.1.65/windows/cloud-provider.psm1#L229-L232
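
The truncation idea can be sketched as follows. This is a Python illustration of the approach only, not the code in cloud-provider.psm1 (which is PowerShell). It assumes the value returned for the hostname is a whitespace-separated list whose first entry is the node's own hostname; `first_hostname` is a hypothetical name.

```python
def first_hostname(raw_value):
    """Return only the first whitespace-separated entry of raw_value.

    Assumption: the first entry is the node hostname and the rest are
    DNS suffixes that must not reach the kubelet command line.
    Returns an empty string if raw_value is empty or only whitespace.
    """
    parts = raw_value.split()
    return parts[0] if parts else ""


# Redacted value taken from the report above.
raw = (
    "ip-X-X-X-X.XXXXX.com gad.XXXXX.com apa.gad.XXXXX.com "
    "eur.gad.XXXXX.com gmea.gad.XXXXX.com nam.XXXXX.com "
    "us-east-1.compute.internal"
)
override_flag = "--hostname-override=" + first_hostname(raw)
print(override_flag)
```

With the value truncated this way, the flag contains a single token and no suffix is left over to be parsed separately.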

gz#12873

rosskirkpat changed the title from "Windows worker nodes" to "With a Cloud Provider enabled, Windows worker nodes cannot join cluster due to DNS suffixes being passed to kubelet as commands" Oct 27, 2020
maggieliu added this to the v2.4.9 milestone Oct 27, 2020
sowmyav27 self-assigned this Oct 27, 2020

sowmyav27 (Contributor) commented Oct 27, 2020

Validated on 2.4.9-rc5

  • Deploy Windows clusters with the AWS cloud provider enabled on k8s 1.18, 1.17, 1.16, and 1.15 (Linux nodes: 1 etcd/control plane node and 1 worker node; plus 3 Windows worker nodes)
  • Deploy a PVC and an LB
  • PVC and LB deployed successfully on all clusters except for this issue

sowmyav27 (Contributor) commented

Closing this issue, as the forward-port issue is tracked separately.
