Kubelet does not update node-labels from flags if they change #59314

Closed
smarterclayton opened this issue Feb 3, 2018 · 13 comments
Assignees: smarterclayton
Labels: sig/auth Categorizes an issue or PR as relevant to SIG Auth. · sig/node Categorizes an issue or PR as relevant to SIG Node.

Comments

@smarterclayton
Contributor

Once a kubelet has created a node object for itself via registration, the labels aren't updated if the --node-labels flag ever changes. I see the labels being set into the ObjectMeta in the init code, but I don't see the server being updated (the external ID hasn't changed).

Recreate:

  1. Start kubelet with no --node-labels, wait for it to register
  2. Update kubelet to start with --node-labels a=b

Expect:

  1. API object for node to have labels a=b

Actual:

  1. API object labels don't change until node object is deleted

I don't see any obvious place where this has been broken, but it looks like the patch calculated by nodeutil only includes the conditions.
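
Until the kubelet reconciles labels itself, one workaround is to patch the Node object out of band with the same label set. A minimal sketch using client-go follows; the node name `node-1`, the label set, and the kubeconfig path are placeholders, and the context-aware `Patch` signature assumes a reasonably recent client-go:

```go
package main

import (
	"context"
	"encoding/json"
	"fmt"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/tools/clientcmd"
)

func main() {
	// Load the default kubeconfig; placeholder for whatever credentials apply.
	config, err := clientcmd.BuildConfigFromFlags("", clientcmd.RecommendedHomeFile)
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(config)

	// The labels the kubelet was (re)started with; placeholder values.
	labels := map[string]string{"a": "b"}
	patch, err := json.Marshal(map[string]interface{}{
		"metadata": map[string]interface{}{"labels": labels},
	})
	if err != nil {
		panic(err)
	}

	// Strategic merge patch adds/updates these labels without touching others.
	node, err := client.CoreV1().Nodes().Patch(context.TODO(), "node-1",
		types.StrategicMergePatchType, patch, metav1.PatchOptions{})
	if err != nil {
		panic(err)
	}
	fmt.Println("labels now:", node.Labels)
}
```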

@k8s-ci-robot k8s-ci-robot added the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Feb 3, 2018
@smarterclayton smarterclayton added the sig/node Categorizes an issue or PR as relevant to SIG Node. label Feb 3, 2018
@k8s-ci-robot k8s-ci-robot removed the needs-sig Indicates an issue or PR lacks a `sig/foo` label and requires one. label Feb 3, 2018
@smarterclayton smarterclayton self-assigned this Feb 3, 2018
@jhorwit2
Contributor

jhorwit2 commented Feb 3, 2018

@smarterclayton this is intended functionality. The flag's documentation (see below) says the labels are only added when registering.

https://kubernetes.io/docs/reference/generated/kubelet/

--node-labels mapStringString Labels to add when registering the node in the cluster. Labels must be key=value pairs separated by ','.
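
In other words, labels from the flag only land on the Node object the kubelet builds at registration. A simplified sketch of that behavior (not the actual kubelet code; the function name is illustrative):

```go
package sketch

import (
	v1 "k8s.io/api/core/v1"
	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
)

// initialNode sketches the registration-only semantics: labels parsed from
// --node-labels are placed on the Node object built when the kubelet first
// registers, and are never re-applied to an already-existing Node afterwards.
func initialNode(nodeName string, nodeLabels map[string]string) *v1.Node {
	labels := map[string]string{}
	for k, v := range nodeLabels {
		labels[k] = v // copied once, at registration time only
	}
	return &v1.Node{
		ObjectMeta: metav1.ObjectMeta{
			Name:   nodeName,
			Labels: labels,
		},
	}
}
```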

@jhorwit2
Contributor

jhorwit2 commented Feb 3, 2018

I think we should probably rename it to --register-with-labels so it matches --register-with-taints, especially since it's still an alpha feature, and that name is clearer.

@smarterclayton
Contributor Author

This is going to be broken with dynamic configuration - as a user, it was unexpected for new labels not to be added. I think it's time to revisit this.

@smarterclayton
Contributor Author

@mtaufen @liggitt re dynamic reconfiguration.

@smarterclayton
Contributor Author

Note: it's silly that we call this an alpha feature when it's been in the kubelet since 1.1. We should own up to it one way or another.

@liggitt
Member

liggitt commented Feb 4, 2018

Kubelet ownership of its own labels is not deterministic and is problematic. The kubelet updating labels on an existing Node API object at startup is not the direction we want to go, since it removes the ability to centrally manage those labels via the API.

@smarterclayton
Contributor Author

I disagree - the kubelet owns its own labels (those specified via --node-labels) at a minimum. I agree that node labels should be manageable via an API, especially those set by humans. I do not agree that a fleet of bootstrapped nodes in scaling groups should be unable to specify their own labels and keep them up to date based on config. And given that the node controller can stomp down nodes, the newly bootstrapped nodes in those scale groups will end up with those labels anyway (once the node object is deleted), so a weird inconsistent state where labels sometimes show up is pointless and confusing.

Since we're not going to remove --node-labels from the kubelet, and we are moving headlong towards bootstrapped and centrally configured nodes (dynamic node config), the node should manage, at a minimum, the set of labels and taints specified on the CLI. I don't actually care whether the old labels are removed, since most of those label changes will be managed by rotating scale groups, but the current state is dumb.
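
A minimal sketch of the "merge, don't remove" reconciliation being argued for here (illustrative only; the function name `reconcileCLILabels` is made up, not proposed kubelet code):

```go
package sketch

// reconcileCLILabels merges the labels given on the kubelet's command line
// into the existing Node labels without touching labels it does not know
// about, and reports whether an update would be needed.
func reconcileCLILabels(existing, cli map[string]string) (map[string]string, bool) {
	merged := map[string]string{}
	for k, v := range existing {
		merged[k] = v
	}
	changed := false
	for k, v := range cli {
		if merged[k] != v {
			merged[k] = v
			changed = true
		}
	}
	return merged, changed
}
```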

@liggitt
Member

liggitt commented Feb 4, 2018

I disagree - the kubelet owns its own labels

That is incompatible with using labels to segment node groups for anything security-related. Why wouldn't the person applying config to nodes also label them?

@smarterclayton
Contributor Author

In that case we should just make the node controller do it based on the config - it can observe the active config correctly. But if we really care about security, then the node shouldn't be able to set taints or labels on create either, and only the node controller should manage that set. That seems strictly better, although disabling node labels ends up being a painful deprecation and security process.

@smarterclayton
Contributor Author

Disabling/preventing the kubelet's --node-labels and --register-with-taints flags, that is.

@liggitt
Member

liggitt commented Feb 6, 2018

if we really care about security, then the node shouldn't be able to set taints or labels on create either, and only the node controller should manage that set. That seems strictly better, although disabling node labels ends up being a painful deprecation and security process.

That is the goal; we're trying to work through how to transition there. It probably looks like whitelisting the existing labels/taints that kubelets set, but in the meantime I want to avoid expanding kubelet control over taints/labels.

cc @mikedanese @kubernetes/sig-auth-misc
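
A rough sketch of the shape such a whitelist check could take (the allowed keys and prefixes below are placeholders, not the actual policy):

```go
package sketch

import "strings"

// Placeholder whitelist: label keys a kubelet would be allowed to self-set.
var (
	allowedLabels = map[string]bool{
		"kubernetes.io/hostname": true,
		"kubernetes.io/os":       true,
		"kubernetes.io/arch":     true,
	}
	allowedPrefixes = []string{"node.kubernetes.io/"}
)

// allowedKubeletLabel reports whether a kubelet may self-set the given label
// key: it must be explicitly allowed or fall under an allowed prefix.
func allowedKubeletLabel(key string) bool {
	if allowedLabels[key] {
		return true
	}
	for _, prefix := range allowedPrefixes {
		if strings.HasPrefix(key, prefix) {
			return true
		}
	}
	return false
}
```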

@k8s-ci-robot k8s-ci-robot added the sig/auth Categorizes an issue or PR as relevant to SIG Auth. label Feb 6, 2018
@mikedanese
Member

mikedanese commented Feb 8, 2018

This might be a dup of #18394, but it is more specific.

@liggitt
Member

liggitt commented Feb 10, 2018

this seems like the same issue as #18394
/close

openshift-merge-robot added a commit to openshift/openshift-ansible that referenced this issue Mar 11, 2018
Automatic merge from submit-queue.

Make openshift-ansible use static pods to install the control plane, make nodes prefer bootstrapping

1. Nodes continue to be configured for bootstrapping (as today)
2. For bootstrap nodes, we write a generic bootstrap-node-config.yaml that contains static pod references and any bootstrap config, and then use that to start a child kubelet using `--write-flags` instead of launching the node ourselves.  If a node-config.yaml is laid down in `/etc/origin/node` it takes precedence.
  3. For 3.10 we want dynamic node config from Kubernetes to pull down additional files, but there are functional gaps.  For now, the openshift-sdn container has a sidecar that syncs node config to disk and updates labels (the kubelet doesn't update labels, kubernetes/kubernetes#59314; see the sketch after this list)
4. On the masters, if openshift_master_bootstrap_enabled we generate the master-config.yaml and the etcd config, but we don't start etcd or the masters (no services installed)
5. On the masters, we copy the static files into the correct pod-manifest-path (/etc/origin/node/pods) or similar
6. The kubelet at that point should automatically pick up the new static files and launch the components
7. We wait for them to converge
8. We install openshift-sdn as the first component, which allows nodes to go ready and start installing things.  There is a gap here where the masters are up, the nodes can bootstrap, but the nodes are not ready because no network plugin is installed.
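
For item 3, a minimal sketch of what a label-syncing sidecar loop could look like; the node name, desired label set, and sync interval are placeholders (a real sidecar would take them from the downward API and the synced node config), and the context-aware client-go calls assume a reasonably recent client-go:

```go
package main

import (
	"context"
	"encoding/json"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/types"
	"k8s.io/client-go/kubernetes"
	"k8s.io/client-go/rest"
)

// syncLabels patches the node only when a desired label is missing or stale,
// using the same strategic merge patch as the earlier sketch.
func syncLabels(client kubernetes.Interface, nodeName string, desired map[string]string) error {
	node, err := client.CoreV1().Nodes().Get(context.TODO(), nodeName, metav1.GetOptions{})
	if err != nil {
		return err
	}
	stale := false
	for k, v := range desired {
		if node.Labels[k] != v {
			stale = true
			break
		}
	}
	if !stale {
		return nil
	}
	patch, err := json.Marshal(map[string]interface{}{
		"metadata": map[string]interface{}{"labels": desired},
	})
	if err != nil {
		return err
	}
	_, err = client.CoreV1().Nodes().Patch(context.TODO(), nodeName,
		types.StrategicMergePatchType, patch, metav1.PatchOptions{})
	return err
}

func main() {
	config, err := rest.InClusterConfig()
	if err != nil {
		panic(err)
	}
	client := kubernetes.NewForConfigOrDie(config)

	// Placeholders: a real sidecar would read these from the downward API and
	// the synced node config rather than hard-coding them.
	nodeName := "node-1"
	desired := map[string]string{"a": "b"}

	for {
		_ = syncLabels(client, nodeName, desired) // a real sidecar would log errors
		time.Sleep(30 * time.Second)
	}
}
```

Patching only when a label is missing or stale keeps the sidecar from fighting anything that manages unrelated labels centrally.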

Challenges at this point:

* The master shims (`master-logs` and `master-restart`) need to deal with CRI-O and systemd.  Ideally this is a temporary shim until we remove systemd for these components and have cri-ctl installed.
* We need to test failure modes of the static pods
* Testing

Further exploration things:

* need to get all the images using image streams or properly replaced into the static pods
* need to look at upgrades and updates
* disk locations become our API (`/var/lib/origin`, `/var/lib/etcd`) - how many customers have fiddled with this?
* may need to make the kubelet halt if it hasn't been able to get server/client certs within a bounded window (5m?) so as to ensure that auto-healing happens (openshift/origin#18430)
* have to figure out whether dynamic kubelet config is a thing we can rely on for 3.10 (@liggitt), and what gaps there are with dynamic reconfig
* client-ca.crt is not handled by bootstrapping or dynamic config.  This needs a solution unless we keep the openshift-sdn sidecar around
* kubelet doesn't send sd notify to systemd (kubernetes/kubernetes#59079)

@derekwaynecarr @sdodson @liggitt @deads2k this is the core of self-hosting.