
Bug 1538389 - Allow node IP change to update Host IP in HostSubnet resource #18281

Merged

Conversation

pravisankar

  • Node status addresses may contain both the old and the new IP address, and by
    validating against these addresses we may not be able to update the HostSubnet
    with the new node IP. The OpenShift node service keeps waiting for its HostSubnet
    resource to carry the new Host IP and eventually fails (see the sketch after
    this description).

  • This change fixes the above issue by reverting the change for node flip/flop.
    The recommended/correct way to handle the node flip/flop case is to specify the
    nodeIP config option in node-config.yaml.
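To make the first bullet concrete, here is a minimal Go sketch of the flip/flop scenario. It is illustrative only: the IP values, the hostIP variable, and the use of the upstream corev1 types are assumptions, not the actual master code. It shows why a check of the form "is the recorded Host IP still among the node's status addresses?" keeps accepting the stale IP and never forces a HostSubnet update.

package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
)

func main() {
	// Hypothetical node whose status still lists the old IP alongside the
	// new one after an address change (values are made up).
	node := corev1.Node{
		Status: corev1.NodeStatus{
			Addresses: []corev1.NodeAddress{
				{Type: corev1.NodeInternalIP, Address: "10.0.0.5"},  // old IP
				{Type: corev1.NodeInternalIP, Address: "10.0.0.42"}, // new IP
			},
		},
	}

	hostIP := "10.0.0.5" // stale HostIP recorded in the HostSubnet

	// A check of the form "is the recorded HostIP still one of the node's
	// addresses?" still succeeds, so the HostSubnet is never updated to the
	// new IP and the node service keeps waiting for it.
	for _, addr := range node.Status.Addresses {
		if addr.Address == hostIP {
			fmt.Println("recorded HostIP still matches a status address; no update triggered")
		}
	}
}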

@openshift-ci-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Jan 25, 2018
@openshift-ci-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Jan 25, 2018
@pravisankar added kind/bug Categorizes issue or PR as related to a bug. component/networking sig/networking labels Jan 25, 2018
@pravisankar
Author

@openshift/sig-networking PTAL

Contributor

imcsk8 commented Jan 25, 2018

LGTM

@pravisankar
Author

/retest


var nodeIP string
if len(node.Status.Addresses) > 0 && node.Status.Addresses[0].Address != "" {
	nodeIP = node.Status.Addresses[0].Address
Contributor

Do we want to check on NodeAddressType here now, e.g. to prefer NodeInternalIP? I'm assuming that netutils.GetNodeIP() would most likely return the private/internal address for the node, given it's a DNS lookup and a cloud provider would likely return the private/internal address rather than the public one.

Otherwise LGTM; better to be more explicit in the node config than rely on "magic" in the master to figure out this stuff.

Author

It is possible that the node address obtained from a DNS lookup differs from what the cloud provider returns. Currently the kubelet populates the first node address as the NodeInternalIP, but I agree that we should not depend on kubelet internals.
Yes, I will make it explicit here.

Contributor

knobunc commented Jan 29, 2018

@rajatchopra PTAL

Contributor

@knobunc left a comment

The approach seems reasonable to me too. Thanks.

Ravi Sankar Penta added 2 commits January 29, 2018 11:56
- If the IP address is not populated in the node object, the master should not
  try to resolve the node address, because it may not match the exact IP
  address used by the node.
- We can safely ignore the event in that case; when the IP is populated in
  the node status object we will get another event, and that is the right
  time to create/update the HostSubnet.
- Node status addresses may contain both the old and the new IP address, and by
  validating against these addresses we may not be able to update the HostSubnet
  with the new node IP. The OpenShift node service waits for its HostSubnet
  resource to carry the new Host IP and eventually fails.

- This change fixes the above issue by reverting the change for node flip/flop.
  The recommended/correct way to handle the node flip/flop case is to specify the
  nodeIP config option in node-config.yaml.
@pravisankar
Author

Updated: the address with type NodeInternalIP is now used instead of node.Status.Addresses[0].Address to fetch the node IP.
@dcbw PTAL
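For reference, a minimal sketch of that selection, illustrative only: it uses the upstream corev1 (k8s.io/api/core/v1) types rather than the exact types in this repository, and the helper name nodeInternalIP is made up. The idea is to pick the first non-empty address of type NodeInternalIP instead of blindly taking Addresses[0].

package main

import (
	"fmt"

	corev1 "k8s.io/api/core/v1"
)

// nodeInternalIP returns the first non-empty address of type NodeInternalIP,
// instead of relying on node.Status.Addresses[0] being the internal address.
func nodeInternalIP(node *corev1.Node) string {
	for _, addr := range node.Status.Addresses {
		if addr.Type == corev1.NodeInternalIP && addr.Address != "" {
			return addr.Address
		}
	}
	return ""
}

func main() {
	node := &corev1.Node{
		Status: corev1.NodeStatus{
			Addresses: []corev1.NodeAddress{
				{Type: corev1.NodeHostName, Address: "node-1.example.com"},
				{Type: corev1.NodeInternalIP, Address: "10.0.0.42"},
			},
		},
	}
	fmt.Println(nodeInternalIP(node)) // prints 10.0.0.42
}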

	}
}
if len(nodeIP) == 0 {
	glog.Errorf("Node IP is not set for node %s, skipping %s event, node: %v", node.Name, eventType, node)
Contributor

This entire file needs to use utilruntime.HandleError. Any kube code that eats an error (that uses glog.Errorf) should use utilruntime.HandleError instead. That ensures that we get rate limiting, that errors are reported to sentry, and also that hotloop requests get processed.

This can be a follow-up PR, but please correct it before we ship 3.9.
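A rough sketch of the suggested pattern, assuming the standard k8s.io/apimachinery/pkg/util/runtime import; reportMissingNodeIP is a made-up helper, not code from this PR. Instead of eating the error with glog.Errorf, the error is handed to utilruntime.HandleError so it is rate limited and passed to the registered error handlers.

package main

import (
	"fmt"

	utilruntime "k8s.io/apimachinery/pkg/util/runtime"
)

// reportMissingNodeIP is a hypothetical helper showing the pattern: rather
// than logging and dropping the error, route it through HandleError.
func reportMissingNodeIP(nodeName, eventType string) {
	utilruntime.HandleError(fmt.Errorf("node IP is not set for node %s, skipping %s event", nodeName, eventType))
}

func main() {
	reportMissingNodeIP("node-1", "ADD")
}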

Author

Created follow-up PR: #18343

Contributor

@rajatchopra left a comment

good to go
/lgtm
/approve

@openshift-ci-robot added the lgtm Indicates that a PR is ready to be merged. label Jan 30, 2018
@openshift-ci-robot

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: pravisankar, rajatchopra

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

@pravisankar
Author

/retest

@openshift-merge-robot
Contributor

Automatic merge from submit-queue.

@openshift-merge-robot merged commit 48a0122 into openshift:master Jan 30, 2018