-
Notifications
You must be signed in to change notification settings - Fork 90
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[release-4.7] Bug 1941993: Fix incorrect unmonitoring of egress nodes #279
[release-4.7] Bug 1941993: Fix incorrect unmonitoring of egress nodes #279
Conversation
Currently we incorrectly unmonitoring an egress node whenever any egress IP stops referencing that node, this is incorrect because there might be other egress IPs (defined on a separate namespace) that stills references that egress node. Consider the following example: ``` oc get hostsubnet NAME HOST HOST IP SUBNET EGRESS CIDRS EGRESS IPS huirwang-45-xslwp-master-0 huirwang-45-xslwp-master-0 172.31.249.251 10.128.0.0/23 huirwang-45-xslwp-master-1 huirwang-45-xslwp-master-1 172.31.249.250 10.130.0.0/23 huirwang-45-xslwp-master-2 huirwang-45-xslwp-master-2 172.31.249.45 10.129.0.0/23 [172.31.249.205 172.31.249.206] huirwang-45-xslwp-worker-4hhvt huirwang-45-xslwp-worker-4hhvt 172.31.249.182 10.131.0.0/23 [172.31.249.203 172.31.249.204] huirwang-45-xslwp-worker-mssqh huirwang-45-xslwp-worker-mssqh 172.31.249.112 10.128.2.0/23 oc get netnamespace ... test1 13617042 [172.31.249.203 172.31.249.205] test2 14768742 [172.31.249.204 172.31.249.206] ``` If someone now decides to remove the egress IPs associated with test2: it will lead to having both egress nodes huirwang-45-xslwp-worker-4hhvt and huirwang-45-xslwp-master-2 unmonitored, this can be seen in the SDN logs on the worker node hosting the workload pods as: ``` I0310 13:00:47.380770 4220 egressip.go:411] Removing egress IP 172.31.249.206 from node 172.31.249.45 I0310 13:00:47.380828 4220 vxlan_monitor.go:110] Unmonitoring node 172.31.249.45 I0310 13:00:47.380834 4220 egressip.go:411] Removing egress IP 172.31.249.204 from node 172.31.249.182 I0310 13:00:47.380837 4220 vxlan_monitor.go:110] Unmonitoring node 172.31.249.182 ``` This is however incorrect as the egress IPs in namespace test1 still has a reference to those egress nodes. This patch fixes that. Signed-off-by: Alexander Constantinescu <aconstan@redhat.com>
@openshift-cherrypick-robot: Bugzilla bug 1926931 has been cloned as Bugzilla bug 1941993. Retitling PR to link against new bug. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
@openshift-cherrypick-robot: This pull request references Bugzilla bug 1941993, which is valid. The bug has been moved to the POST state. The bug has been updated to refer to the pull request using the external bug tracker. 6 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Bugzilla (huirwang@redhat.com), skipping review request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/assign @danwinship |
/lgtm |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: danwinship, openshift-cherrypick-robot The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/retest Please review the full test history for this PR and help us cut down flakes. |
/retest Please review the full test history for this PR and help us cut down flakes. |
@openshift-cherrypick-robot: All pull requests linked via external trackers have merged: Bugzilla bug 1941993 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/cherry-pick release-4.6 |
@alexanderConstantinescu: new pull request created: #298 In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
This is an automated cherry-pick of #275
/assign alexanderConstantinescu