New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OCPBUGS-18652: Do not return error if pod IP cannot be retrieved for deletePeerPod
and perf improvements
#1903
OCPBUGS-18652: Do not return error if pod IP cannot be retrieved for deletePeerPod
and perf improvements
#1903
Conversation
It was seen in perf/scale runs that unmarshaling pod annotation on a lot of remote pod deletes (for address set removal) was taking a lot of CPU. Change the order of getting pod IP to use kapi pod.IPs rather than annotation for default network. Signed-off-by: Tim Rozet <trozet@redhat.com> (cherry picked from commit 6b2088b)
Changes-Include: - Minor improvement when ensuring remote zone pod to not always get the pod IPs - Major improvement to searching for colliding IPs where we can filter pods only specific to the node where the original pod resided. This search is applicable when the pods are not kubevirt migratable and the topology type is Layer 3. Signed-off-by: Tim Rozet <trozet@redhat.com> (cherry picked from commit c9fa5bb)
Changes-Include: - On pod add/update the IP not might be available yet, since another node might be allocating its IP. Suppress the error. - Warning was printing a wrapped error object - On completed pods check, if the pod had no IP, it would cause an error and be retried. If the pod is completed and has no IP, we should not retry or log error. --- Note on BP: Only the third item in the list above is valid for this cherry-pick and even then this code path is very different on 4.12 and does not match the original commit. The goal of the said commit is clear - during the deletion path in netpol related funcs, ensure that we do not return an error if we fail to find selected pod IPs that will never be found. Signed-off-by: Tim Rozet <trozet@redhat.com> (cherry picked from commit d7d9a56)
@martinkennelly: This pull request references Jira Issue OCPBUGS-18652, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/lgtm |
deletePeerPod
and perf improvements
GW mode migration jobs will never pass on 4.12. They always need override. @jluhrsen is working on fixing it. |
/test e2e-gcp-ovn unrelated failure |
/test e2e-vsphere-windows |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/approve
/label backport-risk-assessed
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: martinkennelly, npinaeva, trozet The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/jira refresh |
@martinkennelly: This pull request references Jira Issue OCPBUGS-18652, which is valid. The bug has been moved to the POST state. 6 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Jira (jechen@redhat.com), skipping review request. The bug has been updated to refer to the pull request using the external bug tracker. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
/label cherry-pick-approved |
/test 4.12-upgrade-from-stable-4.11-e2e-aws-ovn-upgrade |
/test e2e-metal-ipi-ovn-dualstack Unrelated failures |
/test e2e-openstack-ovn Known failure:
|
Gateway mode migration jobs are perma-failing and will be fixed by: https://issues.redhat.com/browse/OCPBUGS-17391 |
/override ci/prow/e2e-aws-ovn-shared-to-local-gateway-mode-migration Unrelated and should be fixed by https://issues.redhat.com/browse/OCPBUGS-17391 |
@dcbw: Overrode contexts on behalf of dcbw: ci/prow/e2e-aws-ovn-local-to-shared-gateway-mode-migration, ci/prow/e2e-aws-ovn-shared-to-local-gateway-mode-migration In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
|
CI retested everything again sigh - no idea why since every required job passed... It didnt seem to respect the overridden GW mode jobs. @martinkennelly: The following tests failed, say /retest to rerun all failed tests or /retest-required to rerun all mandatory failed tests:
|
/tide refresh |
/ocpbugs cc-qa |
/label qe-approved |
/test e2e-aws-ovn |
/test e2e-aws-ovn-upgrade-local-gateway |
/test e2e-aws-ovn-upgrade Telemeter alert fired again... No indication its related to this PR. Retesting to see if its flake. I downloaded MG and tried to find reason but failed to do so. |
@martinkennelly: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
4af0a99
into
openshift:release-4.12
@martinkennelly: Jira Issue OCPBUGS-18652: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-18652 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. |
Fix included in accepted release 4.12.0-0.nightly-2023-09-27-051001 |
Do not return error if pod IP cannot be retrieved for
podPeerPod
and perf improvements for retrieving the pod IPs.