New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
OCPBUGS-29397: 4.14 High CPU usage with APB CRD #2118
OCPBUGS-29397: 4.14 High CPU usage with APB CRD #2118
Conversation
Signed-off-by: Jordi Gil <jgil@redhat.com>
/hold |
8e2a60d
to
3c91d65
Compare
…e in status to avoid hitting the KAPI server for any pod status change Signed-off-by: Jordi Gil <jgil@redhat.com>
3c91d65
to
b9eeed6
Compare
…g initialized to avoid the risk of time being initialized and slice not having any element Signed-off-by: jordigilh <jgil@redhat.com>
/retest-required |
/retest-required |
Signed-off-by: jordigilh <jgil@redhat.com>
f6144fa
to
b8d9ebe
Compare
/hold cancel |
@jordigilh: The following test failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
/lgtm |
/label backport-risk-assessed |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
/approve |
/retitle OCPBUGS-29397: 4.14 High CPU usage with APB CRD |
@jordigilh: This pull request references Jira Issue OCPBUGS-29397, which is invalid:
Comment The bug has been updated to refer to the pull request using the external bug tracker. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: jordigilh, npinaeva, trozet The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
/label cherry-pick-approved |
/label qe-approved |
From a perf/scale perspective the fix was successfully validated on the ScaleLab at a 120 node scale. |
/jira refresh |
@jordigilh: This pull request references Jira Issue OCPBUGS-29397, which is invalid:
Comment In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
/jira refresh |
@jordigilh: This pull request references Jira Issue OCPBUGS-29397, which is invalid:
Comment In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
/jira refresh |
@jordigilh: This pull request references Jira Issue OCPBUGS-29397, which is invalid:
Comment In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
/jira refresh |
@jordigilh: This pull request references Jira Issue OCPBUGS-29397, which is valid. The bug has been moved to the POST state. 7 validation(s) were run on this bug
No GitHub users were found matching the public email listed for the QA contact in Jira (dwilson@redhat.com), skipping review request. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
cc219eb
into
openshift:release-4.14
@jordigilh: Jira Issue OCPBUGS-29397: All pull requests linked via external trackers have merged: Jira Issue OCPBUGS-29397 has been moved to the MODIFIED state. In response to this:
Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the openshift-eng/jira-lifecycle-plugin repository. |
[ART PR BUILD NOTIFIER] This PR has been included in build ose-ovn-kubernetes-base-container-v4.14.0-202405021608.p0.gcc219eb.assembly.stream.el9 for distgit ovn-kubernetes-base. |
Fix included in accepted release 4.14.0-0.nightly-2024-05-02-211455 |
Fixes an issue when using APB in a cluster with high number of pods where the APB controller would hit the KAPI for each pod event to update the APB status, causing a high cpu usage.
The fix resolves in checking if the last message in the APB status in the informer matches the same message generated for the pod event, and if different or different status (succeeded or failed) then proceed to request the latest copy of the APB CR from the KAPI server and update it.