-
Notifications
You must be signed in to change notification settings - Fork 8.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Monitoring] Fix a couple of issues with the cpu usage alert #80737
[Monitoring] Fix a couple of issues with the cpu usage alert #80737
Conversation
Pinging @elastic/stack-monitoring (Team:Monitoring) |
@elasticmachine merge upstream |
1 similar comment
@elasticmachine merge upstream |
} else { | ||
cpuUsage = stat.cpuUsage; | ||
stat.cpuUsage = stat.cpuUsage; | ||
} |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't think you need this else clause here
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Updated
16b6225
to
5dc3d18
Compare
💚 Build SucceededMetrics [docs]async chunks size
page load bundle size
History
To update your PR or re-run it, just comment with: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good 👍
…#80737) * Fix a couple of issues with the cpu usage alert * Fix tests * PR feedback # Conflicts: # x-pack/plugins/monitoring/public/components/elasticsearch/node/advanced.js
…#80737) * Fix a couple of issues with the cpu usage alert * Fix tests * PR feedback # Conflicts: # x-pack/plugins/monitoring/public/components/elasticsearch/node/advanced.js
* master: (37 commits) [ILM] Migrate Warm phase to Form Lib (elastic#81323) [Security Solutions][Detection Engine] Fixes critical bug with error reporting that was doing a throw (elastic#81549) [Detection Rules] Add 7.10 rules (elastic#81676) [kbn/optimizer] ignore missing metrics when updating limits with --focus (elastic#81696) [SECURITY SOLUTIONS] Bugs overview page + investigate eql in timeline (elastic#81550) [Maps] fix unable to edit cluster vector styles styled by count when switching to super fine grid resolution (elastic#81525) Fixed migration issue for case specific actions, by extending email action migrator checks (elastic#81673) [CI] Preparation for APM tracking on CI (elastic#80399) [Home] Fixes Kibana app description order on home page and updates Canvas copy (elastic#80057) Make sure `to` is 'now' and not the same as `from` (elastic#81524) Nitpicking the 8.0 Breaking Change issue template (elastic#81678) [SECURITY_SOLUTION] Fix text on onboarding screen (elastic#81672) [data.search] Skip async search tests in build candidates and production builds (elastic#81547) Fix previousStartedAt by not changing when execution fails (elastic#81388) [Monitoring] Fix a couple of issues with the cpu usage alert (elastic#80737) Telemetry collection xpack to ts project references (elastic#81269) Elasticsearch: don't use url authentication for new client (elastic#81564) [App Search] Credentials: implement working flyout form (elastic#81541) Properly encode links to edit user page (elastic#81562) [Alerting UI] Don't wait for health check before showing Create Alert flyout (elastic#80996) ...
Resolves #80689
Resolves #80684
@ravikesarwani found a couple issues with the CPU alert and this PR aims to address both of them.
First, we discovered that the
Node -> Advanced
page wasn't properly filtering out alert instances for other nodes.Second, the way we calculate container CPU utilization didn't match what we are doing for the CPU utilization chart, which is actually more accurate. A couple of the data points are running counters (usage and periods cgroup data) and need to be treated as such, by using a derivative to properly use them. In order to use a derivative, we need to use a
date_histogram
even though we have no need for multiple buckets, we just need them to properly calculate the derivative so we are using afixed_interval
set to the same time range as theduration
parameter of the alert (which is also used in the same query in arange
filter)