This repository has been archived by the owner on Nov 16, 2022. It is now read-only.
Lower alert severity for memory based alerts #168
Merged
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
JIRA ID
https://issues.redhat.com/browse/INTLY-6614
Additional Information
Some discussion in https://issues.redhat.com/browse/INTLY-6614, the gist of which is a review of the critical alerts and if they should be critical (i.e. they page someone when they fire).
My thinking is that only problems that affect a potential service level objective (SLO) should be critical.
In this case, its:
KeycloakInstanceNotAvailable
if the service isn't availableKeycloakAPIRequestDuration90PercThresholdExceeded
&KeycloakAPIRequestDuration99.5PercThresholdExceeded
if the service isn't responding as fast as expectedAll other alerts would help with debugging & point at potential causes as to why one of the critical alerts are firing, but as long as the SLO is being met, no-one needs to be paged.
Verification Steps
Checklist:
Additional Notes