Skip to content

Commit

Permalink
Merge pull request #2748 from harche/bug_2000500
Browse files Browse the repository at this point in the history
[release-4.8] Bug 2000500: bump SystemMemoryExceedsReservation alert threshold to 95%
  • Loading branch information
openshift-merge-robot committed Sep 17, 2021
2 parents ef22729 + 0904a8f commit 214e285
Showing 1 changed file with 2 additions and 2 deletions.
Original file line number Diff line number Diff line change
Expand Up @@ -51,9 +51,9 @@ spec:
rules:
- alert: SystemMemoryExceedsReservation
expr: |
sum by (node) (container_memory_rss{id="/system.slice"}) > ((sum by (node) (kube_node_status_capacity{resource="memory"}) - sum by (node) (kube_node_status_capacity{resource="hugepages_1Gi"}) - sum by (node) (kube_node_status_capacity{resource="hugepages_2Mi"}) - sum by (node) (kube_node_status_allocatable{resource="memory"}) - sum by (node) (kube_node_status_allocatable{resource="hugepages_1Gi"}) - sum by (node) (kube_node_status_allocatable{resource="hugepages_2Mi"})) * 0.9)
sum by (node) (container_memory_rss{id="/system.slice"}) > ((sum by (node) (kube_node_status_capacity{resource="memory"}) - sum by (node) (kube_node_status_capacity{resource="hugepages_1Gi"}) - sum by (node) (kube_node_status_capacity{resource="hugepages_2Mi"}) - sum by (node) (kube_node_status_allocatable{resource="memory"}) - sum by (node) (kube_node_status_allocatable{resource="hugepages_1Gi"}) - sum by (node) (kube_node_status_allocatable{resource="hugepages_2Mi"})) * 0.95)
for: 15m
labels:
severity: warning
annotations:
message: "System memory usage of {{ $value | humanize }} on {{ $labels.node }} exceeds 90% of the reservation. Reserved memory ensures system processes can function even when the node is fully allocated and protects against workload out of memory events impacting the proper functioning of the node. The default reservation is expected to be sufficient for most configurations and should be increased (https://docs.openshift.com/container-platform/latest/nodes/nodes/nodes-nodes-managing.html) when running nodes with high numbers of pods (either due to rate of change or at steady state)."
message: "System memory usage of {{ $value | humanize }} on {{ $labels.node }} exceeds 95% of the reservation. Reserved memory ensures system processes can function even when the node is fully allocated and protects against workload out of memory events impacting the proper functioning of the node. The default reservation is expected to be sufficient for most configurations and should be increased (https://docs.openshift.com/container-platform/latest/nodes/nodes/nodes-nodes-managing.html) when running nodes with high numbers of pods (either due to rate of change or at steady state)."

0 comments on commit 214e285

Please sign in to comment.