[observability] Introduce "ReplicaUnavailable" alerts #20344

Merged
merged 2 commits into from
Nov 7, 2024
@@ -47,7 +47,18 @@ spec:
   annotations:
     runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
     summary: Desired number of replicas for image-builder-mk3 are not available in cluster {{ $labels.cluster }}
-    description: The mismatch is {{ printf "%.2f" $value }}
+    description: 'Desired number of replicas for image-builder-mk3 are not available in cluster {{ $labels.cluster }}: {{ printf "%.2f" $value }} are missing'
   expr: |
     kube_deployment_spec_replicas{deployment="image-builder-mk3", cluster!~"ephemeral.*"} != kube_deployment_status_replicas_available{deployment="image-builder-mk3", cluster!~"ephemeral.*"}
   for: 3m
+- alert: GitpodImageBuilderMk3ReplicaUnavailable
+  labels:
+    # TODO(gpl): warning for now, to set it up and fine-tune it
+    severity: warning
+  annotations:
+    runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
+    summary: image-builder-mk3 replicas are unavailable in cluster {{ $labels.cluster }}
+    description: 'image-builder-mk3 pods are unavailable in cluster {{ $labels.cluster }}: {{ printf "%.2f" $value }}'
+  expr: |
+    kube_deployment_status_replicas_unavailable{deployment="image-builder-mk3", cluster!~"ephemeral.*"} > 0
+  for: 10m
@@ -31,7 +31,18 @@ spec:
   annotations:
     runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
     summary: Desired number of replicas for node-labeler are not available in cluster {{ $labels.cluster }}
-    description: The mismatch is {{ printf "%.2f" $value }}
+    description: 'Desired number of replicas for node-labeler are not available in cluster {{ $labels.cluster }}: {{ printf "%.2f" $value }} are missing'
   expr: |
     kube_deployment_spec_replicas{deployment="node-labeler", cluster!~"ephemeral.*"} != kube_deployment_status_replicas_available{deployment="node-labeler", cluster!~"ephemeral.*"}
   for: 3m
+- alert: GitpodNodeLabelerReplicaUnavailable
+  labels:
+    # TODO(gpl): warning for now, to set it up and fine-tune it
+    severity: warning
+  annotations:
+    runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
+    summary: node-labeler replicas are unavailable in cluster {{ $labels.cluster }}
+    description: 'node-labeler pods are unavailable in cluster {{ $labels.cluster }}: {{ printf "%.2f" $value }}'
+  expr: |
+    kube_deployment_status_replicas_unavailable{deployment="node-labeler", cluster!~"ephemeral.*"} > 0
+  for: 10m
@@ -31,7 +31,18 @@ spec:
   annotations:
     runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
     summary: Desired number of replicas for ws-manager-mk2 are not available in cluster {{ $labels.cluster }}
-    description: The mismatch is {{ printf "%.2f" $value }}
+    description: 'Desired number of replicas for ws-manager-mk2 are not available in cluster {{ $labels.cluster }}: {{ printf "%.2f" $value }} are missing'
   expr: |
     kube_deployment_spec_replicas{deployment="ws-manager-mk2", cluster!~"ephemeral.*"} != kube_deployment_status_replicas_available{deployment="ws-manager-mk2", cluster!~"ephemeral.*"}
   for: 3m
+- alert: GitpodWsManagerMk2ReplicaUnavailable
+  labels:
+    # TODO(gpl): warning for now, to set it up and fine-tune it
+    severity: warning
+  annotations:
+    runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
+    summary: ws-manager-mk2 replicas are unavailable in cluster {{ $labels.cluster }}
+    description: 'ws-manager-mk2 pods are unavailable in cluster {{ $labels.cluster }}: {{ printf "%.2f" $value }}'
+  expr: |
+    kube_deployment_status_replicas_unavailable{deployment="ws-manager-mk2", cluster!~"ephemeral.*"} > 0
+  for: 10m
@@ -31,7 +31,18 @@ spec:
   annotations:
     runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
     summary: Desired number of replicas for ws-proxy are not available in cluster {{ $labels.cluster }}
-    description: The mismatch is {{ printf "%.2f" $value }}
+    description: 'Desired number of replicas for ws-proxy are not available in cluster {{ $labels.cluster }}: {{ printf "%.2f" $value }} are missing'
   expr: |
     kube_deployment_spec_replicas{deployment="ws-proxy", cluster!~"ephemeral.*"} != kube_deployment_status_replicas_available{deployment="ws-proxy", cluster!~"ephemeral.*"}
   for: 3m
+- alert: GitpodWsProxyMk2ReplicaUnavailable
+  labels:
+    # TODO(gpl): warning for now, to set it up and fine-tune it
+    severity: warning
+  annotations:
+    runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
+    summary: ws-proxy replicas are unavailable in cluster {{ $labels.cluster }}
+    description: 'ws-proxy pods are unavailable in cluster {{ $labels.cluster }}: {{ printf "%.2f" $value }}'
+  expr: |
+    kube_deployment_status_replicas_unavailable{deployment="ws-proxy", cluster!~"ephemeral.*"} > 0
+  for: 10m
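
Since the new alerts are marked as still being fine-tuned, a `promtool test rules` unit test might help pin down the `for: 10m` behavior. The following is only a sketch for the ws-proxy alert, not part of this PR. It assumes the `groups:` section of the rule has been extracted into a plain Prometheus rule file; the file name `ws-proxy-rules.yaml` and the cluster name `prod-01` are hypothetical.

```yaml
# Sketch of a promtool unit test (run with: promtool test rules <this file>).
# Assumption: ws-proxy-rules.yaml is a plain rule file with top-level `groups:`,
# extracted from the PrometheusRule manifest. The file name is hypothetical.
rule_files:
  - ws-proxy-rules.yaml

evaluation_interval: 1m

tests:
  - interval: 1m
    input_series:
      # One replica stays unavailable for the whole test window.
      # "prod-01" is a made-up cluster name that does not match "ephemeral.*".
      - series: 'kube_deployment_status_replicas_unavailable{deployment="ws-proxy", cluster="prod-01"}'
        values: '1x15'
    alert_rule_test:
      # After 5 minutes the alert is still pending, so nothing is firing yet.
      - eval_time: 5m
        alertname: GitpodWsProxyMk2ReplicaUnavailable
        exp_alerts: []
      # After 11 minutes the 10m `for` duration has elapsed and the alert fires.
      - eval_time: 11m
        alertname: GitpodWsProxyMk2ReplicaUnavailable
        exp_alerts:
          - exp_labels:
              severity: warning
              deployment: ws-proxy
              cluster: prod-01
            exp_annotations:
              runbook_url: https://github.com/gitpod-io/runbooks/blob/main/runbooks/GitpodWorkspaceDeploymentReplicaMismatch.md
              summary: ws-proxy replicas are unavailable in cluster prod-01
              description: 'ws-proxy pods are unavailable in cluster prod-01: 1.00'
```

The same test shape should carry over to the other three deployments by swapping the `deployment` label and the alert name.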