Skip to content

Fix inhibit rule silencing all warning alerts#178

Merged
scotwells merged 1 commit intomainfrom
fix/remove-blackhole-alertmanager-route
Mar 31, 2026
Merged

Fix inhibit rule silencing all warning alerts#178
scotwells merged 1 commit intomainfrom
fix/remove-blackhole-alertmanager-route

Conversation

@scotwells
Copy link
Copy Markdown
Contributor

@scotwells scotwells commented Mar 31, 2026

Summary

A severity-based inhibit rule intended for activity system alerts was accidentally suppressing all warning alerts across the entire cluster. Any critical alert (like PolicyBindingsNotReady or CertManagerCertNotReady) was silencing every warning alert — including unrelated ones like KubePodCrashLooping and KubeJobFailed.

The fix scopes the rule so it only applies to alerts that have a component label, which limits it to activity system alerts as originally intended.

Test plan

  • Verify warning alerts start appearing in Slack again
  • Verify activity-specific severity inhibition still works for alerts with matching component labels

Relates to datum-cloud/infra#277

The severity-based inhibit rule (critical suppresses warning) used
equal: [component] but did not require the component label to exist.
When both source and target lack the label, alertmanager treats them
as matching, causing any critical alert to suppress all warnings
across the entire cluster.

Add component=~".+" matchers to both source and target so the rule
only applies to alerts that actually have a component label.
@scotwells scotwells changed the title Fix severity inhibit rule suppressing all warnings globally Fix inhibit rule silencing all warning alerts Mar 31, 2026
Copy link
Copy Markdown
Contributor

@ecv ecv left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

whoops

@scotwells scotwells merged commit 8acad55 into main Mar 31, 2026
7 checks passed
@scotwells scotwells deleted the fix/remove-blackhole-alertmanager-route branch March 31, 2026 13:52
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants