Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question: Handling DeadmanSwitch in Karma #2512

Closed
felskrone opened this issue Dec 4, 2020 · 5 comments · Fixed by #2546
Closed

Question: Handling DeadmanSwitch in Karma #2512

felskrone opened this issue Dec 4, 2020 · 5 comments · Fixed by #2546
Assignees
Labels
enhancement New feature or request

Comments

@felskrone
Copy link

I monitor several services with prometheus and set up a DeadmanSwitch to ensure my alerting pipeline is working.

The DeadmanSwitch is an active alert that is always firing:

      #
      # Constant testing of the alerting pipeline
      #
      - alert: DeadMansSwitch
        expr: vector(1)
        labels:
          severity: clear
          team: MyTeam
          hive: DeadManSwitch
        annotations:
          description: vector(1)
          title: "DeadManSwitch ServerA"

Since its an active alarm it always shows up in Karma. But what im actually interested in is, to see if the alert is missing.

Is this possible with karma? I could not find anything in the documentation that occured to me as an approach to this.

I would not expect anything to be there as its a dashboard for showing alerts, but i figured asking wouldnt hurt :)

@prymitive
Copy link
Owner

I'm not sure what you expect karma to do here. Did you consider using something like https://github.com/gouthamve/deadman to generate alerts when your DeadMansSwitch is now being generated?

@felskrone
Copy link
Author

I would like to have a DeadmanSnitch without an external service like the one you linked to.

So basically have the possibility to "invert" an alert and report if it is missing instead of showing in the dashboard.

Just like the absent() function in prometheus for testing if a metric is missing.

@prymitive
Copy link
Owner

prymitive commented Dec 12, 2020

I think it’s worth adding a health check feature like that. Will look into this, thanks.

@prymitive prymitive self-assigned this Dec 12, 2020
@prymitive prymitive added the enhancement New feature or request label Dec 12, 2020
@prymitive
Copy link
Owner

Raised #2546 that adds new config options for it

@felskrone
Copy link
Author

Cool, thx!

@github-actions github-actions bot locked as resolved and limited conversation to collaborators Dec 30, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants