Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dynamic alerting rules for refactoring #10

Open
rosogon opened this issue Sep 25, 2020 · 3 comments
Open

Dynamic alerting rules for refactoring #10

rosogon opened this issue Sep 25, 2020 · 3 comments
Assignees
Labels
runtime SODALITE Runtime Layer

Comments

@rosogon
Copy link

rosogon commented Sep 25, 2020

The M18 Rule-based refactorer used static rules to alert on high|load cpu usage. These should be dynamic in the sense that a new application should add its own rules. According to https://prometheus.io/docs/prometheus/latest/configuration/configuration/,
rule_files is a glob of files, so this could be addresed as:
0. rule_files: - /etc/prometheus/rules_*

  1. the deployment of an application installs the rule files as /etc/prometheus/rules_
  2. prometheus server is restarted

Still, the generation of the rule files from the application SLA is needed, but to be addressed in other ticket.

@rosogon rosogon added the runtime SODALITE Runtime Layer label Sep 25, 2020
@rosogon
Copy link
Author

rosogon commented Oct 28, 2020

This relates to the problems raised by a multi-tenant Prometheus. https://github.com/cherti/PromAuthProxy is a project that could help on that.

The other alternative is to modify the approach and use one Prometheus per deployment.

@pmundt
Copy link

pmundt commented Dec 7, 2020

I've tested this under Kubernetes for the Edge cases as well, and have had success with placing the prometheus config in a Kubernetes configmap and injecting a monitoring sidecar that dispatches a POST to the prometheus server config reload endpoint whenever the configuration changes. The process is roughly described here: https://www.weave.works/blog/prometheus-configmaps-continuous-deployment/

@jramosrivas
Copy link
Contributor

At the moment the ruleserver offers a REST API to add alerts and remove them from the Prometheus server, there is a description of its functioning in the readme.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
runtime SODALITE Runtime Layer
Projects
None yet
Development

No branches or pull requests

6 participants