Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Closes #43
Added Prometheus example alert rules:
up
),Alerts are separated into three groups.
common
is non-Tarantool alert on process work (Prometheusup
).tarantool-common
are general Tarantool alerts (Lua memory and slab_ratio) that can be applied to any Tarantool application.tarantool-business
is a list of references on how you can monitor your business logic. One can base its alert rules on what's described there (because it's impossible to say if it is OK for your app to have 1000 RPS of 4xx errors or 0 requests on a router for an hour or not without knowing your app business logic beforehand). That's also the reason why I fixedjob='example_project'
in alltarantool-business
alert rules while leavingcommon
andtarantool-common
alert rules process all possible Tarantool instances (if you have two different apps, they are likely to have different HTTP load and business logic, while 2 Gb Lua threshold is true for both of them).Test Prometheus example alert rules with promtool.
The next step should be adding some documentation based on this example (here or in tarantool/metrics), but I think it should be a different PR.