New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Alerting: rule evaluation needs better retry semanatics #49621
Comments
During the meeting, we decided that we need to distinguish between retriable errors and non-retriable ones. |
We're looking for this kind of feature, there are any hints to help anyone who would starting implementing this? I'll like to check and try something |
Then in
|
Has there been any movement on this issue? |
This has not been prioritized so far but we are moving in this direction slowly: there have been some improvements in |
Awesome thanks for the update
…On Tue, Aug 29, 2023 at 2:56 PM Yuri Tseretyan ***@***.***> wrote:
This has not been prioritized so far but we are moving in this direction
slowly: there have been some improvements in expr package recently, which
may help alerting solve the main problem of distinguishing repeatable and
non-repeatable errors.
—
Reply to this email directly, view it on GitHub
<#49621 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AUE5A2VILTJP2JPFFICHQQ3XXXYH3ANCNFSM5W5LXW7A>
.
You are receiving this because you commented.Message ID:
***@***.***>
--
[image: photo]
*Andy Murray*
Manager, Talent Acquisition R&D Cloud
***@***.*** ***@***.***>
www.grafana.co
<https://sales.grafana.com/api/mailings/click/PMRHK4TMEI5CE2DUORYDULZPO53XOLTHOJQWMYLOMEXGG33NF4RCYITJMQRDUMJQGUYTSMJMEJXXEZZCHIRGEMDFMZQTGOJTFU4TEMJXFU2GGNLCFU4TQM3GFUZWMOBUMVSDSNJQGUZDQIRMEJ3GK4TTNFXW4IR2EI2CELBCONUWOIR2EJIXUUL2PJBWK5LNPF4TM5KYJ4YEU4T2GQ4EW33SOB3XQ5ZQJB3VGT2OJBCFMLLQFVEDKVJ5EJ6Q====>
m
Series C funding announcement - $220M round with a $3B valuation
<https://www.bloomberg.com/news/articles/2021-08-24/grafana-labs-raises-220-million-round-at-3-billion-valuation>
The 7 Cultural Values that Drive Grafana Labs
<https://grafana.com/blog/2020/12/09/the-7-cultural-values-that-drive-grafana-labs/>
|
What happened:
Sometimes there can be intermittent errors when a rule is evaluated (network, service-related). Although Grafana can notify users about errors via dedicated channels, it does not retry evaluations. It can be useful, especially for rules with a long evaluation interval.
Also, we need to make sure that retries do not affect rule evaluation. In other words, the ticks from the scheduler should be processed as soon as possible. As a dumb solution, we can retry for N times unless the total evaluation duration exceeds half of the interval.
Tasks
Grafana: 10.2.3
The text was updated successfully, but these errors were encountered: