Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[v10.1.x] Alerting: Attempt to retry retryable errors #79211

Merged
merged 1 commit into from Dec 7, 2023

Conversation

gotjosh
Copy link
Contributor

@gotjosh gotjosh commented Dec 7, 2023

Backport c631261 from #79161


Retrying has been broken for a good while now (at least since version 9.4) - this change attempts to re-introduce them in their simplest and safest form possible.

I first introduced #79095 to make sure we don't disrupt or put additional load on our customer's data sources with this change in a patch release. Paired with this change, retries can now work as expected.

There's two small differences between how retries work now and how they used to work in legacy alerting.

Retries only occur for valid alert definitions - if we suspect that that error comes from a malformed alert definition, we skip retrying. We have added a constant backoff of 1s in between retries.

Once this is merged, I'll make sure to update:

With what we should do next.

* Alerting: Attempt to retry retryable errors

Retrying has been broken for a good while now (at least since version 9.4) - this change attempts to re-introduce them in their simplest and safest form possible.

I first introduced #79095 to make sure we don't disrupt or put additional load on our customer's data sources with this change in a patch release. Paired with this change, retries can now work as expected.

There's two small differences between how retries work now and how they used to work in legacy alerting.

Retries only occur for valid alert definitions - if we suspect that that error comes from a malformed alert definition we skip retrying.
We have added a constant backoff of 1s in between retries.

---------

Signed-off-by: gotjosh <josue.abreu@gmail.com>
(cherry picked from commit c631261)
@gotjosh gotjosh added area/backend add to changelog backport A backport PR product-approved Pull requests that are approved by product/managers and are allowed to be backported labels Dec 7, 2023
@gotjosh gotjosh added this to the 10.1.x milestone Dec 7, 2023
@gotjosh gotjosh requested a review from a team as a code owner December 7, 2023 11:27
@gotjosh gotjosh requested review from rwwiv, JacobsonMT, yuri-tceretian and grobinson-grafana and removed request for a team December 7, 2023 11:27
@gotjosh gotjosh merged commit 8f3cddb into v10.1.x Dec 7, 2023
18 checks passed
@gotjosh gotjosh deleted the backport-79161-to-v10.1.x branch December 7, 2023 13:32
@aangelisc aangelisc modified the milestones: 10.1.x, 10.1.6 Dec 18, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
add to changelog area/backend backport A backport PR product-approved Pull requests that are approved by product/managers and are allowed to be backported
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants