
[Discuss][Actions] Should actions remain "at most once"? #102888

Closed

gmmorris opened this issue Jun 22, 2021 · 4 comments
Labels
discuss, estimate:needs-research, Feature:Actions/Framework, Feature:Actions, Feature:Alerting, impact:high, resilience, Team:ResponseOps

Comments

gmmorris (Contributor) commented Jun 22, 2021

When the actions plugin was introduced we chose to go with an "at most once" approach, meaning we wanted to ensure actions don't accidentally fire more than once.

We achieved this by setting the action tasks' maxAttempts to 1, which means actions are never retried.
The thinking at the time was that running an action twice was worse than not running it at all: we had a history of failed tasks in the .kibana_task_manager index, and we had concrete plans to build an Event Log UI before GA.
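For illustration, this is roughly how a task type opts into that behaviour via Task Manager's `registerTaskDefinitions`. This is a simplified sketch, not the actual actions plugin code: the task type name is made up, and `taskManager` (the Task Manager setup contract) and `executeAction` are assumed to be in scope.

```ts
// Simplified sketch of a task definition with "at most once" semantics.
taskManager.registerTaskDefinitions({
  'actions:example-connector': {
    title: 'Example action execution task',
    // maxAttempts: 1 means Task Manager never reschedules the task after a
    // failure, so a failed action execution is simply dropped ("at most once").
    maxAttempts: 1,
    createTaskRunner: ({ taskInstance }) => ({
      async run() {
        // Execute the action with the params stored on the task instance.
        // If this throws, there is no retry.
        // (executeAction is a placeholder for the real execution logic.)
        await executeAction(taskInstance.params);
      },
    }),
  },
});
```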

Best laid plans of 🐭 and 👷‍♀️ and all that: we're long past GA without an Event Log UI, and following this PR these failed tasks are now cleaned up every 5 minutes.

This means that even though actions are still "at most once", we no longer have the mitigation of users being able to investigate failed tasks. Failed actions are still logged in the Event Log, so it's not that there is no record at all, but we have no concrete plans for an Event Log UI, and the tasks themselves are deleted, so it isn't easy to reproduce the failure case.

This feels like a risk to me. I think we should either reconsider the "at most once" guidance, or consider keeping some kind of record of the failed tasks that can't cause migration failures further down the line.

Discuss :)

cc @elastic/kibana-alerting-services @stacey-gammon @kobelb

@gmmorris gmmorris added the discuss, Feature:Alerting, Feature:Actions, and Team:ResponseOps labels Jun 22, 2021
@gmmorris gmmorris added the Project:AlertingNotifyEfficiently and Feature:Actions/Framework labels and removed the Project:AlertingNotifyEfficiently label Jun 30, 2021
mikecote (Contributor) commented:

cc @arisonl for awareness of what we should do from a product perspective.

@mikecote mikecote added this to Backlog in Kibana Alerting Jul 28, 2021
@gmmorris gmmorris added the loe:needs-research label Aug 5, 2021
kobelb (Contributor) commented Aug 9, 2021

I'm largely putting on a product hat here, but I think we should allow our users to choose between "at most once" and "at least once" delivery for actions on a per-action basis. Depending on the situation, I anticipate users wanting the ability to customize how this works for each different alerting rule.
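For the sake of discussion, here is one hypothetical shape a per-action delivery setting could take. None of these fields or types exist today; the names are made up purely for illustration.

```ts
// Hypothetical, for discussion only: a per-action delivery setting.
type ActionDelivery = 'at-most-once' | 'at-least-once';

interface ActionDeliveryOptions {
  delivery: ActionDelivery;
  // Only meaningful for 'at-least-once': how many attempts Task Manager may make.
  maxAttempts?: number;
}

// A rule action that opts into retries, e.g. for a critical pager connector:
const deliveryOptions: ActionDeliveryOptions = {
  delivery: 'at-least-once',
  maxAttempts: 3,
};
```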

@gmmorris gmmorris added the resilience and estimate:needs-research labels Aug 16, 2021
@gmmorris gmmorris removed the loe:needs-research label Sep 2, 2021
@gmmorris gmmorris added the impact:high label Sep 16, 2021
gmmorris (Contributor, Author) commented Oct 6, 2021

This relates to the long running actions issue as well: #113424

@XavierM XavierM removed this from Backlog in Kibana Alerting Jan 6, 2022
@kobelb kobelb added the needs-team label Jan 31, 2022
@botelastic botelastic bot removed the needs-team label Jan 31, 2022
mikecote (Contributor) commented:

Closing in favour of #143046
