Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Task manager] Evaluate the impact of Ephemeral Tasks #102616

Open
gmmorris opened this issue Jun 18, 2021 · 1 comment
Open

[Task manager] Evaluate the impact of Ephemeral Tasks #102616

gmmorris opened this issue Jun 18, 2021 · 1 comment
Labels
estimate:needs-research Estimated as too large and requires research to break down into workable issues Feature:Task Manager impact:medium Addressing this issue will have a medium level of impact on the quality/strength of our product. resilience Issues related to Platform resilience in terms of scale, performance & backwards compatibility Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams)

Comments

@gmmorris
Copy link
Contributor

The Ephemeral Task feature was released in #90888 (PR: #97311) under a feature flag and is disabled by default.

In order to open this feature up we need to verify its performance and sustainability at scale.

Acceptance Criteria

  1. Before this is tested, we need to define a concrete experiment which we should write up as an RFC and share with tech leads for review. Such an experiment needs a clear hypothesis, a reproducible test plan and defined metrics for success.
  2. Once an experiment is defined, we should run it with both the feature enabled, and disabled, for comparison.
  3. After the experiment the results will be shared with the team, so a decision can be made in regards to enabling the feature by default.

Possible Outcomes

The way I see it (open to debate, obviously) we have three possible outcomes:

  1. We decide the feature is scalable, sustainable and valuable - enabling it by default.
  2. We decide the feature is scalable and sustainable, but not valuable enough - we delete it (yay, less code to maintain, and we've learned something new)
  3. We decide the feature is valuable, but isn't scalable or sustainable enough (due to added complexity, for instance) - further discussion is needed as to whether the value is sufficient to warrant further technical work that might be needed to address the lack of scalability/sustainability.
@gmmorris gmmorris added Feature:Task Manager Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams) labels Jun 18, 2021
@elasticmachine
Copy link
Contributor

Pinging @elastic/kibana-alerting-services (Team:Alerting Services)

@gmmorris gmmorris added Project:AlertingNotifyEfficiently Alerting team project for reducing the noise created by the alerting framework. and removed Project:AlertingNotifyEfficiently Alerting team project for reducing the noise created by the alerting framework. labels Jun 30, 2021
@gmmorris gmmorris added the resilience Issues related to Platform resilience in terms of scale, performance & backwards compatibility label Jul 15, 2021
@mikecote mikecote added this to Backlog in Kibana Alerting Jul 21, 2021
@gmmorris gmmorris added the loe:needs-research This issue requires some research before it can be worked on or estimated label Aug 11, 2021
@gmmorris gmmorris added the estimate:needs-research Estimated as too large and requires research to break down into workable issues label Aug 18, 2021
@gmmorris gmmorris removed the loe:needs-research This issue requires some research before it can be worked on or estimated label Sep 2, 2021
@gmmorris gmmorris added the impact:medium Addressing this issue will have a medium level of impact on the quality/strength of our product. label Sep 16, 2021
@XavierM XavierM removed this from Backlog in Kibana Alerting Jan 6, 2022
@kobelb kobelb added the needs-team Issues missing a team label label Jan 31, 2022
@botelastic botelastic bot removed the needs-team Issues missing a team label label Jan 31, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
estimate:needs-research Estimated as too large and requires research to break down into workable issues Feature:Task Manager impact:medium Addressing this issue will have a medium level of impact on the quality/strength of our product. resilience Issues related to Platform resilience in terms of scale, performance & backwards compatibility Team:ResponseOps Label for the ResponseOps team (formerly the Cases and Alerting teams)
Projects
Development

No branches or pull requests

3 participants