Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ROX-16887: always extend SLO calculation over entire 28 days #87

Merged
merged 3 commits into from
May 5, 2023

Conversation

stehessel
Copy link
Contributor

@stehessel stehessel commented May 4, 2023

Extended average over time refers to the time series effectively being extended over the entire time interval. This is in contrast to avg_over_time, which only averages over time intervals where the time series is not nil. This is important during the initial 28 days of Central instances. For example, consider a Central instance that lived for 5 minutes and was down for 2 minutes. Using avg_over_time, the availability would be 3 min / 5 min = 60%. The extended average over 28 days would yield 1 - 2 min / 28 days = 99.995%. After the initial 28 days, both averages are equivalent, because data points will exist for the entire 28d range.

Also make sure that alerts only fire as long as the Central instance still exists (central:sli:availability >= 0 condition).

Extended average over time refers to the time series effectively being extended
over the entire time interval. This is in contrast to avg_over_time, which
only averages over time intervals where the time series is not nil.
This is important during the initial 28 days of Central instances. For example,
consider a Central instance that lived for 5 minutes and was down for 2 minutes.
Using avg_over_time, the availability would be 3 min / 5 min = 60%. The
extended average over 28 days would yield 1 - 2 min / 28 days ~ 99.995%.
After the initial 28 days, both averages are equivalent.
@stehessel stehessel marked this pull request as ready for review May 4, 2023 18:20
@stehessel stehessel requested a review from a team as a code owner May 4, 2023 18:20
@stehessel stehessel requested a review from 0x656b694d May 4, 2023 18:33
@stehessel stehessel merged commit ae9c16b into master May 5, 2023
1 check passed
@stehessel stehessel deleted the ROX-16887/extend-slo-interval branch May 5, 2023 13:22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants