New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
test/e2e/upgrade/alert: Allow AggregatedAPIDown to unblock 4.7->4.8 CI #26220
test/e2e/upgrade/alert: Allow AggregatedAPIDown to unblock 4.7->4.8 CI #26220
Conversation
[APPROVALNOTIFIER] This PR is NOT APPROVED This pull-request has been approved by: wking The full list of commands accepted by this bot can be found here.
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
test/e2e/upgrade/alert/alert.go
Outdated
@@ -143,7 +143,7 @@ func (t *UpgradeTest) Test(f *framework.Framework, done <-chan struct{}, upgrade | |||
// Invariant: No non-info level alerts should have fired during the upgrade | |||
firingAlertQuery := fmt.Sprintf(` | |||
sort_desc( | |||
count_over_time(ALERTS{alertstate="firing",severity!="info",alertname!~"Watchdog|AlertmanagerReceiversNotConfigured"}[%[1]s:1s]) | |||
count_over_time(ALERTS{alertstate="firing",severity!="info",alertname!~"Watchdog|AlertmanagerReceiversNotConfigured|AggregatedAPIDown"}[%[1]s:1s]) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please make this a known violation rather than an exclusion, and add more label selectors
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
a965a20
to
ea88b65
Compare
We're getting: alert AggregatedAPIDown fired for 210 seconds with labels: {name="v1beta1.metrics.k8s.io", namespace="default", severity="warning"} and such pretty consistently in those jobs. Tracked in [1]. Until that gets fixed, ignore the alert, so we are more likely to notice other breakage. [1]: https://bugzilla.redhat.com/show_bug.cgi?id=1970624
ea88b65
to
c26cfb5
Compare
@wking: The following tests failed, say
Full PR test history. Your PR dashboard. Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes/test-infra repository. I understand the commands that are listed here. |
We're getting:
and such pretty consistently in those jobs. Tracked in rhbz#1970624. Until that gets fixed, ignore the alert, so we are more likely to notice other breakage.