WIP - Alert filing#7076

Closed
airimovici wants to merge 6 commits into mozilla:master from gmierz:alert_filing

Conversation

@airimovici (Contributor)

@gmierz I created this PR to easily compare the changes and leave some notes.

def _handle_backfill_alerts(self):
# Get backfilled alerts
bugzilla = BugzillaHelper()
backfilled_records = BackfillRecord.objects.select_related(
Contributor:

I think we can extract this as a method of BackfillRecord.
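A minimal sketch of that extraction, assuming a hypothetical `backfilled()` classmethod on BackfillRecord; a stub manager stands in for the real Django ORM so the example runs standalone:

```python
class FakeManager:
    """Stub standing in for BackfillRecord.objects (illustration only)."""

    def __init__(self, rows):
        self.rows = rows

    def select_related(self, *fields):
        # The real Django manager would join the named relations;
        # the stub just returns the stored rows unchanged.
        return self.rows


class BackfillRecord:
    objects = FakeManager(rows=["record-1", "record-2"])

    @classmethod
    def backfilled(cls):
        # Hypothetical helper: moving the query here keeps
        # _handle_backfill_alerts free of ORM details.
        return cls.objects.select_related("alert", "alert__summary")
```

The caller in `_handle_backfill_alerts` would then reduce to `BackfillRecord.backfilled()`.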

return

@staticmethod
def recompute_backfill_alert(record: BackfillRecord) -> Push:
Contributor:

The type hint for the output isn't correct.

else:
logger.info(f"Change not found here {prev} vs. {cur}")

if not first_changed:
Contributor:

I'm not sure what to say about this. So far we've only marked invalid alerts manually.
Trying to automate this can be risky and could end up hiding real regressions.

Contributor:

Agree. I wouldn't go that far.

# TODO: get job_type from record when soft launch lands ---> job_type = record.job_type
job_type = get_job_type(record)
from_time, to_time = self._get_push_timestamp_range(record.get_context())
from_time, to_time = Helper.get_push_timestamp_range(record.get_context())
Contributor:

There's no need to refactor OutcomeChecker, as we already covered these exact methods in PR 7070.


# Check if this push already has an alert summary. If it does, reassign
# the alerts to it and delete this summary.
existing_summary = PerformanceAlertSummary.objects.filter(push=cur_push)
Contributor (Author):

PerformanceAlertSummary.objects.filter(push=cur_push).first() returns the first matching summary, or None if no results match the filter.
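That `.first()` contract can be mimicked in plain Python; the stub queryset below (in place of the real `PerformanceAlertSummary.objects`) assumes only that `.first()` yields the first result or `None`:

```python
class FakeQuerySet:
    """Stub mimicking the Django QuerySet.first() contract."""

    def __init__(self, rows):
        self.rows = list(rows)

    def first(self):
        # Django's QuerySet.first() returns the first result,
        # or None when the queryset is empty.
        return self.rows[0] if self.rows else None
```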

@alexandru-io (Contributor)

alexandru-io commented Apr 2, 2021

The idea of this bot is awesome, but it is pretty tricky and complex to implement. I am saying this from the experience of developing the automated bug filing from the alerts view.
I would suggest sprints of development & testing that are as small as possible. This bot should take care of the code limitations that are currently present (and they are not few) and of the current sheriffing workflow. Even if this is automatic, its purpose is to automate what the sheriffs do, so it should respect that workflow.
Another thing I would suggest is some sort of defensive approach: if the bot picks up an alert and loses itself in scenarios, it should be able to abandon it and leave the "manual sheriffs" to take care of it further. Perhaps if the backfill finished but there's too much noise, it shouldn't risk opening a regression bug with low confidence, but instead leave a note like "backfilled with depth x and y retriggers, but there's too much noise to confidently identify the culprit". Even if it doesn't file the bug, that would already be a big win for sheriffing.
A last thought is to release this bot gradually. Build metrics and AWSY alerts are generally the cleanest. Some build metrics tests even run on every revision, so they would be the best production scenario for testing automated regression filing. AWSY tests are very suitable for triggering the backfill and then easily finding the culprit. But if the culprit revision contains patches from several bugs in different products & components, the alert should be handed back to the sheriff in the first release(s) of the bot. Some Talos tests are clean and reasonably easy to sheriff, while others are pretty noisy. Raptor and Browsertime tests are the noisiest; I would release the bot to sheriff them only once we have steady results on cleaner tests like AWSY and build metrics.

@jmaher (Collaborator)

jmaher commented Jul 18, 2022

Is there planned work on this PR? I am trying to close out old PRs, or else ensure they have a valid project and will be worked on in the near future.

@gmierz (Collaborator)

gmierz commented Jul 20, 2022

@jmaher you can close it. We can find it in the closed PRs if we need it again.

@jmaher closed this Jul 20, 2022