
Approve jobs if at least older jobs passed #168

Merged (1 commit into openSUSE:master, Mar 20, 2024)

Conversation

@michaelgrifalconi commented Feb 21, 2024

https://progress.opensuse.org/issues/97118

  • if aggregate update failed, do not give up immediately
  • look at previous openQA jobs: if one is present, green, not too old, and includes the update under test, ignore that failure

This is to reduce the impact of one test being broken one day and a different test another day, with the update not being approved even though the combined results are all green, just not at the same time.

I did not touch the tests yet. Before investing more, I would like to discuss the new logic and its implementation.
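
For illustration, a minimal sketch of the check described above (not the actual qem-bot code; the contains_update helper and the 3-day age limit are assumptions made only to keep the sketch self-contained):

from datetime import datetime, timedelta

def can_ignore_failure(older_jobs, contains_update, max_age_days=3):
    """Return True if an older green aggregate job can stand in for the failed one.

    older_jobs: newest-first list of dicts with "id", "build" (e.g. "20240221-1")
    and "result"; contains_update is a callable injected to keep the sketch small.
    """
    oldest_usable = datetime.now() - timedelta(days=max_age_days)
    for job in older_jobs:
        build_date = datetime.strptime(job["build"][:-2], "%Y%m%d")
        if build_date < oldest_usable:
            return False  # remaining jobs are even older, give up
        if job["result"] not in ("passed", "softfailed"):
            continue  # this older run is not green, look at the next one
        if not contains_update(job["id"]):
            return False  # older runs likely never included this update, give up
        return True  # green, recent and includes the update: ignore the failure
    return False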

@michaelgrifalconi changed the title "Enhance bot approval logic" to "[WIP] Enhance bot approval logic" on Feb 21, 2024
@okurz (Member) left a comment

Neat idea! I wonder if this can actually work, because so far the approver workflow, AFAIK, never contacts openQA directly.

@michaelgrifalconi (Author)

Neat idea! I wonder if this can actually work, because so far the approver workflow, AFAIK, never contacts openQA directly.

I can assure you it works; I ran a lot of dry runs on my machine with live dashboard and openQA data!
I stole some openQA calls that the bot does when syncing jobs and created some more using the same client.
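
For context, a rough sketch of the kind of call meant here, assuming the client in question is python-openqa_client; the wrapper name and server below are illustrative, not the exact bot code:

from openqa_client.client import OpenQA_Client

client = OpenQA_Client(server="openqa.suse.de")

def get_single_job(job_id):
    # GET /api/v1/jobs/<id> returns {"job": {...}} with settings, result, build, ...
    return client.openqa_request("GET", "jobs/%s" % job_id)["job"]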

@michaelgrifalconi (Author)

If the code looks good enough, I will move on to adapting the tests!

okurz: This comment was marked as resolved.

@michaelgrifalconi (Author)

As you stated you already tested manually, can you provide the logs from such a run so that we can see the flow of execution from the log messages?

https://gist.github.com/michaelgrifalconi/4d07eee0197c929db7c2a11b85759edb

Note that lines like

2024-02-22 08:13:59 INFO     20240221
2024-02-22 08:13:59 INFO     Failed job date: 2024-02-21

will be removed, as I mentioned in the other conversations.

@michaelgrifalconi (Author)

New log with less frequent/redundant and more descriptive messages: https://gist.github.com/michaelgrifalconi/41b6451b36cdfb87814b9ce9635a9459

I would not reduce the logging too much, since it is something you would look at only in case of problems and it does not disturb anyone (of course, as long as it is not so huge as to be expensive on resources or to just clutter debugging).

@okurz (Member) left a comment

My feedback after the QE Tools workshop about this topic:

  1. What are alternatives that we can consider? How about enabling automatic retry again?
  2. How about shifting more aggregate tests to incidents, especially the tricky ones more prone to failing?
  3. There is no feedback to openQA so it will make it even harder for reviewers to find which jobs are actually blocking -> Hence my suggestion is instead to cover this logic in a separate application which simply writes "approvable_for" comments on openQA directly. This way there is direct feedback to reviewers, the same as to qem-bot. Thinking even further, such external applications could be triggered from openQA job hooks like the scripts in https://github.com/os-autoinst/scripts/ do. I suggest you take a look at https://github.com/os-autoinst/scripts/blob/master/openqa-label-known-issues-and-investigate-hook for this.
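
For illustration, posting such a comment from a hook script or separate application could look roughly like the sketch below. Only the POST jobs/<id>/comments route with a text parameter is standard openQA API; the comment format and server name are assumptions:

from openqa_client.client import OpenQA_Client

client = OpenQA_Client(server="openqa.suse.de")  # picks up the usual API key/secret config

def leave_approvable_for_comment(job_id, incident, request_id):
    # POST /api/v1/jobs/<id>/comments with a "text" parameter adds a job comment
    text = "approvable_for: SUSE:Maintenance:%s:%s" % (incident, request_id)
    client.openqa_request("POST", "jobs/%s/comments" % job_id, params={"text": text})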


@okurz (Member) commented Mar 13, 2024

Before too much development and review effort is invested, please keep #168 (review) in mind, which I think we agreed to follow up on.

@michaelgrifalconi (Author)

1. What are alternatives that we can consider? How about enabling automatic retry again?

I agree on making sure automatic retries are set to at least RETRY=1 globally. Where would be a good place to set that? Medium types?
That is in any case separate and does not replace this PR, which is for reproducible issues (either test or product issues, unrelated to the update that shall be approved and that was green the day before).

2. How about shifting more aggregate tests to incidents, especially the tricky ones more prone to failing?

Sure, there are tickets open for that. Still, this is a different topic and I believe both things help.

3. There is no feedback to openQA so it will make it even harder for reviewers to find which jobs are actually blocking -> Hence my suggestion is instead to cover this logic in a separate application.[...]

No, introducing a new thing that changes the behavior of something else from a different place makes it even more frustrating for an engineer to understand "what is happening and why".
I would agree to moving out the entire approval logic and handling it in a simpler way (no hand-crafted caching and syncing of data), but that is a future topic.

Right now this is the smallest tweak possible to increase the quality of life for maintenance test developers.

About the visibility issue, I think that all current test failures should be fixed/softfailed at some point, since they surely block updates that were not tested the day before (like newly released ones).
In addition, this is just about aggregates, and they run every day. I see no reason why anyone would spend time fixing a failure from the previous day instead of focusing on the present ones.

I see that it would still be nice to have visibility into what is really blocking an update and what could be ignored for a specific update request (via an "approvable for" comment). Too bad the dashboard does not show that either, AFAIK.

For this topic I would either:

  • switch from ignoring the failure to first writing an "Approvable for" comment (and then look at such comments before ignoring the failure)
  • merge it as it is to get the benefit of getting stuff approved, and then start discussing a better solution, like the one in the line above or something else

I have no strong opinion on either option, as long as we don't spend too much time discussing cosmetics and new cosmetic changes do not require me to do more rebases of this PR (which I believe to be more important than suddenly changing code styles).

@okurz (Member) commented Mar 13, 2024

switch from ignoring the failure to first writing an "Approvable for" comment (and then look at such comments before ignoring the failure)
Yes, that could be done in here as well. I just think that this would be easier to implement, and also to run, in a separate script. Or, maybe as a compromise, run that pre-approval, writing the comments, as a separate command within qem-bot? I am fine with either approach, just pointing out ideas. You can choose and we will support you either way :)

@asmorodskyi: This comment was marked as off-topic.

@michaelgrifalconi: This comment was marked as off-topic.

@okurz (Member) left a comment

One minor phrasing issue left; the rest is fine.

@michaelgrifalconi (Author)

Considering that:

  • we see more and more real situations where this logic would help
  • changing the design to writing comments would require more time and make dry-run testing more difficult

I would like to proceed as it is, add the necessary tests, and then, as soon as it is merged, start a discussion on how to improve visibility and consider the various options like commenting and/or moving to a different bot job, etc.

@okurz (Member) commented Mar 14, 2024

CI failures in https://github.com/openSUSE/qem-bot/actions/runs/8276775359/job/22649198458?pr=168#step:5:221

I would like to proceed as it is, add the necessary tests, and then, as soon as it is merged, start a discussion on how to improve visibility and consider the various options like commenting and/or moving to a different bot job, etc.

Yes, I can accept that, although be aware that concerns about the overall approach were raised by others, not just me. So besides the CI failures I see only two points missing before I can approve:

  1. The PR is marked as "WIP" and I won't approve before you confirm that the PR is ready to be approved, i.e. remove the WIP
  2. Squash the commits that just fixup the original commit

@codecov-commenter commented Mar 14, 2024

Codecov Report

Attention: Patch coverage is 84.05797% with 11 lines in your changes missing coverage. Please review.

Project coverage is 67.74%. Comparing base (7b921a0) to head (806cbc0).

Files                   Patch %   Lines
openqabot/approver.py   85.41%    7 Missing ⚠️
openqabot/openqa.py     80.00%    4 Missing ⚠️
Additional details and impacted files
@@            Coverage Diff             @@
##           master     #168      +/-   ##
==========================================
+ Coverage   67.00%   67.74%   +0.73%     
==========================================
  Files          25       25              
  Lines        1664     1730      +66     
==========================================
+ Hits         1115     1172      +57     
- Misses        549      558       +9     

☔ View full report in Codecov by Sentry.

@michaelgrifalconi (Author)

Right when I was feeling ready with this, I found confirmation of the issue caused by using SMELT IDs instead of RR IDs.
Moving the discussion about it here: #174

@michaelgrifalconi (Author) commented Mar 14, 2024

Right when I was feeling ready with this, I found confirmation of the issue caused by using SMELT IDs instead of RR IDs. Moving the discussion about it here: #174

Having lunch brought me some wisdom. We can make use of the same workaround described in the open issue to avoid falling into the rabbit hole of fixing everything in the system at once.
I can add a check here to make sure the selected job is still present in the qem-dashboard. If it is not, that means it was removed by qem-dashboard/#78 and should not be used.
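
A possible shape of that check, as a sketch only; the qem-dashboard route and auth header are assumptions, not the real dashboard API:

import requests

def validate_job_qam(job_id, dashboard_url, token):
    """Return True if the openQA job is still known to qem-dashboard."""
    response = requests.get(
        "%s/api/jobs/%s" % (dashboard_url, job_id),  # assumed route
        headers={"Authorization": "Token %s" % token},  # assumed auth scheme
        timeout=30,
    )
    return response.status_code == 200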

@michaelgrifalconi (Author)

I think I am ready for the last review. If it's all fine, I will do one last consolidation of commits and we can merge!
By the way, this is what would be approved by the new logic:

2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32461:324048
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32575:324054
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32795:324057
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32898:324045
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32912:324047
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32934:324023
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32940:323930
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32951:323996
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32956:324041
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32959:324042
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32960:324072
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32967:324105
2024-03-19 08:14:56 INFO     * SUSE:Maintenance:32971:324109

@okurz (Member) left a comment

+1

@michaelgrifalconi changed the title "[WIP] Enhance bot approval logic" to "Enhance bot approval logic" on Mar 19, 2024
@okurz (Member) left a comment

OK, now we need to find a better subject line than "Enhance bot approval logic". How about "Approve jobs if at least older jobs passed"?

Comment on lines 174 to 208
job = older_jobs["data"][i]
job_build = job["build"][:-2]
job_build_date = datetime.strptime(job_build, "%Y%m%d")

# Check the job is not too old
if job_build_date < oldest_build_usable:
    log.info(
        "Cannot ignore aggregate failure %s for update %s because: Older jobs are too old to be considered"
        % (failed_job_id, inc)
    )
    return False

if job["result"] == "passed" or job["result"] == "softfailed":
    # Check the job contains the update under test
    job_settings = self.client.get_single_job(job["id"])
    if not regex.match(str(job_settings)):
        # Likely older jobs don't have it either. Giving up
        log.info(
            "Cannot ignore aggregate failure %s for update %s because: Older passing jobs do not have update under test"
            % (failed_job_id, inc)
        )
        return False

    if not self.validate_job_qam(job["id"]):
        log.info(
            "Cannot ignore failed aggregate %s using %s for update %s because it is not present in qem-dashboard. It's likely about an older release request"
            % (failed_job_id, job["id"], inc)
        )
        return False

    log.info(
        "Ignoring failed aggregate %s and using instead %s for update %s"
        % (failed_job_id, job["id"], inc)
    )
    return True
Member

This for-body is now indented a bit too much. Can you extract a method here?

@michaelgrifalconi (Author)

Not a huge fan of creating methods for things that get called only in one place in the code, making me jump around in the file to follow the flow, but I guess it's a personal preference. Will do as requested

@michaelgrifalconi (Author)

I refactored the code a bit: without needing a new method, I removed a nested if to go down one level of indentation and make it more readable.
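
Illustrative only, not the actual diff: the de-nesting described here amounts to replacing the wrapping result check with an early continue, e.g.:

def first_green(jobs):
    for job in jobs:
        # guard clause: skip non-green jobs instead of nesting the remaining checks
        if job["result"] not in ("passed", "softfailed"):
            continue
        # ...the remaining checks stay one indentation level flatter...
        return job
    return None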

@okurz changed the title "Enhance bot approval logic" to "Approve jobs if at least older jobs passed" on Mar 20, 2024
@okurz (Member) left a comment

oh, sorry. You added another commit. Please squash, then I can approve

@michaelgrifalconi (Author)

Yeah, I usually try to show what I change instead of force-pushing on every change, since that makes it more difficult to check and might hide some stuff! Rebased now.

- if aggregate update failed, do not give up immediately
- look at openQA previous jobs, if present, green, not too old,
  still present in the qem-dashboard (to avoid using tests about
  different Release Requests) and it includes the update under test:
  ignore that failure

This is to reduce the impact of one test being broken one day and a
different test another day, with the update not being approved even
though the combined results are all green, just not at the same time.
@michaelgrifalconi (Author)

Thanks for the approval! I have no rights to merge, so someone else will have to do it :)

@kalikiana merged commit 00b4000 into openSUSE:master on Mar 20, 2024
3 checks passed