Run a basic test scenario on o3 as part of CI checks #18182

Martchus · 2023-11-22T10:57:01Z

Related ticket: https://progress.opensuse.org/issues/150992

Martchus · 2023-11-22T10:59:25Z

I'm not sure whether we can see anything here just yet because this runs on:pull_request_target and not on:pull_request (as explained in the comment that is part of the diff itself). So I'm afraid we cannot really fully test whether this will work before merging. I'll try to schedule a scenario manually.

Martchus · 2023-11-22T11:43:56Z

After a few adjustments I was able to conduct a successful test run using the senario definitions and commands in this PR: https://openqa.opensuse.org/tests/3748177

I basically invoked (which is what this GitHub action will do):

build=$(curl -s ${OPENQA_HOST:-https://openqa.opensuse.org}/group_overview/${OPENQA_GROUP_ID:-1}.json | jq -r '([ .build_results[] | select(.tag.description=="published") | select(.version=="Tumbleweed") | .build ] | sort | reverse)[]' | head -n1)
openqa-cli schedule \
  --monitor \
  --host "${OPENQA_HOST:-https://openqa.opensuse.org}/" \
  --param-file SCENARIO_DEFINITIONS_YAML=scenario-definitions.yaml \
  DISTRI=openQA VERSION=Tumbleweed FLAVOR=dev ARCH=x86_64 \
  HDD_1=opensuse-Tumbleweed-x86_64-$build-textmode@64bit.qcow2 \
  UEFI_PFLASH_VARS=opensuse-Tumbleweed-x86_64-$build-textmode@64bit-uefi-vars.qcow2 \
  BUILD="Martchus/os-autoinst-distri-opensuse.git#fcbbfae1d069ef031b59710dd4d1fef7ef98702e" _GROUP_ID="0" \
  CASEDIR="https://github.com/Martchus/os-autoinst-distri-opensuse.git#fcbbfae1d069ef031b59710dd4d1fef7ef98702e"

The test run used the test code from Git:

[2023-11-22T11:35:25.459032Z] [debug] [pid:26556] Fetching 'fcbbfae1d069ef031b59710dd4d1fef7ef98702e' from origin manually

Needles are still coming from the default location.

foursixnine

@Martchus/@okurz security concerns aside, this is looking naisss (I wonder though, why the check is not showing up, I guess it was never ran??)

.github/workflows/openqa.yml

scenario-definitions.yaml

.github/workflows/openqa.yml

scenario-definitions.yaml

.github/workflows/openqa.yml

Martchus · 2023-11-24T11:52:55Z

@foursixnine

I wonder though, why the check is not showing up, I guess it was never ran??

As explained

… this runs on:pull_request_target and not on:pull_request (as explained in the comment that is part of the diff itself). So I'm afraid we cannot really fully test whether this will work before merging.

foursixnine · 2023-11-24T22:46:02Z

@foursixnine

I wonder though, why the check is not showing up, I guess it was never ran??

As explained

… this runs on:pull_request_target and not on:pull_request (as explained in the comment that is part of the diff itself). So I'm afraid we cannot really fully test whether this will work before merging.

Lets use then a separate branch, the same way we have it now for Trufflehog, on a target branch that isn't master, or address the items in #18182 (comment) (which if I understood @okurz correctly, are still planned to be addressed)

https://github.com/os-autoinst/os-autoinst-distri-opensuse/blob/f99df70fc4702425fc55668a06d45bc639bf5056/.github/workflows/trufflehog.yml#L4C19-L4C19

otherwise, this would (at least at the moment) add extra time to pass all PR checks in its current state.

okurz · 2023-11-24T22:49:57Z

@foursixnine I would suggest to just accept the changes as is as we have the same running fine for multiple months already in os-autoinst-distri-openQA as well as os-autoinst-distri-example. I would only plan the other points after we collect some feedback from production and see which points users really care about

scenario-definitions.yaml

foursixnine · 2023-11-24T23:52:03Z

@foursixnine I would suggest to just accept the changes as is as we have the same running fine for multiple months already in os-autoinst-distri-openQA as well as os-autoinst-distri-example.

I would accept the changes if the points I have mentioned have been addressed

I would only plan the other points after we collect some feedback from production and see which points users really care about

If you decide to enable it on a separate target, I can ask directly to QE-Core to help provide feedback, in terms of accepting the changes, as they are now, I'd like to collect some approvals from PO's and also from @DimStar77 and @lkocman, as they will be the ones who will be impacted, even if the priority of these PRs are set to low.

But that doesn't stop you from planning the other points, as they directly correlate with my comment above

I understand you wanted to have a timeboxed activity, and there the path of least resistance is moving it to its own branch, but I'm not willing to allow to spam o3 multiple times a day, even if the test is short-lived, they will be a lot and we know how bad it could get, taking from the learning experience (extrapolate data here) of Maintenance Incidents being scheduled on OSD.

and the average of 14 minutes of wait here: https://github.com/os-autoinst/os-autoinst-distri-openQA/actions/workflows/openqa.yml

Will only grow, when you look - https://github.com/os-autoinst/os-autoinst-distri-opensuse/actions/workflows/ci.yml?query=is%3Acompleted + verification runs scheduled in different places

So my advice is already stated #18182 (comment)

But I won't block a merge if others (as mentioned above) say it will be fine for them.

Another idea is to use one or two worker instances to do all the work, following up on this other comment.

okurz · 2023-11-26T08:45:42Z

@foursixnine I would suggest to just accept the changes as is as we have the same running fine for multiple months already in os-autoinst-distri-openQA as well as os-autoinst-distri-example.

I would accept the changes if the points I have mentioned have been addressed

The points you mention also include a smart lookup of which test code changed and constructing a dynamic schedule. That IMHO is hard and would take considerable effort to the point of we would not follow up with that right now from the side of the tools team. I would like to bring in a minimal proof of concept that additionally has (limited) value and then encourage anyone also working actively with this repo to come up with improvement ideas.

I would only plan the other points after we collect some feedback from production and see which points users really care about

If you decide to enable it on a separate target, I can ask directly to QE-Core to help provide feedback, in terms of accepting the changes, as they are now, I'd like to collect some approvals from PO's and also from @DimStar77 and @lkocman, as they will be the ones who will be impacted, even if the priority of these PRs are set to low.

What do you mean with "separate target"?

I understand you wanted to have a timeboxed activity, and there the path of least resistance is moving it to its own branch, but I'm not willing to allow to spam o3 multiple times a day, even if the test is short-lived, they will be a lot and we know how bad it could get, taking from the learning experience (extrapolate data here) of Maintenance Incidents being scheduled on OSD.

My expectation is that there would be no significant load increase on o3 given that we have a much higher capacity nowadays.

But I won't block a merge if others (as mentioned above) say it will be fine for them.

Another idea is to use one or two worker instances to do all the work, following up on this other comment.

Instead we could just set a higher prio value

Martchus · 2023-11-28T10:57:47Z

I'm not sure what the suggestion to do this according to https://github.com/os-autoinst/os-autoinst-distri-opensuse/blob/f99df70fc4702425fc55668a06d45bc639bf5056/.github/workflows/trufflehog.yml#L4C19-L4C19 means. I would leave that PR for now as-is because I'm not sure what to improve. We might also close the related ticket https://progress.opensuse.org/issues/150992 as resolved soon as it was just a spike solution ticket anyways.

foursixnine · 2023-12-14T14:14:14Z

@foursixnine

I wonder though, why the check is not showing up, I guess it was never ran??

As explained

… this runs on:pull_request_target and not on:pull_request (as explained in the comment that is part of the diff itself). So I'm afraid we cannot really fully test whether this will work before merging.

Lets use then a separate branch, the same way we have it now for Trufflehog, on a target branch that isn't master, or address the items in #18182 (comment) (which if I understood @okurz correctly, are still planned to be addressed)

https://github.com/os-autoinst/os-autoinst-distri-opensuse/blob/f99df70fc4702425fc55668a06d45bc639bf5056/.github/workflows/trufflehog.yml#L4C19-L4C19

otherwise, this would (at least at the moment) add extra time to pass all PR checks in its current state.

@okurz this is the comment about the separate branch, the follow ups can be done there, before disrupting test developers cc @Martchus

Martchus · 2023-12-14T14:55:11Z

What follow-ups are we planning to do?

foursixnine · 2023-12-14T15:14:45Z

What follow-ups are we planning to do?

I would still prefer if the changes are applied in the same way as truffle-hog is, but I won't block a merge

.github/workflows/openqa.yml

okurz · 2023-12-15T10:12:18Z

@okurz this is the comment about the separate branch, the follow ups can be done there, before disrupting test developers cc @Martchus

I don't understand the idea behind a separate branch. The idea is to provide CI checks on pull requests so that actual openQA tests are conducted for every pull request same as we already have in https://github.com/os-autoinst/os-autoinst-distri-openQA/

foursixnine · 2023-12-15T13:42:39Z

@okurz this is the comment about the separate branch, the follow ups can be done there, before disrupting test developers cc @Martchus

I don't understand the idea behind a separate branch. The idea is to provide CI checks on pull requests so that actual openQA tests are conducted for every pull request same as we already have in https://github.com/os-autoinst/os-autoinst-distri-openQA/

I would still prefer if the changes are applied in the same way as truffle-hog is, but I won't block a merge

In fact, I would appreciate if the tools team could work on a branch that isn't master for this case, but as I mentioned yesterday to you, I won't block a merge of this anymore.

foursixnine · 2023-12-23T12:50:22Z

@okurz this is the comment about the separate branch, the follow ups can be done there, before disrupting test developers cc @Martchus

I don't understand the idea behind a separate branch.

The Idea behind a separate branch, is so that reviewers aren't stuck waiting until a CI job that doesn't have anything to do with their code changes (unless by out of luck, happens to be what this PR's schedule is).

The idea is to provide CI checks on pull requests so that actual openQA tests are conducted for every pull request same as we already have in https://github.com/os-autoinst/os-autoinst-distri-openQA/

If you know it works there already, then take the steps to move the ball further. You already did the "Hello World", go ahead and implement the thing well enough from the beginning, instead of rewritting the hello world again, this isn't rocket science.

PS: Before you ask again: "What does this mean for this PR?", find your answer below:

@okurz this is the comment about the separate branch, the follow ups can be done there, before disrupting test developers cc @Martchus

I don't understand the idea behind a separate branch. The idea is to provide CI checks on pull requests so that actual openQA tests are conducted for every pull request same as we already have in https://github.com/os-autoinst/os-autoinst-distri-openQA/

I would still prefer if the changes are applied in the same way as truffle-hog is, but I won't block a merge

In fact, I would appreciate if the tools team could work on a branch that isn't master for this case, but as I mentioned yesterday to you, I won't block a merge of this anymore.

Clarified in comments

Martchus · 2024-01-10T15:52:20Z

Just for the record: It looks like this check works, e.g. "Run a basic openQA test / trigger_and_monitor_openqa (pull_request_target)" passes on #18422 and the logs look like it actually conducted the test run (https://github.com/os-autoinst/os-autoinst-distri-opensuse/actions/runs/7477357160/job/20349838534?pr=18422, https://openqa.opensuse.org/tests/3861047).

foursixnine · 2024-01-11T22:29:51Z

Just for the record: It looks like this check works, e.g. "Run a basic openQA test / trigger_and_monitor_openqa (pull_request_target)" passes on #18422 and the logs look like it actually conducted the test run (https://github.com/os-autoinst/os-autoinst-distri-opensuse/actions/runs/7477357160/job/20349838534?pr=18422, https://openqa.opensuse.org/tests/3861047).

But you already knew that from the experience with the openQA in openQA repo, this pr only enables that for this one.

okurz · 2024-01-12T05:54:29Z

But you already knew that from the experience with the openQA in openQA repo, this pr only enables that for this one.

Correct. And now we would know if any other future PR break the code so much that systems don't even boot anymore. And additionally in every pull request reviewers can ask to extend those tests accordingly where fitting

Martchus requested a review from foursixnine November 22, 2023 10:57

Martchus force-pushed the ci branch from fcbbfae to 1cc9599 Compare November 22, 2023 11:42

okurz approved these changes Nov 22, 2023

View reviewed changes

foursixnine previously requested changes Nov 24, 2023

View reviewed changes

Run a basic test scenario on o3 as part of CI checks

3a35f9b

Martchus force-pushed the ci branch from 1cc9599 to 3a35f9b Compare November 24, 2023 15:02

okurz approved these changes Nov 24, 2023

View reviewed changes

foursixnine reviewed Nov 24, 2023

View reviewed changes

scenario-definitions.yaml Show resolved Hide resolved

foursixnine added the notready label Nov 24, 2023

foursixnine reviewed Dec 14, 2023

View reviewed changes

.github/workflows/openqa.yml Show resolved Hide resolved

kalikiana approved these changes Jan 8, 2024

View reviewed changes

kalikiana removed the notready label Jan 10, 2024

kalikiana merged commit eadeb4d into os-autoinst:master Jan 10, 2024
8 checks passed

Martchus deleted the ci branch January 10, 2024 12:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Run a basic test scenario on o3 as part of CI checks #18182

Run a basic test scenario on o3 as part of CI checks #18182

Martchus commented Nov 22, 2023 •

edited

Loading

Martchus commented Nov 22, 2023

Martchus commented Nov 22, 2023 •

edited

Loading

foursixnine left a comment •

edited

Loading

Martchus commented Nov 24, 2023

foursixnine commented Nov 24, 2023 •

edited

Loading

okurz commented Nov 24, 2023

foursixnine commented Nov 24, 2023 •

edited

Loading

okurz commented Nov 26, 2023

Martchus commented Nov 28, 2023

foursixnine commented Dec 14, 2023

Martchus commented Dec 14, 2023

foursixnine commented Dec 14, 2023

okurz commented Dec 15, 2023

foursixnine commented Dec 15, 2023 •

edited

Loading

foursixnine commented Dec 23, 2023

Martchus commented Jan 10, 2024

foursixnine commented Jan 11, 2024

okurz commented Jan 12, 2024

Run a basic test scenario on o3 as part of CI checks #18182

Run a basic test scenario on o3 as part of CI checks #18182

Conversation

Martchus commented Nov 22, 2023 • edited Loading

Martchus commented Nov 22, 2023

Martchus commented Nov 22, 2023 • edited Loading

foursixnine left a comment • edited Loading

Choose a reason for hiding this comment

Martchus commented Nov 24, 2023

foursixnine commented Nov 24, 2023 • edited Loading

okurz commented Nov 24, 2023

foursixnine commented Nov 24, 2023 • edited Loading

okurz commented Nov 26, 2023

Martchus commented Nov 28, 2023

foursixnine commented Dec 14, 2023

Martchus commented Dec 14, 2023

foursixnine commented Dec 14, 2023

okurz commented Dec 15, 2023

foursixnine commented Dec 15, 2023 • edited Loading

foursixnine commented Dec 23, 2023

Martchus commented Jan 10, 2024

foursixnine commented Jan 11, 2024

okurz commented Jan 12, 2024

Martchus commented Nov 22, 2023 •

edited

Loading

Martchus commented Nov 22, 2023 •

edited

Loading

foursixnine left a comment •

edited

Loading

foursixnine commented Nov 24, 2023 •

edited

Loading

foursixnine commented Nov 24, 2023 •

edited

Loading

foursixnine commented Dec 15, 2023 •

edited

Loading