Check runner results after some settling time #7679

StefanBruens · 2019-06-13T17:32:29Z

Currently, when the runner results does not show the results immediately,
the check fails, the runner is closed with 'esc', and the system waits
for 30 seconds.

Instead, give the runner a reasonable chance to present its results,
and only repeat the sequence if the results does not show up after 25
seconds.

Related ticket: https://progress.opensuse.org/issues/53045
Related ticket: https://progress.opensuse.org/issues/51944

In case the runner does not show up properly after 10 attempts, fail the test as the error is hard to spot otherwise. Signed-off-by: Stefan Brüns <stefan.bruens@rwth-aachen.de>

Currently, when the runner results does not show the results immediately (default timeout is 0), the check fails, the runner is closed with 'esc', and the system waits for 30 seconds. Instead, give the runner a reasonable chance to present its results, and only repeat the sequence if the results does not show up after the default timeout of 30 seconds. See poo#53045 Signed-off-by: Stefan Brüns <stefan.bruens@rwth-aachen.de>

ggardet · 2019-06-13T18:51:03Z

Is it a replacement of #7540 ?

ggardet · 2019-06-13T18:51:54Z

Could we have some tests run on x86_64 and aarch64, please?

StefanBruens · 2019-06-13T22:13:18Z

Is it a replacement of #7540 ?

No, this branch is on top of #7540. Both PR touch the same region of code.

okurz · 2019-06-14T08:42:24Z

lib/susedistribution.pm

-        if ($retries > 1) {
+        if ($retries == 1) {
+            assert_screen('desktop-runner-plasma-suggestions', $timeout);
+        } elsif (!check_screen('desktop-runner-plasma-suggestions', $timeout)) {


I consider this approach racy as well. Can you evaluate if you can use a multi-tag assert_screen with match_has_tag instead of check_screen with non-zero timeout to prevent introducing any timing dependant behaviour, to save test execution time as well as state more explicitly from the testers point of view what are the expected alternatives. For example:

assert_screen([qw(yast2_console-finished yast2_missing_package)]); if (match_has_tag('yast2_missing_package')) { send_key 'alt-o'; # confirm package installation assert_screen 'yast2_console-finished'; }

I agree that the previous approach by @ggardet might also not be the perfect solution in the end. Please see https://progress.opensuse.org/issues/35589 with my – rather lengthy – investigation. And any try to fix this proberly demands good statistics for a proof :) If you like you can actually use ressources on openqa.opensuse.org yourself for testing, @ggardet does the same with good success already. http://open.qa/docs/#_triggering_tests_based_on_an_any_remote_git_refspec_or_open_github_pull_request describes how that can be achieved. Feel free to ping "okurz" on irc://chat.freenode.net/opensuse-factory for help and getting API access to the instance.

I consider this approach racy as well. Can you evaluate if you can use a multi-tag assert_screen with match_has_tag instead of check_screen with non-zero timeout to prevent introducing any timing dependant behaviour, to save test execution time as well as state more explicitly from the testers point of view what are the expected alternatives.

Can you please elaborate which events would lead to a race here? Obviously, krunner needs some time to show the results, so checking immediately after the last character typed has a very low success rate.
The check here is only for the krunner suggestions, so there is exactly one tag involved.

ok, sorry. in this case I don't mean "racy" but wasting time in case of no-match. In this specific case I actually think check_screen with a non-zero timeout would work ok but unfortunately I had been repeatedly fighting against many check_screen calls which are wasting time a lot and nobody realizes. I would appreciate if you can try to fix the same problem with the multi-tag assert_screen I suggested above. If you want to keep the check_screen call still I will accept that and merge the PR though :)

The "nobody realizes" can not happen here, as the final repetition uses an "assert_screen", see https://openqa.opensuse.org/tests/962535#step/dolphin/46 (that one was still missing a matching needle).
Anything but check_screen would likely waste more time:

with "0" timeout:

the command is typed, krunner starts processing

screen is grabbed, check_screen fails

wait some time?

check again?

with short timeout

the command is typed, krunner starts processing

screen is grabbed, check_screen does not match, but no timeout

the suggestions appear

screen is grabbed, check_screen succeeds immediately

An assert_screen [ 'desktop-runner-plasma-suggestions', 'desktop-runner' ] would immeditaly succeed even without the suggestions, so we would have to repeat it - but immediately, or after waiting, and how often? Imagine the suggestions appear only after 5 seconds (which does not seem to be the case here, but after 1 frame at most) ...

Well assert_screen ['desktop-runner-plasma-suggestions', 'desktop-runner' ] is obviously not the right call on plasma but we can agree that it simply has to show the suggestions before we should try to press return. IMHO 9e0534a was already wrong because it introduced a check for the suggestions when the task of "init_desktop_runner" should just be to type, nothing more. We are looking again for the suggestions window in x11_start_program where we also try to handle retries. Your PR here seems to make things better but I actually tend towards reverting 9e0534a as well. WDYT? @ggardet as well?

@okurz The thing is 9e0534a make things to work. Without 9e0534a there are a lots of failure on aarch64 (also seen on x86_64, with less occurrences). So, I would prefer to merge this current PR which improve things a lot. We still can find another working solution afterwards. But please do not revert PR which make things to work without another PR which fixes previously fixed/workarounded problems.

convinced :)

StefanBruens · 2019-06-18T18:41:18Z

All cases where a sufficient needle was available passed on the first try (especially oowriter, oocalc, oomath):
https://openqa.opensuse.org/tests/962535#

I scheduled two more runs, this time including the updated/added needle.
x86_64: https://openqa.opensuse.org/tests/962615
aarch64: https://openqa.opensuse.org/tests/962616

StefanBruens · 2019-06-18T21:23:08Z

The other two tests have also finished successfully without any hickups or running into timeouts.

StefanBruens · 2019-06-19T13:10:42Z

Some background information why this notoriously failed for oowriter etc, but not e.g. echo, xterm, firefox etc:

xterm: the command matches the "Application" name from the desktop file, as soon as "xte" is typed, the "Application: xterm" suggestion appears
echo/firefox: dito, but after the command name (which fills the suggestions) some parameters are typed
oowriter: Only the full binary name matches, i.e. a suggestion is only made after "oowriter" has been typed completely. The krunner history is empty, so nothing to fill in either.

ggardet · 2019-06-19T13:55:43Z

LGTM.

…gram" again This is a followup to os-autoinst#7442 as well as os-autoinst#7679 and os-autoinst#7540 which introduced further retries. This commit tries to move the retries into the place where we already handle the plasma suggestions list. Related progress issue: https://progress.opensuse.org/issues/35589

Fail in case the plasma runner is not shown properly on final attempt

7b640eb

In case the runner does not show up properly after 10 attempts, fail the test as the error is hard to spot otherwise. Signed-off-by: Stefan Brüns <stefan.bruens@rwth-aachen.de>

StefanBruens force-pushed the recheck_screen branch from 30bd1dc to cb7cb55 Compare June 13, 2019 18:16

okurz reviewed Jun 14, 2019

View reviewed changes

okurz mentioned this pull request Jun 14, 2019

Fail in case the plasma runner is not shown properly on final attempt #7540

Merged

okurz mentioned this pull request Jun 19, 2019

init_desktop_runner: ensure we have plasma-krunner suggestions #7442

Merged

okurz merged commit 4e47789 into os-autoinst:master Jun 19, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Check runner results after some settling time #7679

Check runner results after some settling time #7679

StefanBruens commented Jun 13, 2019 •

edited

ggardet commented Jun 13, 2019

ggardet commented Jun 13, 2019

StefanBruens commented Jun 13, 2019

okurz Jun 14, 2019

StefanBruens Jun 14, 2019

StefanBruens Jun 18, 2019

okurz Jun 19, 2019

StefanBruens Jun 19, 2019

okurz Jun 19, 2019

ggardet Jun 19, 2019

okurz Jun 19, 2019

StefanBruens commented Jun 18, 2019

StefanBruens commented Jun 18, 2019

StefanBruens commented Jun 19, 2019

ggardet commented Jun 19, 2019

Check runner results after some settling time #7679

Check runner results after some settling time #7679

Conversation

StefanBruens commented Jun 13, 2019 • edited

ggardet commented Jun 13, 2019

ggardet commented Jun 13, 2019

StefanBruens commented Jun 13, 2019

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

StefanBruens commented Jun 18, 2019

StefanBruens commented Jun 18, 2019

StefanBruens commented Jun 19, 2019

ggardet commented Jun 19, 2019

StefanBruens commented Jun 13, 2019 •

edited