Skip ha/drbd_passive test when not in TWO_NODES scenario #15324

alvarocarvajald · 2022-08-04T15:14:18Z

Currently the 3 cluster node scenario tested for Maintenance jobs is always scheduling the ha/drbd_passive test module, even when this particular test module is not designed to work in cluster scenarios with more than 2 nodes.

As a result, whenever an MU job is triggered with a package that requires a drbd test, the 3 node scenario job fails: the test module is skipped on the third node before any of its barrier_wait() calls, so the other nodes remain blocked in a barrier_wait() call until they reach MAX_JOB_TIME and the whole scenario fails.
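The failure mode described above can be illustrated with a small, self-contained model. This is only a sketch in Python, not os-autoinst code: `threading.Barrier` stands in for the barrier that barrier_wait() blocks on, the short `timeout` stands in for MAX_JOB_TIME, and `run_scenario` and the node names are hypothetical.

```python
import threading


def run_scenario(parties, arriving_nodes, timeout=0.5):
    """Model a barrier expecting `parties` nodes when only `arriving_nodes` reach it.

    Returns a dict mapping each arriving node to "passed" or "timed out".
    """
    barrier = threading.Barrier(parties)
    results = {}
    lock = threading.Lock()

    def node(name):
        try:
            # Blocks until `parties` threads are waiting, or the timeout expires.
            barrier.wait(timeout=timeout)
            outcome = "passed"
        except threading.BrokenBarrierError:
            # A timeout breaks the barrier for every thread waiting on it.
            outcome = "timed out"
        with lock:
            results[name] = outcome

    threads = [threading.Thread(target=node, args=(n,)) for n in arriving_nodes]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return results


if __name__ == "__main__":
    # Third node skipped the module before the barrier: the other two time out.
    print(run_scenario(3, ["node1", "node2"]))
    # All three nodes reach the barrier: everyone passes.
    print(run_scenario(3, ["node1", "node2", "node3"]))
```

In this model the two nodes that do reach the barrier can never proceed on their own, which mirrors why skipping the module on every node (rather than only on the third) avoids the blocked barrier.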

One possible solution would be to create a different schedule for the 3 cluster node scenario, but this implies a PR to this repo, as well as a change to the job settings.

This PR instead modifies ha/drbd_passive so it is skipped when running in scenarios with the setting TWO_NODES=no. It will also skip the test module ha/filesystem that is usually scheduled right after ha/drbd_passive, as there would be no block device on which to test the filesystem creation.

Related ticket: https://progress.opensuse.org/issues/114727
Needles: N/A
Failing job: https://openqa.suse.de/tests/9247409#step/drbd_passive/5
Verification runs:

QAM jobs (3 nodes): node 1, node 2, node 3 & support server
QAM jobs (2 nodes): node 1, node 2, client & support server
Alpha Cluster on Milestone Build Validation: node 1, node 2 & support server

P.S.: I also include here a small optimization in the ha/vg test module to avoid multiple calls to hacluster::read_tag, which sends commands to the SUT.

emiura · 2022-08-05T08:55:30Z

> One possible solution would be to create a different schedule for the 3 cluster node scenario, but this implies a PR to this repo, as well as a change to the job settings.

Don't we need a PR to add the "TWO_NODES=no" anyway? Anyway, if it passes your VR, LGTM.

alvarocarvajald · 2022-08-05T10:08:00Z

> One possible solution would be to create a different schedule for the 3 cluster node scenario, but this implies a PR to this repo, as well as a change to the job settings. - don't we need a PR to add the "TWO_NODES=no" anyway? Anyway, if it passes your vr, lgtm.

It is already there:

If memory serves, TWO_NODES=no is used by other test modules and was added quite a while ago. I am simply re-using the same setting for this particular test module.

In fact, I think L47 on tests/ha/drbd_passive.pm which reads:

    if ((!is_node(1) && !is_node(2)) || check_var('TWO_NODES', 'no')) {

Could be re-written safely as only:

    if (check_var('TWO_NODES', 'no')) {

But I am leaving the node checks just in case there is already a working test using ha/drbd_passive with more than 2 nodes and that has not set TWO_NODES=no.
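To make the difference between the two conditions concrete, here is a minimal sketch that enumerates the cases. It is a Python model, not the Perl in drbd_passive.pm: is_node() and check_var('TWO_NODES', 'no') are replaced by a plain node number and a boolean, and the function names are hypothetical.

```python
from itertools import product


def skip_combined(node, two_nodes_no):
    """Model of: (!is_node(1) && !is_node(2)) || check_var('TWO_NODES', 'no')."""
    return (node != 1 and node != 2) or two_nodes_no


def skip_simplified(node, two_nodes_no):
    """Model of: check_var('TWO_NODES', 'no')."""
    return two_nodes_no


def differing_cases(nodes=(1, 2, 3)):
    """Return the (node, two_nodes_no) pairs where the two conditions disagree."""
    return [
        (node, flag)
        for node, flag in product(nodes, (False, True))
        if skip_combined(node, flag) != skip_simplified(node, flag)
    ]


if __name__ == "__main__":
    print(differing_cases())
```

Under these assumptions the only disagreement is a third node in a scenario that has not set TWO_NODES=no, which is exactly the case the node checks are being kept for.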

dzedro · 2022-08-08T06:48:05Z

tests/ha/filesystem.pm

        $resource = 'drbd_active';
    }
+    elsif ($tag eq 'skip_fs_test') {


Isn't this a duplicate of line 21?
And if not, it should be at the beginning, because the drbd_passive tag would be evaluated before this skip and the test would thus not be skipped.

alvarocarvajald · 2022-08-09

> Isn't this duplicate of line 21 ?

Not exactly the same. L21 would skip the test when not on drbd Maintenance Updates, but continue it if testing a drbd MU, such as in https://openqa.suse.de/tests/9247409#step/drbd_passive/5

L44, on the other hand, skips the test in scenarios where it would not work (such as with 3 nodes), per L47-L51 in ha/drbd_passive.

> And if it should be at the beginning because tag drbd_passive would be evaluated before this skip thus not skip.

Not a problem. $tag would be one or the other, and never both: https://github.com/os-autoinst/os-autoinst-distri-opensuse/blob/master/lib/hacluster.pm#L458-L480

(This could probably have been implemented with get_var() and set_var() instead; I have no idea why Loic went with a local file on the cluster nodes for these tags. It could be a candidate for a rewrite in the future.)

dzedro · 2022-08-09

But instead of two places where the test returns, there could be just one at the beginning of the if statement. Otherwise fine.

alvarocarvajald · 2022-08-09

Ah, got it. Let me move both exit conditions to the same line and test.

alvarocarvajald · 2022-08-09

New verification runs:

Alpha Cluster on Milestone Build Validation: node 1, node 2 & support server

Cannot run VR with the drbd MU as the incident repository is gone: http://mango.qa.suse.de/tests/4893#step/iscsi_client/12

Should we merge with the new change, or should I roll back 7997ebe?

dzedro · 2022-08-09

Merge this PR; you are writing the skip_fs_test tag in the first commit.

dzedro reviewed Aug 8, 2022

alvarocarvajald changed the title ~~Skip ha/drbd_passive test when not in TWO_NODE scenario~~ Skip ha/drbd_passive test when not in TWO_NODES scenario Aug 9, 2022

Skip ha/drbd_passive test when not in TWO_NODE scenario

81dbecc

Also skip related ha/filesystem test as if ha/drbd_passive test module is skipped, there would be no block device on which to create the FS.

alvarocarvajald force-pushed the drbd-on-3nodes branch from 7997ebe to 81dbecc Compare August 10, 2022 09:16

alvarocarvajald merged commit 6663dfc into os-autoinst:master Aug 10, 2022

alvarocarvajald deleted the drbd-on-3nodes branch August 10, 2022 09:57
