
378 test queue parameter #380

Draft · wants to merge 15 commits into base: main

Conversation

RamonAra209 (Collaborator)

No description provided.

@RamonAra209 RamonAra209 added the testing Related to tests or test infrastructure label May 1, 2023
@RamonAra209 RamonAra209 requested a review from hategan May 1, 2023 21:26
@RamonAra209 RamonAra209 self-assigned this May 1, 2023
@RamonAra209 RamonAra209 linked an issue May 1, 2023 that may be closed by this pull request
hategan (Collaborator) commented May 2, 2023

Thank you @RamonAra209!
There are some type and linting errors that should be fixed.

valid_queues = []
out = "".join(os.popen("bqueues -u $(whoami) -o 'QUEUE_NAME NJOBS PEND RUN SUSP STATUS'").read()).split("\n")
out = [l for l in out if len(l) != 0]
out = [l.split(" ") for l in out]
hategan (Collaborator):
out seems to be both List[str] and List[List[str]] here. It might be best if we didn't overload it like that.
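A minimal sketch of the suggested split, one variable per shape (the function name and the sample output below are illustrative, not the PR's actual code):

```python
from typing import List

def parse_queue_names(raw: str) -> List[str]:
    # Keep the raw lines and the split fields in separately named
    # (and separately typed) variables instead of reusing one name.
    lines: List[str] = [line for line in raw.split("\n") if line]
    fields: List[List[str]] = [line.split() for line in lines]
    # The first column is QUEUE_NAME; skip the header row.
    return [row[0] for row in fields[1:]]

sample = (
    "QUEUE_NAME NJOBS PEND RUN SUSP STATUS\n"
    "pbatch 4 1 3 0 Open:Active\n"
    "pdebug 0 0 0 0 Open:Active"
)
print(parse_queue_names(sample))  # -> ['pbatch', 'pdebug']
```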

RamonAra209 (Collaborator, Author):
Done


def get_lsf_queues() -> List[str]:
valid_queues = []
out = "".join(os.popen("bqueues -u $(whoami) -o 'QUEUE_NAME NJOBS PEND RUN SUSP STATUS'").read()).split("\n")
hategan (Collaborator):
bqueues appears both here and in SCHEDULER_COMMANDS. Is the latter not used?

RamonAra209 (Collaborator, Author):
Done, now using the dictionary as intended
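For illustration, a lookup table along these lines could back that change (the actual shape and contents of SCHEDULER_COMMANDS in the PR may differ):

```python
from typing import Dict

# Hypothetical shape for SCHEDULER_COMMANDS: scheduler name mapped to
# the command that lists its queues.
SCHEDULER_COMMANDS: Dict[str, str] = {
    "slurm": "squeue",
    "lsf": "bqueues",
    "pbs": "qstat",
}

def queue_listing_command(scheduler: str) -> str:
    # Look the command up instead of hard-coding "bqueues" inline.
    return SCHEDULER_COMMANDS[scheduler]

print(queue_listing_command("lsf"))  # -> bqueues
```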

Comment on lines +89 to +93
slurm_queues = get_slurm_queues()
lsf_queues = get_lsf_queues()

queues.extend(slurm_queues)
queues.extend(lsf_queues)
hategan (Collaborator):
This is kind of a general note, but we know what scheduler we have from execparams.

That's not the main point I wanted to make, though. The idea of running all possible get_*_queues() functions and merging the results, on the assumption that at most one of them returns non-empty results, probably works. But it does so in an unnecessarily convoluted way, and it relies on an assumption that we don't need to make or reason through.

Comment on lines +95 to +98
if len(slurm_queues) != 0:
scheduler = "slurm"
elif len(lsf_queues) != 0:
scheduler = "lsf"
hategan (Collaborator):
I see.

So execparams is there to parametrize executors when multiple executors are available on a system. For example, on a SLURM system, a test with an execparams parameter will be invoked multiple times, once for each combination of executor in ["local", "batch-test", "slurm"] × launcher in ["single", "multiple", "mpirun", "srun"].

If you ignore execparams and detect what's installed the way it's done here, it will work, but it will run the same test multiple times for no good reason.

Instead, we should run this test on only one of the launchers (the launcher doesn't matter because we don't actually care about launching a job in this test) and using all executors. So something like if execparams.launcher == 'single' then do what we need to do with the assumption that our scheduler is execparams.executor.
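That suggestion might look roughly like this (ExecParams here is a stand-in for the suite's real execparams fixture, so the attribute names are assumptions):

```python
import pytest
from dataclasses import dataclass

# Stand-in for the test harness's execparams object; the real one
# is provided by the PSI/J test infrastructure.
@dataclass
class ExecParams:
    executor: str
    launcher: str

def scheduler_for_queue_test(execparams: ExecParams) -> str:
    # Run the queue test once per executor, on a single launcher;
    # the launcher doesn't matter for this test, so skip the rest.
    if execparams.launcher != "single":
        pytest.skip("queue test only needs one launcher")
    # Take the scheduler from execparams instead of probing the system.
    return execparams.executor

print(scheduler_for_queue_test(ExecParams(executor="slurm", launcher="single")))  # -> slurm
```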

RamonAra209 (Collaborator, Author):
Can you take a look at my implementation? I kept the way I was detecting it, but am now only running the test when execparams.launcher == "single".

hategan (Collaborator):
It's not about detecting the LRM on the system but the fact that we test multiple executors on that system. So even if you restrict it to the single launcher, it will still be repeated for the local, batch-test, and whatever PSI/J detected to be the scheduler.

You could remove execparams, but then you risk not having access to other necessary parameters that might be set by the users that set up the tests. By the way, you may want to use execparams.custom_attributes, since some systems require setting various things, like an account or project.

scheduler = "lsf"

if len(queues) < 2:
pytest.raises(Exception("Need at least two queues to perform this test"))
hategan (Collaborator):
I think you are looking for pytest.skip instead.
pytest.raises is used to test that a block of code throws a specific exception. For example,

def test_that_division_by_zero_correctly_raises_exception():
    with pytest.raises(ZeroDivisionError):
        1 / 0

In other words, you use it to check that your test throws an exception. If you had 1 / 1 instead of 1 / 0, pytest.raises would actually cause the test to fail, because the expected exception was not thrown.
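Conversely, a skip guard for this test could look like the following (the test body and queue list are hypothetical):

```python
import pytest

def test_job_runs_in_requested_queue():
    # Hypothetical setup: only one queue happens to be visible here.
    queues = ["pbatch"]
    if len(queues) < 2:
        # pytest.skip marks the test as skipped and stops it here;
        # pytest.raises would instead assert that the code below
        # throws a given exception, which is not what we want.
        pytest.skip("need at least two queues to perform this test")
    assert queues[0] != queues[1]
```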


job1 = make_job(test_queues[0])
executor.submit(job1)
qstat = get_queue_info(scheduler)
hategan (Collaborator):
Most of these commands accept a job id as an argument to only return info about a specific job.
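A sketch of such per-job queries (the helper name and exact output formats are assumptions, but the per-job invocations are the standard ones: squeue -j <id> for SLURM, bjobs <id> for LSF, qstat <id> for PBS):

```python
def queue_info_command(scheduler: str, native_id: str) -> str:
    # Build a status command scoped to a single job rather than
    # listing the whole queue and filtering afterwards.
    commands = {
        "slurm": "squeue -j {} -o '%i %P'".format(native_id),
        "lsf": "bjobs -o 'jobid queue' {}".format(native_id),
        "pbs": "qstat {}".format(native_id),
    }
    return commands[scheduler]

print(queue_info_command("lsf", "4775749"))  # -> bjobs -o 'jobid queue' 4775749
```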

RamonAra209 (Collaborator, Author):
Done, updated the qstat equivalents to query just for the job.

Comment on lines 113 to 114
job1_qstat_entry = [l for l in qstat if job1._native_id in l][0]
assert test_queues[0] in job1_qstat_entry
hategan (Collaborator):
We might want to go a bit further than just checking if the queue name is somewhere in the qstat output. It's quite possible that we might have a queue named "test" and the word "test" appearing in an unrelated place in the qstat output.

RamonAra209 (Collaborator, Author):
Done, I applied some custom formatting to the qstat equivalents, so they now report only the queue name. This allows me to keep the existing assert logic.

Ex:

$ bjobs -o "queue" 4775749
QUEUE
pbatch

codecov bot commented May 4, 2023

Codecov Report

Merging #380 (7a219d7) into main (c143685) will decrease coverage by 0.56%.
The diff coverage is 48.05%.

@@            Coverage Diff             @@
##             main     #380      +/-   ##
==========================================
- Coverage   69.26%   68.70%   -0.56%     
==========================================
  Files          74       75       +1     
  Lines        3208     3285      +77     
==========================================
+ Hits         2222     2257      +35     
- Misses        986     1028      +42     
Impacted Files Coverage Δ
tests/test_queue.py 48.05% <48.05%> (ø)

... and 1 file with indirect coverage changes

Labels
testing Related to tests or test infrastructure
Development

Successfully merging this pull request may close these issues.

Test queue parameter
2 participants