pytest: Mark tests using space_time_raster_dataset as needs_solo_run #3939
Conversation
Also note that the "coverage" change you might see is still flawed, as this PR adds new files "seen" by the coverage tool. It doesn't know about all the files in the repo yet, so it now sees 54 more new files.
Wild guess without checking the code: could it be an SQLite locking issue due to concurrent (read/)write access to SQLite?
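That guess can be reproduced in miniature outside GRASS. Below is a hedged sketch (the file name, table, and values are all hypothetical, not taken from the TGIS code) of two connections contending for SQLite's single write lock: while one holds a write transaction, a second writer with a zero busy timeout fails immediately instead of waiting.

```python
import os
import sqlite3
import tempfile

db = os.path.join(tempfile.mkdtemp(), "tgis.sqlite")  # hypothetical path

# First "worker": open a write transaction and hold the lock.
writer = sqlite3.connect(db, isolation_level=None)  # autocommit; we issue BEGIN ourselves
writer.execute("CREATE TABLE maps (name TEXT)")
writer.execute("BEGIN IMMEDIATE")  # acquires SQLite's single write lock
writer.execute("INSERT INTO maps VALUES ('raster_1')")

# Second "worker": with a zero busy timeout, a concurrent write fails
# right away instead of retrying until the lock is released.
err = None
other = sqlite3.connect(db, timeout=0)
try:
    other.execute("INSERT INTO maps VALUES ('raster_2')")
except sqlite3.OperationalError as exc:
    err = str(exc)

print(err)
other.close()
writer.execute("ROLLBACK")
writer.close()
```

With a non-zero `timeout` (the sqlite3 default is 5 seconds), the second writer instead blocks and retries, which under many parallel workers shows up as long runtimes rather than immediate errors.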
I'm not sure, as trying to trace what is called within the temporal modules is a nightmare: everything gets invoked/imported. Technically it is supposed to be 7 rasters, with that fixture valid for all the tests, but something isn't quite right. Once coverage is properly set up (maybe once C code is also tracked), it will be useful to change the tests incrementally and see what changes, to simplify redundant tests (by comparing the coverage provided by one test vs. another). In a separate attempt last weekend to profile some tests (gunittest), I started with only one subfolder in temporal, t.rast.algebra, and a lot of time (18%) was spent in a single inner loop of PLY that checked whether an element was in a list. Other than that, a big chunk of time was wasted importing tgis, as it imports almost everything with star imports. So the problem can be anywhere. For now, the goal is only to prevent useless failures, so we get correct feedback earlier (without waiting to retry a job).
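For a sense of scale, here is a toy measurement (illustrative only, not GRASS or PLY code) of why an `elem in some_list` check inside a hot parser loop can dominate a profile: list membership is a linear scan, while set membership is a hash lookup.

```python
import timeit

items = list(range(2000))
as_list = items
as_set = set(items)

# Worst case for the list: the element is at the end, so every check
# scans the full 2000 entries; the set does a single hash lookup.
t_list = timeit.timeit(lambda: 1999 in as_list, number=10_000)
t_set = timeit.timeit(lambda: 1999 in as_set, number=10_000)

print(f"list: {t_list:.4f}s  set: {t_set:.4f}s")
```

The gap grows linearly with the size of the list, so a membership test that is harmless on small grammars can become the top line of a profile on larger ones.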
I have other changes queued for these files (dating from the last day of the sprint), so I'm waiting for this to be merged first.
I had to rerun about 4-5 pytest failures on main that passed on a second try. I would have hoped that this fixed them.
Now I'm trying to push for this PR to be merged, as we still have some flaky pytest timeouts that I'm convinced this will help with. It was also blocking me from continuing other pytest improvements last weekend. This weekend will be quieter on my end, though. There's nothing left to change in this PR.
I also guess SQLite DB locking may cause the issues with long runtimes/timeouts. The changes seem fine, and at the very least help with identifying the root cause of the long run times...
I'm curious to see whether the dbif.close() changes from https://github.com/OSGeo/grass/pull/3996/files#diff-30a61f23aa90129beedbff09dd723876e2a54703c7355eaef4e45a8827eaecda would change something. They haven't been in long enough to tell, and for the CI on my PR, which predates all of them, I think I still caught some failures.
I agree, and for me it is not clear either how the TGIS temporal database interactions are managed overall, or what the overarching concept in DB handling is, especially in combination with the TGIS Python objects, for example...
The test fixture space_time_raster_dataset, used in some pytest tests, including some of the Jupyter tests, is unexpectedly slow at the setup stage when running under pytest with multiple workers. There might be a deadlock problem, or it might simply use many processes or threads itself. Since it causes intermittent timeouts, this PR adds the needs_solo_run marker to these tests so that they run separately, without other tests in parallel. The setup stage is still slow, but after a couple of tries I strongly expect this to prevent some of the timeouts that were still present after #3879.
On the positive side, there are (hopefully) fewer flaky tests to retry, which can cause delays when there is a lot of activity. On the negative side, since these tests are slow AND the setup phase of that fixture is also slow, the overall time taken increases compared to the sum of the parallel and serial test step durations. That means we are temporarily giving up some of the speedup from running them in parallel. It is still better than having no parallel workers, as before. Either way, some of the deadlock issues, if they are caused by multiprocessing using the fork start method, will need to be fixed in order to run any of these tests on macOS and Windows. That is a work in progress (not this PR; this PR is ready).
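The parallel/serial split described above can be sketched as follows. Only the needs_solo_run marker name comes from this PR; the temporary project layout and test file contents are hypothetical, and the real parallel step would additionally pass pytest-xdist's -n option, which is omitted here to keep the sketch dependency-free.

```python
import os
import subprocess
import sys
import tempfile
import textwrap

d = tempfile.mkdtemp()

# Register the marker so pytest does not warn about an unknown mark.
with open(os.path.join(d, "pytest.ini"), "w") as f:
    f.write(
        "[pytest]\n"
        "markers =\n"
        "    needs_solo_run: must not run alongside other tests\n"
    )

# Hypothetical test file: one marked test, one ordinary test.
with open(os.path.join(d, "test_demo.py"), "w") as f:
    f.write(textwrap.dedent("""\
        import pytest

        @pytest.mark.needs_solo_run
        def test_marked_solo():
            assert True

        def test_regular():
            assert True
    """))

# Parallel CI step: run everything except the marked tests.
parallel = subprocess.run(
    [sys.executable, "-m", "pytest", "-q", "-m", "not needs_solo_run", d],
    capture_output=True, text=True,
)
# Serial CI step: run only the marked tests, with nothing else in parallel.
serial = subprocess.run(
    [sys.executable, "-m", "pytest", "-q", "-m", "needs_solo_run", d],
    capture_output=True, text=True,
)
print(parallel.stdout)
print(serial.stdout)
```

Each step reports the other step's test as deselected, which is what makes the sum of the two step durations longer than a single fully parallel run would be.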