-
Notifications
You must be signed in to change notification settings - Fork 106
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flaky tests using MPI #600
Comments
The job you have linked actually succeeded. Do you have an example for a failing job? |
Not atm. This is basically just an FYI for when you come across a failed MPI job. |
One example of a failing test is here, where the problem seems to be sqlite related: https://zivgitlab.uni-muenster.de/pymor/pymor/-/jobs/32104 What is kind of strange: by default there is a timeout of 5 seconds on aquiering a lock, and I don't see that there is any operation which might take any longer .. |
Suggestion 1: I replace the SQLiteRegion implementation with a new DiskRegion based on https://github.com/grantjenks/python-diskcache Suggestion 2: use an ORM like sqlachemy to abstract to a SQLRegion implementation where the actual database connection is a configuration detail. |
|
We haven't seen MPI tests failing since the adoption of diskcache. Let's hope for the best and close this issue. |
Since I haven't been able to quickly determine what's going on with the newly failing MPI tests, I propose to disable them until I've found a solution. Which might take a couple of weeks until I really have time for that. |
If there are none, I'll merge #735 when ready and rebase open PR branches on that |
None from me. |
I've rebased the active PRs where jobs were failing in MPI on master now. |
It looks like the 'MPI' CI job sometimes fails due to some intermittent system condition. I have not been able to reproduce it yet. A restart of the job fixes it.
The text was updated successfully, but these errors were encountered: