Workflow tests #187

liamhuber · 2023-10-12T17:05:39Z

@jan-janssen, some of this might be duplicate, e.g. exceptions seem to be tested here, but for now I thought I'd throw all the tests I had at it and we can trim them down as needed -- I figure you'll know better where to look for duplication anyhow #159

Right now I use PyMPISingleTaskExecutor, but if you think there's good cause we can switch this over to PyMPIExecutor.

Purpose (copied from the init notes)

pympipool should be able to handle the case where no elements of the execution can be pickled with the traditional pickle module but rather require cloudpickle.

This is particularly important for compatibility with pyiron_workflow, which dynamically defines (unpickleable) all sorts of objects.

Currently, pyiron_workflow defines its own executor, pyiron_workflow.executors.CloudPickleProcessPool, which can handle these unpickleable things, but is otherwise very primitive compared to pympipool.mpi.executor.PyMPISingleTaskExecutor.

Simply replacing CloudPickleProcessPool with PyMPISingleTaskExecutor in the pyiron_atomistics tests mostly works OK, and work perfectly when the tests are ported to a notebook, but some tests hang indefinitely on CI and running unittests locally.

To debug this, we break the tests up into their individual components (so hanging doesn't stop us from seeing the results of other tests). Once everything is running, these can be re-condensed into a single test file and the entire new tests subdirectory can be deleted.

Unrelated

I finally tracked down why I was unable to push directly to pyiron/atomistics: my remote was set using https, and pycharm wasn't playing nicely with my 2FA. Switching over to ssh (git@...) solves things nicely.

However, here the pyiron admin team still doesn't seem to have admin rights. This confuses me a bit, since over on the new pyiron_workflow that "administration" and "pyiron" teams got their rights automatically --maybe this is a (positive) side effect of using the module template? But I would have expected this behaviour from just creating a repo inside the org... Anyhow, just a heads-up that the repo settings should probably be adjusted.

liamhuber · 2023-10-12T17:45:38Z

The tests all pass, but that's because the new ones aren't actually getting run. I expected something like run: coverage run -m unittest discover tests (this is also similar in non-centralized repos), but the tests here are explicitly only scraping for the first layer of test files matching a pattern: run: for f in $(ls tests/test_*.py); do echo $f; python -m unittest $f; done. I'm going to be super pragmatic and just push the new tests up a level instead of messing with the CI.

liamhuber · 2023-10-12T18:00:22Z

Splitting things into different test files didn't have the desired effect -- the entire action is timing out on the first test (test_args, which I am indeed expecting to hang in a non-notebook setting). However, @jan-janssen, since you said you were pretty confident you knew what needed to be done to resolve this hanging, I'm just going to leave it as-is -- you can make the necessary modifications in this branch, then we should see all the tests pass, then we can merge the new tests into a single file.

liamhuber · 2023-10-12T18:00:52Z

A bit more disconcerting to me is that I'm making a PR from a fork, but on: pull_request is triggering and using actions/checkout to get the fork's code. My understanding was that on: pull_request should not be treating forked PRs this way, but rather only checking out the target branch's code, because otherwise it poses a security risk. @jan-janssen, did you come in here and approve the tests super fast or something? Or does GitHub somehow recognize that I'm a pyiron-org member and this is a pyiron-org repo, even though the repo teams don't seem to be set up? I'm really not an expert on the security side of things, but I know just enough to be a bit nervous about what I see here. @niklassiemer, any wisdom?

liamhuber · 2023-10-12T18:04:40Z

A bit more disconcerting to me is that I'm making a PR from a fork, but on: pull_request is triggering and using actions/checkout to get the fork's code. My understanding was that on: pull_request should not be treating forked PRs this way, but rather only checking out the target branch's code, because otherwise it poses a security risk. @jan-janssen, did you come in here and approve the tests super fast or something? Or does GitHub somehow recognize that I'm a pyiron-org member and this is a pyiron-org repo, even though the repo teams don't seem to be set up? I'm really not an expert on the security side of things, but I know just enough to be a bit nervous about what I see here. @niklassiemer, any wisdom?

Indeed, I'm quite confused why the workflow here is seeing my PR code and not the main branch code. From the github docs:

For pull requests from a forked repository to the base repository, GitHub sends the pull_request, issue_comment, pull_request_review_comment, pull_request_review, and pull_request_target events to the base repository. No pull request events occur on the forked repository.

When a first-time contributor submits a pull request to a public repository, a maintainer with write access may need to approve running workflows on the pull request. For more information, see "Approving workflow runs from public forks."

Maybe I'm not a "first-time contributor" here and I've just forgotten...

liamhuber · 2023-10-12T18:06:35Z

Maybe I'm not a "first-time contributor" here and I've just forgotten...

Nope, just went over to PR's and filtered by myself and this is my only one. ... @jan-janssen I really hope you just snuck in and approved these tests, otherwise my understanding of on: pull_request is severely deficient 😝

niklassiemer · 2023-10-12T19:44:31Z

I think since you happen to be a maintainer with write access yourself this might already allow you to run these actions without a approval. Lets test that with a new github account?
Otherwise, the on: pull_request should (to my understanding) check out the final branch (i.e. the potential result of a merge to main) to run the tests. Otherwise running any tests is useless (the target branch should have been tested already).
One difference is the exposed credentials. If run from a fork, there should not be any of the repos credentials available in the workflow! Therefore, things like codacy are bound to fail due to missing credentials.

liamhuber · 2023-10-12T21:00:51Z

I think since you happen to be a maintainer with write access yourself this might already allow you to run these actions without a approval. Lets test that with a new github account?

That's what disturbs me; on this particular repo I'm not a maintainer. I also wasn't allow to push a branch directly to the repo, so I don't think I even have write access.

Otherwise, the on: pull_request should (to my understanding) check out the final branch (i.e. the potential result of a merge to main) to run the tests. Otherwise running any tests is useless (the target branch should have been tested already).
One difference is the exposed credentials. If run from a fork, there should not be any of the repos credentials available in the workflow! Therefore, things like codacy are bound to fail due to missing credentials.

Ah, indeed. And none of the run stuff uses credentials so that all tracks. Thanks for the clarification!

The docs state " maintainer with write access may need to approve running workflows on the pull request" (my emphasis); in the settings for another repo I can see there is the option to not require approval, so maybe that's all that's going on here.

jan-janssen · 2023-10-13T06:21:07Z

However, here the pyiron admin team still doesn't seem to have admin rights. This confuses me a bit, since over on the new pyiron_workflow that "administration" and "pyiron" teams got their rights automatically --maybe this is a (positive) side effect of using the module template?

The permissions still have to be set manually, I did this for pyiron_workflow and I now fixed the permission for pympipool.

jan-janssen · 2023-11-02T09:07:47Z

I feel the issues here are related to the ones we see in #162

liamhuber · 2023-11-02T13:41:34Z

I feel the issues here are related to the ones we see in #162

That is a closed pull request with no commentary... could you provide a bit of context?

jan-janssen · 2023-11-02T13:54:57Z

I feel the issues here are related to the ones we see in #162

That is a closed pull request with no commentary... could you provide a bit of context?

I just merged the latest changes from main to see if it solves the issue but it does not.

liamhuber · 2023-11-07T21:22:07Z

As discussed in #210, these are now working. I just merged main and they run great on my local machine inside pycharm too.

I'll recombine these into something sensible and polish it up!

Don't double down on the word "result"

liamhuber · 2023-11-07T22:31:28Z

======================================================================
ERROR: test_timeout (test_with_dynamic_objects.TestDynamicallyDefinedObjects)
Timeouts for dynamically defined callables should be handled ok.
----------------------------------------------------------------------
Traceback (most recent call last):
  File "/home/runner/work/pympipool/pympipool/tests/test_with_dynamic_objects.py", line 207, in test_timeout
    fs.result(timeout=0.0001)
  File "/usr/share/miniconda3/envs/test/lib/python3.10/concurrent/futures/_base.py", line 460, in result
    raise TimeoutError()
concurrent.futures._base.TimeoutError

----------------------------------------------------------------------

I'm rather confused why this is throwing an error -- line 207 in test_timeout is literally wrapped in a with self.assertRaises( TimeoutError,, this is exactly the behaviour we want. Also it runs fine on my local machine.

I briefly thought I had this nailed when I realized I'm using the builtin TimeoutError, but concurrent.futures._base.TimeoutError is literally just an alias to the builtin guy.

liamhuber · 2023-11-07T22:33:00Z

Ahhh, but my local interpreter is 3.11 and these are all running for 3.11 here too. Ok, that's a solid lead.

liamhuber · 2023-11-07T22:35:58Z

I have to go now, but I'll come back to this tomorrow to look at:
a) Why this fails for <3.11
b) Why it won't plug-and-play with pyiron_workflow (somehow the running attribute fails to get updated to False, even though the callback is clearly triggering and the object id is the exact same inside the callback and outside it. This should be an identical case to test_dynamic_callable, which also registers a done callback and is working fine.

liamhuber · 2023-11-08T02:18:18Z

Huh, it looks like concurrent.futures had its own TimeoutError until 3.11? At any rate my very naive fix actually worked for all but mpich on 3.11. That simply timed out. I'll restart it for now, but if it times out again I'll dig into it tomorrow

jan-janssen · 2023-11-08T07:14:50Z

The issue with the MacOS tests failing was still related to #191 I restarted the tests a couple of times, but now they are working fine.

liamhuber · 2023-11-08T16:00:47Z

Super.

Let me try to track down why it's still not working with the workflows, then I'll merge. Since we test the callback I'm hopeful the problem is purely on the workflow side, but there might be a more complex test case I still need to add here.

Unsurprisingly, this passed totally fine on my local machine.

liamhuber · 2023-11-09T19:56:49Z

Super.

Let me try to track down why it's still not working with the workflows, then I'll merge. Since we test the callback I'm hopeful the problem is purely on the workflow side, but there might be a more complex test case I still need to add here.

This was resolved in #214. This guy should be good to go now.

liamhuber · 2023-11-09T20:12:50Z

Just rerunning now until mac rolls a nat 20

liamhuber added 2 commits October 12, 2023 09:17

Add gitignore

de2fad2

Reproduce and split up pyiron_workflow executor tests

d0c6676

Flatten the tests so the CI command can actually see them

1728524

liamhuber assigned jan-janssen Oct 16, 2023

liamhuber mentioned this pull request Oct 19, 2023

Executors for composite pyiron/pyiron_workflow#39

Merged

This was referenced Nov 5, 2023

Next steps pyiron/pyiron_workflow#68

Open

Test Workflows again #210

Closed

Merge branch 'main' into workflow_tests

890fd27

liamhuber added 7 commits November 7, 2023 13:35

Recombine all similar tests into a single module

2270484

Directly test the user-facing executor

4a2af62

Replace "unpickleable" with "dynamic"

820e14e

Fix grammar for indefinite articles after consonant-vowel switch

de20716

Complete docstrings for the individual tests

f5cfdab

Use better variable name

7cef19b

Don't double down on the word "result"

Complete messages for asserts

f0ec5b6

Naively try including the error that's breaking things

c407511

liamhuber added 6 commits November 8, 2023 08:43

Simplify test names

847a0a5

Ensure that callback _methods_ can modify their owners

54bfa60

Unsurprisingly, this passed totally fine on my local machine.

Refactor: rename variable

02d5e71

Refactor: rename function

109f612

Merge branch 'main' into workflow_tests

95c84e5

Use universal executor

7f8dcf0

This was referenced Nov 8, 2023

[bug] futures negatively impacted by garbage collection #213

Closed

Depend on pympipool instead of CloudPickleProcessPoolExecutor pyiron/pyiron_workflow#19

Closed

Don't wait on deletion #214

Merged

Merge branch 'main' into workflow_tests

f8fb784

jan-janssen merged commit 893ec2a into pyiron:main Nov 9, 2023
17 checks passed

liamhuber deleted the workflow_tests branch November 9, 2023 20:57

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Workflow tests #187

Workflow tests #187

liamhuber commented Oct 12, 2023

liamhuber commented Oct 12, 2023 •

edited

Loading

liamhuber commented Oct 12, 2023

liamhuber commented Oct 12, 2023

liamhuber commented Oct 12, 2023

liamhuber commented Oct 12, 2023

niklassiemer commented Oct 12, 2023

liamhuber commented Oct 12, 2023

jan-janssen commented Oct 13, 2023

jan-janssen commented Nov 2, 2023

liamhuber commented Nov 2, 2023

jan-janssen commented Nov 2, 2023

liamhuber commented Nov 7, 2023

liamhuber commented Nov 7, 2023

liamhuber commented Nov 7, 2023

liamhuber commented Nov 7, 2023

liamhuber commented Nov 8, 2023

jan-janssen commented Nov 8, 2023

liamhuber commented Nov 8, 2023

liamhuber commented Nov 9, 2023

liamhuber commented Nov 9, 2023

Workflow tests #187

Workflow tests #187

Conversation

liamhuber commented Oct 12, 2023

Purpose (copied from the init notes)

Unrelated

liamhuber commented Oct 12, 2023 • edited Loading

liamhuber commented Oct 12, 2023

liamhuber commented Oct 12, 2023

liamhuber commented Oct 12, 2023

liamhuber commented Oct 12, 2023

niklassiemer commented Oct 12, 2023

liamhuber commented Oct 12, 2023

jan-janssen commented Oct 13, 2023

jan-janssen commented Nov 2, 2023

liamhuber commented Nov 2, 2023

jan-janssen commented Nov 2, 2023

liamhuber commented Nov 7, 2023

liamhuber commented Nov 7, 2023

liamhuber commented Nov 7, 2023

liamhuber commented Nov 7, 2023

liamhuber commented Nov 8, 2023

jan-janssen commented Nov 8, 2023

liamhuber commented Nov 8, 2023

liamhuber commented Nov 9, 2023

liamhuber commented Nov 9, 2023

liamhuber commented Oct 12, 2023 •

edited

Loading