pypestworker #628
Replies: 3 comments 7 replies
-
|
hey @cnicol-gwlogic - any chance you could send me a test model? |
Beta Was this translation helpful? Give feedback.
-
|
@cnicol-gwlogic can you check the rmr file to see if the master is requesting to kill runs? The pypestworker does not implement the kill request, so Im wondering the master is sending a kill request to the worker and the worker is ignoring it and then when run that should have been killed finishes, the worker sends it to the master and thats where things get mixed up? |
Beta Was this translation helpful? Give feedback.
-
|
fyi - same problem on straight-up ubuntu as via WSL. |
Beta Was this translation helpful? Give feedback.

Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
I've been using
pypestworkera lot (for pastas models with some non-stock stuff to slow things down a bit). I find that for larger pest problems (with longer forward run times) I run into issues. I have played withtimeout(andsocket_timeout) a lot, and sometimes this can resolve my issue. But this is what I tend to see (from pest.rmr):It seems like ppw / netpack has killed/restarted the ppw instance while the model is still running/post-processing, so is being sent results from a killed worker (would that make sense?). If that is the case, I wonder if we need a check for process return codes before deciding to kill the worker - eg
multiprocessing.Process().exitcode # None=still running, 0=success, >0=error- but I can't quite see where the kill/reconnect process happens (maybe it doesn't...).Any thoughts / tips on this? Thanks a lot.
Beta Was this translation helpful? Give feedback.
All reactions