Fixes #722 issue affecting failed workflows #728

muhrin · 2017-09-27T13:29:15Z

We overwrote the constructor for Persistence but did not call the super
class. This was a problem because the super constructor was actually
adding the class as a listener of the MONITOR. It is the MONITOR that
would tell the persistence when a process had crashed (because of
exception). This means that the pickle was not being moved to failed
and therefore the workflow would be re-ran.

We overwrote the constructor for Persistence but did not call the super class. This was a problem because the super constructor was actually adding the class as a listener of the MONITOR. It is the MONITOR that would tell the persistence when a process had crashed (because of exception). This means that the pickle was not being moved to failed and therefore the workflow would be re-ran.

sphuber · 2017-09-27T13:31:08Z

Martin! Good to hear from you mate. I am curious though, why did this not happen when I just ran the daemon as opposed to manually calling tick_workflow_engine?

muhrin · 2017-09-27T13:49:02Z

@sphuber I asked myself the same thing. Turns out the bug was sorta there with the daemon as well. What would happen is that the process would crash but the flock would remain (because persistence hadn't received the message about it failing). So the daemon wouldn't try to re-run it...until it was restarted. So it's a little subtle to notice.

sphuber · 2017-09-27T14:09:31Z

Pulled the changes locally and the zombie test now runs as expected.

lekah · 2017-09-27T20:27:06Z

Thanks guys, it also works for me!

muhrin requested a review from lekah September 27, 2017 13:29

Merge branch 'develop' into fix_722_failed_wf

bc91c96

sphuber approved these changes Sep 27, 2017

View reviewed changes

sphuber merged commit 1aa2bd3 into aiidateam:develop Sep 27, 2017

muhrin deleted the fix_722_failed_wf branch September 27, 2017 17:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fixes #722 issue affecting failed workflows #728

Fixes #722 issue affecting failed workflows #728

muhrin commented Sep 27, 2017

sphuber commented Sep 27, 2017

muhrin commented Sep 27, 2017

sphuber commented Sep 27, 2017

lekah commented Sep 27, 2017

Fixes #722 issue affecting failed workflows #728

Fixes #722 issue affecting failed workflows #728

Conversation

muhrin commented Sep 27, 2017

sphuber commented Sep 27, 2017

muhrin commented Sep 27, 2017

sphuber commented Sep 27, 2017

lekah commented Sep 27, 2017