Reverse actor stopping order (was: Check that actor is still runable after message has been recieved.) #8

adamcik · 2011-06-26T16:01:23Z

On line https://github.com/jodal/pykka/blob/master/pykka/actor.py#L164 the check for the run-able state is done before inbox.get() which can block for any amount of time, during this waiting run-able can very well have changed.

Symptom of this problem is not finding the refs that you expect in the actor registry. This happens easily when stress-testing with a lot of clients in my current mopidy branch. For each new connection a ThreadingActor is started and as socket data is received it is delivered as a message. Then I CTRL^C mopidy and it stops all the actors, however, since some of my actors have messages in their inbox still they will go on to handle them despite being stopped. Part of this handling includes looking for other non-running actors which of course fails.

The text was updated successfully, but these errors were encountered:

adamcik · 2011-06-26T16:26:25Z

Hmm, problem could also be due to simple thread switching, as I'm testing with while sleep 0.1; do mpc status &; done it is very much possible that a thread that has started running loses its established actor proxy midway. Not sure if this really can be solved within pykka.

adamcik · 2011-06-26T16:42:58Z

«The actor will finish processing any messages already in its queue before stopping. It may not be restarted.» - so I guess this is intended behavior. So, I guess my new question is, how def _run(self): manages to keep this promise. Based on my understanding of that pykka code, it would only handle the last message received before being set to not run-able.

jodal · 2011-06-29T11:27:49Z

When using ThreadingActor, the actor inbox is a Queue.Queue, which is a FIFO queue. Thus self.actor_inbox.get() will first return all messages queued before the stop message, then the stop message.

By using Queue.PriorityQueue, we could change the implementation to give stop messages higher priority than regular messages, and thus stop the actor faster, with probably less probability of the actor processing a message which requires interaction with other already stopped actors, causing an error before the stop message is processed.

To shut down actors in a sequence that takes dependencies into consideration, avoiding the case where living-soon-to-stop actors depends on stopped actors, I think we would need some kind of supervision hierarchy. If we're to go that way, we should probably have a second look at Erlang and/or Akka to see how this can be solved once and for all.

adamcik · 2011-06-29T11:31:18Z

I was thinking about rewriting mopidy's stop_all code to do frontends/lsiteners before backends, would probably be a fairly simple way of starting to improve/avoid this for mopidy's case.

jodal · 2011-07-03T21:19:15Z

For the records: after some discussion on IRC, we've landed at a simple solution that will make ActorRegistry.stop_all() work in simple cases:

By default, when block=False is not passed as an kwarg, ActorRegistry.stop_all() blocks after every stop() call to an actor, waiting for it to respond. This makes the order actors is stopped in deterministic.
By implementation, ActorRegistry appends actors to a list when they are registered.
Generally, if actor A depends on actor B, then actor B is started before actor A, yielding a list [B, A] in the ActorRegistry.
AFAICT, there is no advantage with stopping actors like we currently do, first one started is the first one stopped.
Thus, if we simply reverse the list of actors before we stop them, we will generally stop the actors which depends on other actors first.

In non-simple cases, the application developer will have to shut down actors in the correct order himself. This should be documented.

jodal closed this as completed in 35695b0 Jul 30, 2011

ghost assigned jodal Sep 19, 2012

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reverse actor stopping order (was: Check that actor is still runable after message has been recieved.) #8

Reverse actor stopping order (was: Check that actor is still runable after message has been recieved.) #8

adamcik commented Jun 26, 2011

adamcik commented Jun 26, 2011

adamcik commented Jun 26, 2011

jodal commented Jun 29, 2011

adamcik commented Jun 29, 2011

jodal commented Jul 3, 2011

Reverse actor stopping order (was: Check that actor is still runable after message has been recieved.) #8

Reverse actor stopping order (was: Check that actor is still runable after message has been recieved.) #8

Comments

adamcik commented Jun 26, 2011

adamcik commented Jun 26, 2011

adamcik commented Jun 26, 2011

jodal commented Jun 29, 2011

adamcik commented Jun 29, 2011

jodal commented Jul 3, 2011