Thoughts on Parallelism #47

koaning · 2018-01-02T21:42:47Z

We already apply some performance tricks with the .evaluate() mechanic but we may be able to add some form of parallelism/queing to perhaps make things even more performant.

In terms of easy win: it seems like the .map (and thus mutate) can be run in parallel in general. Same would hold for .evaluate() in the BasePopulation.

Do we want to explore this?

The text was updated successfully, but these errors were encountered:

rogiervandergeer · 2018-01-03T06:56:45Z

I think we should. If there is a way to easily make evol more than twice as performant, then we have to implement that.

But we should be careful. It is not difficult to come up with mutate functions which will break any parallelism (e.g. any lambda or a function which tracks state). Therefore we have to make the parallelism optional.

I think the biggest win can be achieved when working with islands. Then you can make basically everything parallel. So although it certainly doesn't hurt to work on this before we've implemented the islands, we need to make sure the two can work together.

jasondemorrow · 2018-12-27T15:27:55Z

FWIW, as a user of this library (great work BTW, thank you), I'd be fine with a set of "lower level" concurrency modules, provided with the caveat that mutate and breed should be implemented with care.

koaning · 2018-12-27T20:13:01Z

glad to hear you like it! we’re slowly considering how we might want to do things towards parellism/concurrency but we’re not super sure on what the best method is. we’re certainly open to suggestions. most likely we’ll offer a type of population that is able to distribute its workload. what is the use-case?

jasondemorrow · 2018-12-28T01:37:34Z

Yes, I'd like to speed up the evolution by distributing the work among CPU/GPU cores on one machine. Parallelism on several nodes would be great, but not an immediate need for me personally. It seems like the fitness evaluation is done serially across the population? I would think fitness evaluation could be spawned as a thread without much risk of concurrent access to shared resources.

koaning · 2018-12-28T10:17:06Z

Note that the fitness function is something that we currently evaluate lazily. Suppose that we do two mutate steps and then a survive step: we only need to evaluate an individual at the survive step, not at either mutation steps. The evaluation can be expensive, which is why the main tactic we deploy is to delay it.

Assuming that the functions that you supply to mutate aren't lambda functions it shouldn't be too difficult to use python's multiprocessing module to ensure that certain steps are able to run in parallel. This would initially be implemented in a ParallelPopulation. Would this work for your use-case? I think certain steps can be done in parallel (anything that is like a map) but other steps cannot easily work that way (anything that is like a reduce).

Note that a ParallelPopulation on a single machine is something we could start implementing on the short term, but a multi-machine approach would take a but more experience/investigation. I also think we'll limit the ParallelPopulation to CPU for the short term.

jasondemorrow · 2018-12-28T14:53:49Z

Thanks for the link. Coming from mostly a C++/Java background, I was interested to learn that Python implements threading much differently than I'd expect. But yes, the solution you suggest sounds perfect for my use case. I'll be glad to help in whatever way I can.

jasondemorrow · 2019-01-01T06:53:59Z

I've just submitted a PR with a very simple, arg-driven impl. using multiproc (the pathos port that uses dill in place of pickle). At one point I updated the population unit test to compare execution times. On my machine, evaluating a population with 3 concurrent workers was 3 times faster, as expected.

koaning · 2019-01-01T12:28:14Z

Interesting. I'll have a look, I've never had any experience with pathos. Is there a good reason to favour it over multiprocessing? At the moment our only dependency is pytest for testing and if possible we'd love to keep this package as light as possible.

@rogiervandergeer opinions?

jasondemorrow · 2019-01-01T18:19:57Z

The main reason is that, unlike pickle, dill is capable of serializing instance methods and lambdas so they can be piped to the new process. It's possible to drop that dependency, but (if I understand correctly) it would mean detaching all functions needed from their instances and making them module-scoped.

koaning · 2019-01-01T18:35:33Z

There's great value in being able to support lambdas.

koaning · 2019-01-14T17:11:09Z

@rogiervandergeer close this?

koaning mentioned this issue Jan 4, 2019

light-weight, param-driven implementation of parallelism using multiproc #99

Merged

koaning closed this as completed Jan 15, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thoughts on Parallelism #47

Thoughts on Parallelism #47

koaning commented Jan 2, 2018

rogiervandergeer commented Jan 3, 2018

jasondemorrow commented Dec 27, 2018

koaning commented Dec 27, 2018 via email •

edited

jasondemorrow commented Dec 28, 2018 via email •

edited

koaning commented Dec 28, 2018 •

edited

jasondemorrow commented Dec 28, 2018

jasondemorrow commented Jan 1, 2019

koaning commented Jan 1, 2019 •

edited

jasondemorrow commented Jan 1, 2019

koaning commented Jan 1, 2019

koaning commented Jan 14, 2019

Thoughts on Parallelism #47

Thoughts on Parallelism #47

Comments

koaning commented Jan 2, 2018

rogiervandergeer commented Jan 3, 2018

jasondemorrow commented Dec 27, 2018

koaning commented Dec 27, 2018 via email • edited

jasondemorrow commented Dec 28, 2018 via email • edited

koaning commented Dec 28, 2018 • edited

jasondemorrow commented Dec 28, 2018

jasondemorrow commented Jan 1, 2019

koaning commented Jan 1, 2019 • edited

jasondemorrow commented Jan 1, 2019

koaning commented Jan 1, 2019

koaning commented Jan 14, 2019

koaning commented Dec 27, 2018 via email •

edited

jasondemorrow commented Dec 28, 2018 via email •

edited

koaning commented Dec 28, 2018 •

edited

koaning commented Jan 1, 2019 •

edited