Parallel Differential Evolution based on pools #5054

pavelponomarev · 2015-07-15T16:33:18Z

The Differential Evolution algorithm, modified for parallel execution. This is beneficial for computationally expensive objective functions. MPI and joblib pools, taken from the emcee project, are also provided. The API is modified to fix legacy problems and to take full advantage of the particulars of population-based optimization algorithms. Solves #4864.


dlax · 2015-07-15T20:22:11Z

A few remarks following a quick review of your commits:

- When you introduce backwards-incompatible changes, you need to add some deprecation warnings.
- There is no test related to your changes; we need some.
- There are some commits at the end that do not appear clearly related to the parallelization of DE; these should be submitted as a separate PR.
- The commit history is quite messy. It would ease reviewing if you could rewrite the history by isolating changes into atomic commits and avoid changing things back and forth between commits.
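To illustrate the first remark, here is a minimal sketch of how a removed keyword could be kept for one release cycle with a deprecation warning. The function name `solve_sketch` and its signature are hypothetical stand-ins, not the actual solver API; only the `maxfun` parameter name comes from the commits in this PR.

```python
import warnings


def solve_sketch(func, bounds, maxiter=1000, maxfun=None):
    """Hypothetical stand-in for a solver whose `maxfun` keyword is being removed."""
    if maxfun is not None:
        # Keep accepting the old keyword, but tell the caller it is going away.
        warnings.warn(
            "`maxfun` is deprecated and is ignored.",
            DeprecationWarning,
            stacklevel=2,
        )
    # A real solver would run the optimization here; this sketch only
    # evaluates the objective at the lower bounds.
    return func([low for low, _ in bounds])


# Calling with the old keyword still works, but emits a warning.
with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    result = solve_sketch(lambda x: sum(x), [(0.0, 1.0), (2.0, 3.0)], maxfun=10)
```

A test for such a change would typically assert that the warning is raised, which also addresses the second remark.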

pavelponomarev · 2015-07-16T09:26:01Z

Yes.
Before I rewrite the code, please check the first and second commits. Is this the right way to include additional helper modules (pools) in the directory tree of the project? Is an additional `pools` folder acceptable? Is the usage of the file pools/__init__.py correct?

pv · 2015-07-16T10:59:31Z

To the technical parts of adding the pools: (i) having the same name for the module and the classes causes problems; (ii) the pool modules should be private, i.e., rename `Spool.py` -> `_spool.py`; (iii) in addition to adding the pools to `__all__` in `__init__.py`, you need to import them, via `from ._spool import SPool`; (iv) nitpick: the classes should be named `SPool`, `MPIPool`, `JLPool` (maybe `JoblibPool`) to follow proper camelcase.

The bikeshedding question is whether `scipy.optimize.pools` is a good place for them. Perhaps not, because they could also be useful for things other than `scipy.optimize`. There are some other alternatives, such as putting them in `scipy.misc`, or putting them in `scipy._lib` and importing from there to the `scipy.optimize` top level (and similarly in other modules using them).
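Points (ii) and (iii) can be sketched as follows. This is only an illustration: it builds a throwaway package in a temporary directory, and the package name `pools_demo` and the body of `SPool` are assumptions made for the example, not the code from this PR.

```python
import importlib
import sys
import tempfile
import textwrap
from pathlib import Path

# Build a throwaway package (hypothetical name) with a private module.
root = Path(tempfile.mkdtemp())
pkg = root / "pools_demo"
pkg.mkdir()

# (ii) The implementation lives in a private module, `_spool.py`.
(pkg / "_spool.py").write_text(textwrap.dedent('''
    class SPool:
        """Serial pool: same `map` interface as a parallel pool."""

        def map(self, func, iterable):
            return list(map(func, iterable))
'''))

# (iii) `__init__.py` both imports the class and lists it in `__all__`.
(pkg / "__init__.py").write_text(textwrap.dedent('''
    from ._spool import SPool

    __all__ = ["SPool"]
'''))

sys.path.insert(0, str(root))
importlib.invalidate_caches()
pools_demo = importlib.import_module("pools_demo")

# The class is reachable from the public package, not only the private module.
values = pools_demo.SPool().map(abs, [-1, 2, -3])
```

With this layout users import `SPool` from the package itself, and the private module can be renamed later without breaking them.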

andyfaff · 2015-07-17T08:41:34Z

I think that the introduction of pool code is going to complicate this PR. Presumably all the pool objects have a map method. A default pool map for testing parallelisation could be from the multiprocessing module. Users can supply their own pool objects instead if they want to.
It's not necessary to introduce these pools to scipy to continue work on this PR at this point.
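A minimal sketch of that idea, assuming the only requirement on a pool is a `map` method. The helper name `evaluate_population` and the toy objective are hypothetical; a thread-backed pool from the standard library is used here only because it has the same `map` interface as a `multiprocessing` process pool and is easy to run anywhere.

```python
from multiprocessing.pool import ThreadPool


def sum_of_squares(x):
    """Toy objective standing in for an expensive function."""
    return sum(v * v for v in x)


def evaluate_population(func, population, pool=None):
    """Evaluate every individual, using `pool.map` if a pool is supplied."""
    mapper = map if pool is None else pool.map
    return list(mapper(func, population))


population = [[0.0, 0.0], [1.0, 2.0], [3.0, -1.0]]

# Default: plain serial evaluation with the builtin map.
serial = evaluate_population(sum_of_squares, population)

# User-supplied pool: anything exposing a compatible `map` method.
with ThreadPool(2) as pool:
    parallel = evaluate_population(sum_of_squares, population, pool=pool)
```

Because the solver only calls `map`, users could pass an MPI or joblib-based pool of their own without those pools living in scipy.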

pavelponomarev · 2015-07-17T13:10:08Z

@andyfaff , agree.
Then there should be a serial pool, which is a wrapper over normal serial execution of the objective function, and a parallel pool based on multiprocessing with minimal functionality. The proper place for these two pools is then in scipy.misc.
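Roughly, the two pools could look like the sketch below. The names `SerialPool` and `make_pool` are hypothetical, and the parallel case simply hands back a standard `multiprocessing.Pool` rather than a custom class.

```python
import multiprocessing


class SerialPool:
    """Runs the objective in the calling process, with a pool-like interface."""

    def map(self, func, iterable):
        return list(map(func, iterable))

    def close(self):
        # Nothing to release; present only to mirror multiprocessing.Pool.
        pass

    def join(self):
        pass


def make_pool(workers=1):
    """Return a SerialPool for one worker, otherwise a multiprocessing.Pool."""
    if workers <= 1:
        return SerialPool()
    return multiprocessing.Pool(processes=workers)


def objective(x):
    """Toy objective; must be defined at module level to be picklable."""
    return (x - 2.0) ** 2


pool = make_pool(workers=1)
energies = pool.map(objective, [0.0, 1.0, 2.0])
pool.close()
pool.join()
```

Calling code stays the same for both pools; with `workers > 1` the objective must be picklable, and on some platforms the call needs to sit under an `if __name__ == "__main__":` guard.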

rgommers · 2015-08-11T19:10:06Z

To whoever looks at this PR: the main discussion on how the API for parallelization should look is in gh-4864. This PR cannot be merged until a decision is made there.

The quasi-aggressive execution of DE in parallel is enabled here. The whole population is split into subpopulations of length pool.poolsize(), which are evaluated in parallel. This significantly reduces optimization time for computationally expensive objective functions. Solves scipy#5054
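The chunked evaluation described in that commit message can be sketched as below. This is an illustration under assumptions, not the code from the commit: the function name is hypothetical, `chunk_size` stands in for `pool.poolsize()`, the builtin `map` stands in for `pool.map`, and the generation of trial vectors is left out entirely.

```python
def sum_of_squares(x):
    """Toy objective standing in for an expensive function."""
    return sum(v * v for v in x)


def evaluate_quasi_aggressive(func, population, chunk_size, map_func=map):
    """Evaluate `population` chunk by chunk, updating the best individual
    after each chunk instead of after each single evaluation."""
    energies = [float("inf")] * len(population)
    best_after_chunk = []
    for start in range(0, len(population), chunk_size):
        chunk = population[start:start + chunk_size]
        # One parallel step: every member of the chunk is evaluated at once.
        results = list(map_func(func, chunk))
        energies[start:start + len(results)] = results
        # The best individual is refreshed only once the chunk is done.
        best = min(range(len(energies)), key=energies.__getitem__)
        best_after_chunk.append(best)
    return energies, best_after_chunk


population = [[3.0], [1.0], [2.0], [0.0], [4.0]]
energies, best_after_chunk = evaluate_quasi_aggressive(
    sum_of_squares, population, chunk_size=2
)
```

With a chunk size of one this reduces to updating the best after every evaluation; with a chunk size equal to the population size the best is updated once per generation.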

pavelponomarev · 2015-08-11T19:45:24Z

This PR is broken to several smaller PRs. Follow here #5141.

rgommers · 2015-08-11T20:07:48Z

Thanks @pavelponomarev

pavelponomarev added 10 commits June 29, 2015 12:58

ENH: Added pools for parallelization.

a8aeae2

ENH: changed the call signature for objective function to use pool.map

a0ee7ad

ENH: Added parallelization based on pools. The mutation strategy is q…

44d6618

…uasi-aggressive which means updating the best individual after subpopulation is evaluated. The size of the subpopulation is the number of parallel workers.

MAINT: DOC: fixed documentation of pools.

e9d0ebe

DOC: updated docstrings

5a3e95b

DOC: removed duplicate docstrings

88efe5d

MAINT: DOC: API: removed obsolete parameter maxfun

d231a8d

DOC: API: renamed variable popsize to be less misleading scipy#5046

faf386e

API: ENH: DOC: changed the callback function to expose the whole popu…

92a507a

…lation. Required for qualitative assessment of the convergence and for setting custom stopping criteria

API: Changed default value to polish=False, as polishing does not bel…

c67c997

…ong to the algorithm and it is applicable and feasible to a limited class of objective functions.

dlax added the scipy.optimize and needs-work labels Jul 15, 2015

pavelponomarev mentioned this pull request Aug 11, 2015

ENH: Parallel differential evolution #5141

Closed

pavelponomarev closed this Aug 11, 2015
