
Could you code an archipelago to use multithreading when it initializes solutions? #170

Closed
bsugerman opened this issue May 4, 2018 · 6 comments

Comments

@bsugerman

bsugerman commented May 4, 2018

I have noticed that while archi.evolve() uses multithreading to evolve as many islands as possible in parallel, the archipelago class does not seem to take advantage of multithreading when it generates the initial candidate solutions within the given bounds at construction time.

I have defined my own problem CM(x,bounds) which happens to run fairly slowly because it is solving a huge problem. I initialize an archipelago as follows:

 es = pg.algorithm(pg.de1220(gen=20, variant_adptv=2, xtol=0.01))
 archi = pg.archipelago(algo=es, pop_size=50, prob=pg.problem(CM(x, bounds)), n=8)

I notice that only one processor is busy (for about a minute in my case) while this archipelago is being set up. But when I run archi.evolve(), all my processors are busy. It seems to me that each island should initialize its own candidate solutions on its own processor, just as happens when the populations are being evolved.

Is that a feature that could be added?

@bluescarni
Member

This seems like a duplicate of #135. It is a feature request we receive fairly regularly, and we plan to add the capability eventually.

@MikolajMizera

MikolajMizera commented May 9, 2018

@bsugerman here is my solution using ipyparallel:

import ipyparallel as ipp
import pygmo as pg

pop_size = 50
n_islands = 8

def pop_init(pop_size, x, bounds):
    prob_def = CM(x, bounds)
    prob = pg.problem(prob_def)
    return pg.population(prob, pop_size)

if __name__ == '__main__':
    rc = ipp.Client()
    lview = rc.load_balanced_view()
    # build one population per island in parallel on the ipengines
    populations = list(lview.map(pop_init, [pop_size]*n_islands,
                                 [x]*n_islands, [bounds]*n_islands))

    udi = pg.ipyparallel_island()
    es = pg.algorithm(pg.de1220(gen=20, variant_adptv=2, xtol=0.01))
    archi = pg.archipelago()  # start from an empty archipelago
    for pop in populations:
        archi.push_back(algo=es, pop=pop, udi=udi)
    archi.evolve(20)

Just start ipcluster with the desired number of ipengines, and each island will initialize on a different ipengine (a separate process).

@bsugerman
Author

@MikolajMizera, thanks for that option. I have been able to do the same thing with multiprocessing.Pool, and the code needed is even shorter. However, I hope the good folks maintaining pygmo can add this feature into their overall structure, since they already have archipelagos running on multiple processors.
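For reference, the multiprocessing.Pool pattern being described might look roughly like this. This is a minimal sketch, not the poster's actual code: `expensive_pop_init` is a hypothetical stand-in for building a pg.population for the slow CM problem (constructing a population evaluates every initial candidate, which is the expensive part), so the sketch runs with the standard library alone.

```python
from multiprocessing import Pool

def expensive_pop_init(seed):
    # Hypothetical stand-in for the slow step: in the real code this
    # would construct a pg.population, evaluating pop_size candidate
    # solutions of the costly CM problem.
    import random
    rng = random.Random(seed)
    return [rng.uniform(-5.0, 5.0) for _ in range(50)]

if __name__ == "__main__":
    n_islands = 8
    # one worker per island; each initialisation runs in its own process
    with Pool(processes=n_islands) as pool:
        populations = pool.map(expensive_pop_init, range(n_islands))
    assert len(populations) == n_islands
```

The resulting `populations` list can then be pushed onto an empty archipelago one island at a time, exactly as in the ipyparallel version above.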

@bluescarni
Member

The main reason why we haven't done this yet is that there's not much overlap with the existing parallel computing infrastructure we have in pagmo, and it seems like we would end up having to maintain dual codepaths for parallel initialisation and evolution.

(A secondary reason is that before tackling other parallel-related tasks, we first need to finish up the work on migration/topology, which is likely to induce further constraints and possibly internal API changes in the portions of the code dealing with parallel computing)

@Argysh

Argysh commented May 6, 2019

This is how I do it with multiprocessing instead of ipyparallel:

import os
import sys
import multiprocessing as mp

import pygmo as pg

def pop_init(pop_size):
    prob_def = pg.rosenbrock(1000)  # your own or a standard udp
    prob = pg.problem(prob_def)
    return pg.population(prob, pop_size)

if __name__ == "__main__":
    # find the number of threads and use nThreads*usage of them
    def getThreads():
        if sys.platform == 'win32':
            return int(os.environ['NUMBER_OF_PROCESSORS'])
        return int(os.popen('grep -c cores /proc/cpuinfo').read())
    # nIslands, usage, nPop and algorithm are assumed to be defined by you
    nWorker = min(int(nIslands), int(getThreads()*usage))

    # pre-processing
    [. . .]

    # initialise your pops in parallel in an mp pool
    pool = mp.Pool(nWorker)
    populations = pool.map(pop_init, [nPop]*nIslands)

    # add them to new islands in the otherwise empty archipelago
    archipelago = pg.archipelago()
    for pop in populations:
        archipelago.push_back(algo=algorithm, pop=pop, udi=pg.mp_island())
    archipelago.wait()

    # post-processing
    [. . .]

On Linux this will only work with Python > 3.4.

@bluescarni
Member

The batch fitness evaluation framework (which includes parallel initialisation for populations/islands/archipelagos) has now been completed on the Python side with the release of pagmo 2.13. I will close this report.
