-
Notifications
You must be signed in to change notification settings - Fork 1.6k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Look into joblib for parallelizing pipeline evaluations #95
Comments
Isn't DEAP supposed to provide some support for parallelizing the evaluation? Or does it make copies of the data set? |
Not sure. I never looked into it much, TBH. |
I dug around and found this. It seems all we SHOULD need to do is replace the toolbox's map function with one that interfaces with some parallel lib. I've run it once with multiprocessing but get the following:
|
Try moving the |
Strange; I'm getting a slightly different error (might be because I'm running
|
How are you running it? |
No; changed the one import to |
@bartleyn how are you structuring your code, i.e. the addition of the |
I ended up getting that error as well. I bet you that I have similarly structured code. I've tried a few different iterations on where I instantiate the pool object (inside the class, etc). I've also tried messing around with the TPOT object's dictionary to see if we can ignore pickling the pool object (using the get/set_state functions), but I think I'm barking up the wrong tree, as the TPOT object isn't the object getting pickled. |
@magsol: You won't need to change the export_utils import if you call TPOT as
|
@rhiever This is just plain bizarre:
How is it not recognizing the |
Change the code in from ._version import __version__ |
That's what it currently is; if I remove the prepended
EDIT Nevermind; stupid mistake on my part. Ignore this post. |
Do you have the latest source? Looks like you have TPOT 0.2.7. |
Yep; upgraded. Am now getting the same sequence of errors as @bartleyn (with |
phew Progress! Likely need to move |
…allelizing the pipeline evaluations.
First-pass PR incoming. Warning: it's nasty. |
Can't believe how easy that looks when it's all said and done, @magsol! Thank you for figuring that out. I added some comments on the PR directly. |
Think there Ipyparalle may solve the issue (add the possibility to work on remote server): |
joblib should, in theory, make it possible to evaluate the entire TPOT GP population in parallel without making copies of the data set. We should look into using joblib to do this and verify whether this is true.
The text was updated successfully, but these errors were encountered: