New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cleaning takes too long time on multi-cores cpu #40
Comments
Drift_thresholder() has same problem. |
Hum... sounds very weird ! Because it takes only 2 sec on my computer (7 cores). Have you tried to set n_jobs = 1 and run again ? |
Thank you for reply. I think joblib or multiprocessing cause this problem, and trying to solve it. set n_jobs=1, seems OK set n_jobs=2, it dies. |
from http://pythonhosted.org/joblib/parallel.html#common-usage Problem solved. |
Yes this is what I was wondering. At the moment, MLBox does not support Windows but soon :) |
I've got same issue, where should I set n_jobs=1 ? |
Hello @DarquesM !
Otherwise, I will release soon a new version with reading and cleaning separate classes... |
Hello, thanks for reporting this issue. I will close it since this will be fixed in a next release (MLBox 0.7.1 probably) |
Cleaning takes 276s for house price dataset on intel E5-2683v3
As E5-2683 has more 14cores and 28threads.
I guess the problem may cause by n-job=-1 in here.
` if (self.verbose):
print("cleaning data ...")
I don't know how to fix it, may be add a n_jobs arguments for class Reader?
Looking for you response. Thank you.
The text was updated successfully, but these errors were encountered: