-
Notifications
You must be signed in to change notification settings - Fork 7
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Error in unserialize(socklist[[n]]) : error reading from connection #44
Comments
Hello and sorry you run into a problem. I probably cannot solve the error coming from parallel computing on Windows. But maybe I can help you with the single-threaded problem (no movement in error bar):
What do you observe? |
Thank you for the swift answer. I was able to solve the problem thanks to your ideas and some tinkering. Initially, my bg_x and x were the same, both being 27k x 140. I set x to be a subset of 2000 rows and bg_x to be 500 with parallel processing for 12 threads enabled with exact = FALSE. Processing finished after about 5 minutes. Will the fact that I am now using a smaller dataset for x and bg_x change my results relative to using my full dataset for x and bg_x? I am able to use the full dataset for x and bg_x with treeSHAP. |
Sweet! Thanks for testing. With 500 rows, your background data is still very large, but 5 minutes is quite acceptable. Using even larger X and bg_X will change the result, but only slightly. I usually decompose between 1000 and 2000 predictions, even with TreeSHAP. TreeSHAP is indeed magnitudes faster than KernelSHAP but can only be used for tree-based methods, while KernelSHAP works for all model classes. For trees, I would not use KernelSHAP in practice. By default, in your case, |
Since I'm doing research, a longer processing time is acceptable but I suppose for production TreeSHAP would be preferred. Thanks again! |
I am dealing with a large dataset (27000 x 140) and when I first try without parallel computation of kernalSHAP, there is no movement in the progress bar (it stays at 0). When I turn on parallel processing using what you have described on the home page of your repo, I get the error:
Error in unserialize(socklist[[n]]) : error reading from connection
What should I do? I am using windows, core i7 12700k.
The text was updated successfully, but these errors were encountered: