-
-
Notifications
You must be signed in to change notification settings - Fork 192
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
"case.weights" take very long #54
Comments
No this is not as expected. I can reproduce the issue on Windows but not on Mac or Linux. I will check the code for some Windows-specific problems. |
The problem seems to be Any idea how to solve this instead of waiting for a newer gcc? |
Using boost::random::discrete_distribution as a replacement helps:
after:
|
Thanks! However I'm reluctant to merge it in the master because of the Boost dependency... ;) |
That is a temporary simple solution while waiting for a newer gcc. I didn't do extensive testing, but a quick check showed very similar model performance (see below). That should make it at least feasible for me to run some prototyping with ranger on my windows laptop, as I frequently need to use weights. And the real dependency is only for the windows R version, which is already a neglected child with no multithreading :)
The gcc's It's your choice in the end. I'm just telling you what I know. I'm glad I've noticed this discussion, since my initial observations didn't agree with the claims of ranger being very fast, so I didn't even try it on a linux server. |
I just released a version (0.4.2) based on the new toolchain. As reported, the problem is solved there. In addition, multithreading is finally working! This version can also be installed on the current R version by using the binary, see https://github.com/imbs-hl/ranger/releases. I hope it's solved with R-3.3.0! |
This is brilliant, thank you very much for these investigations. Even on the current R version, the issue seems to be fixed with ranger 0.4.2. Wow! |
The factory fresh option of using case weights in drawing the bootstrap sample is very important in practice. However I recognized an explosion in runtime when using it. In below example, time consumption with case weights is about ten times as large as without. Is this as expected?
The text was updated successfully, but these errors were encountered: