Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Slow for classification on many classes #79

Open
rec3141 opened this issue Jun 16, 2016 · 2 comments
Open

Slow for classification on many classes #79

rec3141 opened this issue Jun 16, 2016 · 2 comments
Labels
long-term runtime Runtime-related issues

Comments

@rec3141
Copy link

rec3141 commented Jun 16, 2016

Hi, I'm trying to figure out why ranger is taking longer than the equivalent command in randomForest in R. Are there options that are slowing it down? Thanks for any help.

dim(data_simple[training,])
[1] 4104   95

length(unique(as.factor(k.row)))  
[1] 704

num.trees <- 100000

#completes in < 12 hours on 1 thread
rf.out <- randomForest(x=data_simple[training,], y=as.factor(k.row), importance=TRUE, proximity=TRUE, ntree=num.trees, keep.forest=T, do.trace=100)

#predicted completion in 39 hours on 2 threads
ranger.out <- ranger(data=data.frame("classes"=as.factor(k.row),data_simple[training,]),importance="impurity", num.trees=num.trees, num.threads=2, write.forest=T, verbose=T, dependent.variable.name="classes", classification=T)
@mnwright
Copy link
Member

There seems to be a problem for classification on many classes. I haven't yet optimised for that case, so it's likely there is a simple bottleneck somewhere. I will do some profiling next week.

@rec3141
Copy link
Author

rec3141 commented Jun 17, 2016

Thanks, I'll keep an eye out.
On Jun 16, 2016 10:10 PM, "Marvin N. Wright" notifications@github.com
wrote:

There seems to be a problem for classification on many classes. I haven't
yet optimised for that case, so it's likely there is a simple bottleneck
somewhere. I will do some profiling next week.


You are receiving this because you authored the thread.
Reply to this email directly, view it on GitHub
#79 (comment), or mute
the thread
https://github.com/notifications/unsubscribe/ABDUI-abaMndH0XoAXCSFKOpGdAt8qoOks5qMjpngaJpZM4I319k
.

@mnwright mnwright changed the title slower than randomForest? Slow for classification on many classes Aug 22, 2016
@mnwright mnwright added runtime Runtime-related issues long-term labels Sep 25, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
long-term runtime Runtime-related issues
Projects
None yet
Development

No branches or pull requests

2 participants