Join GitHub today
GitHub is home to over 50 million developers working together to host and review code, manage projects, and build software together.
Sign upRandom Forest regressor producing really bad results #493
Comments
|
This is due to the |
|
Improve the Model parameters with proper tuning and use the n-estimator (No of trees as per Python ) more than 500. |
|
Is there a status on this being resolved? |
|
@sucheta-jawalkar This has been resolved (it was not a bug) -- the default number of trees in scikit-learn is very small (10) and since we are trying to be scikit-learn compatible, we have chosen to also use a default of 10 trees. However, in order to get good results, you will need to increase that number from the default to something much larger (e.g. 100, 500, 1000). |
There is an issue with the performance of the Random Forest regressor. I have found that the results are very bad compared to GBM and also to the randomForest R package.
Another dataset: