Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Remove n_estimators as a parameter from the random forest operator and other ensemble methods #119

Closed
rhiever opened this issue Mar 24, 2016 · 3 comments

Comments

@rhiever
Copy link
Contributor

rhiever commented Mar 24, 2016

There's really no point to optimizing the number of estimators in ensemble models: Basically, the more estimators they're provided, the better they perform. Let's remove n_estimators as an optimizable parameter for all ensemble models and instead default it to, say, 100 or 500 estimators.

@rhiever
Copy link
Contributor Author

rhiever commented Mar 27, 2016

The RF operator has had its n_estimators parameter removed, and it now defaults to 500 for n_estimators. The remaining ensemble operators still need to be reworked.

@minimumnz
Copy link

Increasing the estimators will increase performance, but I would think when you're trying a lot of different parameters it will increase the time for all those decision trees to run, and whether it wouldn't be better to increase n_estimators later. I guess if they don't perform well enough in the first instance they're not going to make it into the next generation. Really just curious on your thoughts.

@rhiever
Copy link
Contributor Author

rhiever commented Mar 27, 2016

I suppose the default could be n_estimators=100 and it should still work fine. I just want to make sure that RFs and other ensemble methods don't get skipped over simply because they didn't have enough estimators.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants