Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Implement Bootstrapping #25

Closed
mbillingr opened this issue Jun 25, 2018 · 0 comments · Fixed by #28
Closed

Implement Bootstrapping #25

mbillingr opened this issue Jun 25, 2018 · 0 comments · Fixed by #28
Labels
enhancement New feature or request
Milestone

Comments

@mbillingr
Copy link
Owner

mbillingr commented Jun 25, 2018

Add an option to train on bootstrap resamples of the training data.

Pros:

  • Similar to the original Random Forest
  • Can speed up fitting by using less samples

Cons:

  • Requires either more copying of data or another indirection layer to acess data

Open questions:

Resolved questions:

  • Who should do the reampling - the forest builder before passing the data to the tree builder, or the tree builder after it got passed the data? I don't think it matters for the current sequential implementation, but it may be relevant for parallelization in the future. The tree builder should do the resampling.
@mbillingr mbillingr added the enhancement New feature or request label Jun 25, 2018
This was referenced Jun 25, 2018
@mbillingr mbillingr added this to the Version 0.1.0 milestone Jun 25, 2018
@mbillingr mbillingr added this to In progress in Version 0.1.0 via automation Jun 25, 2018
This was referenced Jun 25, 2018
Version 0.1.0 automation moved this from In progress to Closed Jun 27, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
Version 0.1.0
  
Closed
Development

Successfully merging a pull request may close this issue.

1 participant