Require score_tree_interval to be set when seed is set for Decision Tree Algorithms #9372

Closed
exalate-issue-sync bot opened this issue May 12, 2023 · 2 comments

Comments

@exalate-issue-sync

When a user sets a seed when training a GBM or Random Forest model, we can assume they would like to be able to reproduce the model. If, however, they use early stopping with no score_tree_interval set, the model will not be reproducible.

Require score_tree_interval to be set whenever the seed is set, so that the user ends up with a reproducible model.
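A minimal sketch of the kind of check being requested, written as plain Python over a parameter dictionary. It is not H2O's actual implementation: the function name `reproducibility_warning` and the message text are hypothetical, and the defaults are assumptions (`seed = -1` meaning "not set", `stopping_rounds = 0` meaning early stopping is off, `score_tree_interval = 0` meaning no fixed scoring interval).

```python
from typing import Optional


def reproducibility_warning(params: dict) -> Optional[str]:
    """Return a warning message if the settings may not be reproducible.

    Hypothetical helper for illustration only. Assumed defaults:
    seed=-1 (unset), stopping_rounds=0 (no early stopping),
    score_tree_interval=0 (no fixed scoring interval).
    """
    seed = params.get("seed", -1)
    stopping_rounds = params.get("stopping_rounds", 0)
    score_tree_interval = params.get("score_tree_interval", 0)

    seed_is_set = seed != -1
    early_stopping_on = stopping_rounds > 0
    interval_is_set = score_tree_interval > 0

    # Only the combination described in the issue is flagged:
    # seed set + early stopping enabled + no score_tree_interval.
    if seed_is_set and early_stopping_on and not interval_is_set:
        return (
            "seed is set and early stopping is enabled, but "
            "score_tree_interval is not set; the model may not be reproducible."
        )
    return None


if __name__ == "__main__":
    print(reproducibility_warning({"seed": 1234, "stopping_rounds": 5}))
    print(reproducibility_warning(
        {"seed": 1234, "stopping_rounds": 5, "score_tree_interval": 10}
    ))
```

Returning a message instead of raising keeps the sketch usable for either a warning or a stricter check; a caller could pass the message to `warnings.warn` or raise `ValueError` with it.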

@exalate-issue-sync
Author

Michal Kurka commented: In this Jira we will introduce a warning; we can introduce a stricter check if the warning proves ineffective. See https://github.com//pull/5447#issuecomment-821259860

@h2o-ops
Collaborator

h2o-ops commented May 15, 2023

JIRA Issue Migration Info

Jira Issue: PUBDEV-6249
Assignee: Michal Kurka
Reporter: Megan Kurka
State: Resolved
Fix Version: 3.32.1.2
Attachments: N/A
Development PRs: Available

Linked PRs from JIRA

#5447
