-
Notifications
You must be signed in to change notification settings - Fork 86
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Best pipeline trained by AutoMLSearch gets different score than cloned version trained on X_train
#2844
Comments
@angela97lin Does it happen for regression problems? Could it be threshold tuning? |
@freddyaboulton Nope, it doesn't happen for regression problems so I think you're correct. Setting Next question then is: Do we want this type of behavior / is this intentional? I can see why we would want this, but question if it's a little confusing for users. Only the best pipeline is able to trained with the threshold, right?
TLDR: the behavior seems a bit inconsistent and can be confusing or lead to misunderstandings. Is this what we're okay with, or should we clarify this somehow? |
@angela97lin Great points. I think this behavior is intentional. I think it was introduced in these PRs: #1943 Made it so that we tuned thresholds by default
Every pipeline during automl search is trained with threshold tuning because of the default values set in #1943 and #2320. So the problem is that AutoMLSearch is tuning thresholds by default but there's no way for users to know that/recreate that outside of search if they wanted to "export" the pipelines out of AutoMLSearch. Would one of the two fix the problem:
|
Discussion looks good. Whoever picks this up, we need to decide on what to do next! Questions from refinement:
|
Repro:
This score (0.933) is different from:
This feels like a bug to me. If we set
train_best_pipeline
toFalse
and thenfit
onX_train
, andy_train
, we get the same result as the clone:Is it possible that the data we're using to train the best pipeline is not the whole X_train/y_train? Or some data transformation is happening?
The text was updated successfully, but these errors were encountered: