Chapter 6, page 203 #113

GermanCM · 2019-10-21T18:31:43Z

Dear professor Sebastian,

I think there is no need to retrain the best estimator with the train set after using 'GridSearchCV', since the 'GridSearchCV' class already implements, by default (with the 'refit' param = True by default), a model re-training on the whole train dataset with the best found hyperparameters (based on the defined metric).
So in your example in this page, we could implement directly:
clf = gs.best_estimator_
--> this line would not be necessary: clf.fit(X_train, y_train)
print('Test accuracy: %.3f' % clf.score(X_test, y_test))
Test accuracy: 0.974

I hope I explained this clearly.
Best regards and thanks for your excellent book.

Germán

rasbt · 2019-10-21T23:50:19Z

Hi Germán,

you are absolutely right. I think I listed this as two independent steps to make the general workflow/concept more clear (independent of scikit-learn). However, you can set refit=True to achieve the same effect. Computationally, it should make no difference in terms of how expensive it is to run the code. Thanks for the note though, I should mention this.

rasbt · 2019-10-22T00:10:38Z

Oh I see that the clf.fit(X_train, y_train) is completely redundant since it's refit=True by default as you correctly pointed out!

Edit: just clarified this in the notebook. Thanks!

GermanCM · 2020-01-11T22:02:20Z

Thanks dear Sebastian for including a note about this in your third edition, I will close the issue :)

GermanCM closed this as completed Jan 11, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Chapter 6, page 203 #113

Chapter 6, page 203 #113

GermanCM commented Oct 21, 2019 •

edited

Loading

rasbt commented Oct 21, 2019

rasbt commented Oct 22, 2019 •

edited

Loading

GermanCM commented Jan 11, 2020

Chapter 6, page 203 #113

Chapter 6, page 203 #113

Comments

GermanCM commented Oct 21, 2019 • edited Loading

rasbt commented Oct 21, 2019

rasbt commented Oct 22, 2019 • edited Loading

GermanCM commented Jan 11, 2020

GermanCM commented Oct 21, 2019 •

edited

Loading

rasbt commented Oct 22, 2019 •

edited

Loading