WIP Use score in tree hyperparameter notebook #503

Open
glemaitre wants to merge 1 commit into main

Conversation


@glemaitre (Collaborator) commented on Jan 6, 2022

This PR isolates the call to score in the hyperparameter notebook.
It is linked to the comment: #464 (comment)

In this PR, we should therefore address the concern of @ogrisel:

I don't see the point of measuring the scores only on the training set. Here we speak about hyper-parameter tuning, so it would be confusing to only display the training score. I think this notebook needs to be reworked to do a train-test split, and the plots should display both training and test errors, or neither.

Maybe each plot should be duplicated into 2 subplots: one with the prediction function displayed on top of a scatter plot of the training samples (with the training score in the title), and another with the same prediction function displayed on top of a scatter plot of the testing samples (with the testing score in the title).

And then we should comment on those scores to summarize the impact of the hyper-parameters in terms of the overfitting / underfitting trade-off.
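
As a rough illustration of that suggestion (not code from this PR), here is a minimal sketch using synthetic single-feature data in place of the notebook's dataset; the data generation, the split, and the subplot layout are assumptions:

import matplotlib.pyplot as plt
import numpy as np
import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor

# Synthetic stand-in for the notebook's single-feature regression data
# (an assumption; the real notebook loads its own dataset).
rng = np.random.RandomState(0)
data_reg = pd.DataFrame({"feature": rng.uniform(0, 10, size=200)})
data_reg["target"] = np.sin(data_reg["feature"]) + rng.normal(scale=0.3, size=200)

# Hold out a test set so that both training and testing scores can be shown.
data_train, data_test = train_test_split(data_reg, random_state=0)

max_depth = 2
tree_reg = DecisionTreeRegressor(max_depth=max_depth)
tree_reg.fit(data_train[["feature"]], data_train["target"])

# Evaluate the same prediction function on a grid to overlay on both panels.
grid = pd.DataFrame({"feature": np.linspace(0, 10, 300)})
predictions = tree_reg.predict(grid)

fig, axs = plt.subplots(ncols=2, figsize=(12, 5), sharey=True)
for ax, subset, name in zip(axs, (data_train, data_test), ("training", "testing")):
    score = tree_reg.score(subset[["feature"]], subset["target"])
    ax.scatter(subset["feature"], subset["target"], alpha=0.5)
    ax.plot(grid["feature"], predictions, color="tab:orange")
    ax.set_title(f"max_depth={max_depth}, {name} set\nR$^2$ = {score:.2f}")
plt.show()

With a shallow tree the two R² values typically stay close; as max_depth grows, the training score keeps climbing while the testing score stalls or drops, which is the overfitting / underfitting behaviour the comment asks to discuss.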

@lesteve changed the title from "Use score in tree hyperpatameter notebook" to "Use score in tree hyperparameter notebook" on Jan 6, 2022
@ogrisel changed the title from "Use score in tree hyperparameter notebook" to "WIP Use score in tree hyperparameter notebook" on Jan 7, 2022

@ogrisel (Collaborator) left a comment


I started to review this PR but I noticed that it requires blackification first.

Comment on lines +128 to +133
accuracy = tree_reg.score(data_reg[data_reg_columns], data_reg[target_reg_column])

_ = plt.title(
    f"Shallow regression tree with max-depth of {max_depth}"
    f"\n R$^2$ of the fit: {accuracy:.2f}"
)

Suggested change
- accuracy = tree_reg.score(data_reg[data_reg_columns], data_reg[target_reg_column])
- _ = plt.title(
-     f"Shallow regression tree with max-depth of {max_depth}"
-     f"\n R$^2$ of the fit: {accuracy:.2f}"
- )
+ r2 = tree_reg.score(data_reg[data_reg_columns], data_reg[target_reg_column])
+ _ = plt.title(
+     f"Shallow regression tree with max-depth of {max_depth}"
+     f"\n R$^2$ of the fit: {r2:.2f}"
+ )

Comment on lines +159 to +164
accuracy = tree_reg.score(data_reg[data_reg_columns], data_reg[target_reg_column])

_ = plt.title(
    f"Shallow regression tree with max-depth of {max_depth}"
    f"\n R$^2$ of the fit: {accuracy:.2f}"
)

Suggested change
- accuracy = tree_reg.score(data_reg[data_reg_columns], data_reg[target_reg_column])
- _ = plt.title(
-     f"Shallow regression tree with max-depth of {max_depth}"
-     f"\n R$^2$ of the fit: {accuracy:.2f}"
- )
+ r2 = tree_reg.score(data_reg[data_reg_columns], data_reg[target_reg_column])
+ _ = plt.title(
+     f"Shallow regression tree with max-depth of {max_depth}"
+     f"\n R$^2$ of the fit: {r2:.2f}"
+ )
