
Add validation_score as 1st CV fold score into rankings #1221

Merged: 11 commits into main from js_1115_validation on Sep 29, 2020

Conversation

jeremyliweishih (Collaborator) commented:

Fixes #1115.

@jeremyliweishih jeremyliweishih changed the title add validation_score as 1st CV fold score into rankings Add validation_score as 1st CV fold score into rankings Sep 24, 2020
codecov bot commented Sep 24, 2020

Codecov Report

Merging #1221 into main will increase coverage by 0.00%.
The diff coverage is 100.00%.


@@           Coverage Diff           @@
##             main    #1221   +/-   ##
=======================================
  Coverage   99.92%   99.92%           
=======================================
  Files         200      200           
  Lines       12365    12369    +4     
=======================================
+ Hits        12356    12360    +4     
  Misses          9        9           
Impacted Files Coverage Δ
evalml/automl/automl_search.py 99.58% <100.00%> (ø)
evalml/tests/automl_tests/test_automl.py 100.00% <100.00%> (ø)

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Last update 4387fe0...07e8e1d.

@jeremyliweishih jeremyliweishih marked this pull request as ready for review September 24, 2020 15:21
@jeremyliweishih jeremyliweishih self-assigned this Sep 24, 2020
bchen1116 (Contributor) commented:

If we're using the first CV fold's score as the validation score, shouldn't the reported score then be the average of the remaining folds, excluding this first fold's score?
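The two definitions being compared can be sketched with made-up fold scores (the numbers below are illustrative only, not data from this PR):

```python
import statistics

# Hypothetical per-fold cross-validation scores for one pipeline.
cv_scores = [0.91, 0.88, 0.90]

# What the PR implements: "score" averages every fold,
# while "validation_score" is simply the first fold's score.
score_all_folds = statistics.mean(cv_scores)
validation_score = cv_scores[0]

# The alternative raised here: average only the remaining folds,
# so the first (validation) fold is excluded from the reported score.
score_remaining_folds = statistics.mean(cv_scores[1:])

print(validation_score, score_all_folds, score_remaining_folds)
```

The PR keeps `score` as the mean over all folds and adds `validation_score` alongside it, rather than excluding the first fold from the average.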

freddyaboulton (Contributor) left a comment:

@jeremyliweishih Looks good to me! Are you planning on updating the docs/user guide? It might be worth explaining the difference between score and validation_score.

jeremyliweishih (Collaborator, Author) commented:

@freddyaboulton good idea, I'll take a look.

dsherry (Contributor) left a comment:

🚢

@@ -697,7 +697,8 @@ def _add_result(self, trained_pipeline, parameters, training_time, cv_data, cv_s
     "high_variance_cv": high_variance_cv,
     "training_time": training_time,
     "cv_data": cv_data,
-    "percent_better_than_baseline": percent_better
+    "percent_better_than_baseline": percent_better,
+    "validation_score": cv_scores[0]
Contributor commented:

Awesome, I didn't realize how simple this code change would be! 👍
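The diff above can be sketched in isolation. The function name and reduced signature below are hypothetical stand-ins for evalml's `_add_result`, which takes more parameters; only the dictionary keys mirror the diff:

```python
# Minimal sketch of the change: the per-pipeline results entry gains a
# "validation_score" key taken from the first CV fold's score.
def add_result_entry(cv_scores, cv_data, training_time, percent_better,
                     high_variance_cv):
    return {
        "high_variance_cv": high_variance_cv,
        "training_time": training_time,
        "cv_data": cv_data,
        "percent_better_than_baseline": percent_better,
        "validation_score": cv_scores[0],  # score of the 1st CV fold
    }

entry = add_result_entry(
    cv_scores=[0.91, 0.88, 0.90],
    cv_data=[{"score": 0.91}, {"score": 0.88}, {"score": 0.90}],
    training_time=1.2,
    percent_better=5.0,
    high_variance_cv=False,
)
```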

@@ -236,7 +236,7 @@
     "name": "python",
     "nbconvert_exporter": "python",
     "pygments_lexer": "ipython3",
-    "version": "3.8.2"
+    "version": "3.7.4"
Contributor commented:

Can delete

@@ -134,7 +134,7 @@
     "metadata": {},
     "source": [
     "## View Rankings\n",
-    "A summary of all the pipelines built can be returned as a pandas DataFrame which is sorted by score."
+    "A summary of all the pipelines built can be returned as a pandas DataFrame which is sorted by score. The score column contains the average score across all cross-validation folds while the validation_score column is computed from the first cross-validation fold."
Contributor commented:

👍
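The documented relationship between the two columns can be sketched with a toy rankings frame. The column names match the docs change; the data, construction, and the higher-is-better sort direction are illustrative assumptions, not evalml's actual internals:

```python
import pandas as pd

# Made-up per-pipeline fold scores, for illustration only.
fold_scores = {
    "Pipeline A": [0.91, 0.88, 0.90],
    "Pipeline B": [0.85, 0.93, 0.89],
}

rankings = pd.DataFrame(
    [
        {
            "pipeline_name": name,
            # score: the average across all cross-validation folds
            "score": sum(scores) / len(scores),
            # validation_score: the first cross-validation fold only
            "validation_score": scores[0],
        }
        for name, scores in fold_scores.items()
    ]
).sort_values("score", ascending=False).reset_index(drop=True)

print(rankings)
```

Note that a pipeline's `validation_score` can differ noticeably from its `score` when fold scores vary, which is exactly why exposing both columns is useful.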

@@ -75,14 +75,15 @@ def test_search_results(X_y_regression, X_y_binary, X_y_multi, automl_type):
     for score in all_objective_scores.values():
         assert score is not None
     assert automl.get_pipeline(pipeline_id).parameters == results['parameters']
+    assert results['validation_score'] == pd.Series([fold['score'] for fold in results['cv_data']])[0]
Contributor commented:

👍

bchen1116 (Contributor) left a comment:

LGTM

docs/source/release_notes.rst (comment resolved)
@jeremyliweishih jeremyliweishih merged commit e9cba15 into main Sep 29, 2020
@angela97lin angela97lin mentioned this pull request Sep 29, 2020
@freddyaboulton freddyaboulton deleted the js_1115_validation branch May 13, 2022 15:16
Successfully merging this pull request may close these issues:
Add validation column to rankings and full_rankings computed from 1st CV fold
4 participants