
Conversation

@rabsr
Contributor

@rabsr rabsr commented Oct 26, 2020

This change adds an option, scoring_functions, that lets users compute a list of metrics for each pipeline. The calculated metrics are available through the cv_results_ attribute.
This gives users more insight into each executed pipeline.

Usage:

import autosklearn.classification
from sklearn.datasets import load_breast_cancer

X_train, y_train = load_breast_cancer(return_X_y=True)

automl = autosklearn.classification.AutoSklearnClassifier(
    time_left_for_this_task=30,
    per_run_time_limit=10,
    scoring_functions=['accuracy', 'balanced_accuracy', 'log_loss'],
)
automl.fit(X_train, y_train)
print(automl.cv_results_)
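The per-pipeline metrics can then be post-processed like any scikit-learn-style cv_results_ mapping. The sketch below uses a hand-written stand-in dict (the key names such as metric_balanced_accuracy are illustrative assumptions, not the exact auto-sklearn schema) to show one way of ranking pipelines by an auxiliary metric:

```python
# Stand-in for cv_results_: one list entry per evaluated pipeline.
# Key names are assumptions for illustration only.
cv_results = {
    "mean_test_score": [0.91, 0.88, 0.93],
    "metric_balanced_accuracy": [0.90, 0.85, 0.92],
    "metric_log_loss": [0.30, 0.41, 0.25],
}

def rank_pipelines(results, metric, higher_is_better=True):
    """Return pipeline indices ordered best-first by the given metric."""
    scores = results[metric]
    return sorted(range(len(scores)),
                  key=lambda i: scores[i],
                  reverse=higher_is_better)

# Best pipeline by balanced accuracy (higher is better)
print(rank_pipelines(cv_results, "metric_balanced_accuracy"))  # [2, 0, 1]
# Best pipeline by log loss (lower is better)
print(rank_pipelines(cv_results, "metric_log_loss", higher_is_better=False))
```

The same pattern works for any metric name present in the mapping; only the higher_is_better flag needs to match the metric's direction.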

@mfeurer
Contributor

mfeurer commented Nov 10, 2020

This implements #981 I guess?

Contributor

@mfeurer mfeurer left a comment


Thanks a lot for the PR! I left some initial comments you could address, and then it would be good to incrementally work on the following things:

  1. rebase on the development branch to fix the current CI issues
  2. add an example about tracking auxiliary metrics
  3. think about additional unit tests to test this new functionality

Looking forward to the changes

@rabsr
Contributor Author

rabsr commented Nov 10, 2020

@mfeurer

This implements #981 I guess?

Yeah. This PR implements #981

Made changes based on the review comments and rebased onto the latest development branch.

Todo

  • additional unit tests to test this new functionality
  • add an example about tracking auxiliary metrics

I will be working on these items.

@mfeurer
Contributor

mfeurer commented Nov 11, 2020

This looks great, looking forward to the example and unit tests.

Contributor

@mfeurer mfeurer left a comment


A few updates from my side:

  • I cannot reproduce the failure for the pandas example locally, so I think everything is fine here. Nevertheless, I restarted the tests on travis-ci.
  • The code looks good and I would merge it as is. Therefore, you could now move on to adding unit tests (I've been thinking about one for the score-calculation function, and about making the tests in test_train_evaluator stricter by checking the entries in the additional run info; please let me know if you have any questions about that) and an example of how to use this new feature.

I'm really looking forward to this new feature as I actually also just found a good use case within our research :)
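As a rough illustration of the stricter check suggested above (verifying the entries in the additional run info), a unit test could assert that every requested scoring function shows up in the run info. The evaluator below is a stub; names such as additional_run_info mirror the discussion, not the verified auto-sklearn API:

```python
# Stub standing in for an evaluator run; the return shape is an
# assumption modeled on the "additional run info" mentioned in review.
def fake_evaluate(scoring_functions):
    """Return a dummy loss plus one run-info entry per requested metric."""
    return {
        "loss": 0.1,
        "additional_run_info": {m: 0.0 for m in scoring_functions},
    }

def test_additional_run_info_has_all_metrics():
    metrics = ["accuracy", "balanced_accuracy", "log_loss"]
    info = fake_evaluate(metrics)["additional_run_info"]
    # Every requested scoring function must appear, and nothing extra.
    assert set(info) == set(metrics)

test_additional_run_info_has_all_metrics()
```

A real test in test_train_evaluator would call the actual evaluator instead of the stub, but the assertion pattern stays the same.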

@codecov

codecov bot commented Nov 16, 2020

Codecov Report

Merging #985 (d281763) into development (3743b25) will increase coverage by 0.03%.
The diff coverage is 84.37%.


@@               Coverage Diff               @@
##           development     #985      +/-   ##
===============================================
+ Coverage        85.34%   85.37%   +0.03%     
===============================================
  Files              125      125              
  Lines             9858     9881      +23     
===============================================
+ Hits              8413     8436      +23     
  Misses            1445     1445              
Impacted Files                                  Coverage Δ
autosklearn/ensemble_builder.py                 76.43% <ø> (ø)
autosklearn/ensembles/ensemble_selection.py     68.42% <ø> (ø)
autosklearn/evaluation/test_evaluator.py        90.90% <ø> (ø)
autosklearn/evaluation/train_evaluator.py       72.74% <50.00%> (ø)
autosklearn/evaluation/abstract_evaluator.py    88.42% <80.00%> (ø)
autosklearn/automl.py                           84.68% <85.00%> (+0.01%) ⬆️
autosklearn/estimators.py                       93.02% <100.00%> (+0.10%) ⬆️
autosklearn/evaluation/__init__.py              82.62% <100.00%> (ø)
autosklearn/smbo.py                             82.29% <100.00%> (+0.06%) ⬆️
autosklearn/util/logging_.py                    94.30% <0.00%> (-1.63%) ⬇️
... and 4 more

Legend
Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@rabsr
Contributor Author

rabsr commented Nov 18, 2020

@mfeurer Test cases and example will take some time. Currently I am on vacation for 2 weeks. I will start working on test cases and example once I am back.

I actually also just found a good use case within our research

If possible, can you highlight what is the use case you are looking forward to?

@mfeurer
Contributor

mfeurer commented Nov 18, 2020

Thanks for the heads up, @rabsr. Enjoy your vacation!

Contributor

@mfeurer mfeurer left a comment


Thanks a lot for the tests! I just made a PR which makes the examples work. How about I merge this one and you start working on a new PR updating the example for using different metrics?

If possible, can you highlight what is the use case you are looking forward to?

I'm planning to use it to generate meta-data for the Auto-sklearn 2.0 project.

Co-authored-by: Matthias Feurer <lists@matthiasfeurer.de>
@rabsr
Copy link
Contributor Author

rabsr commented Dec 2, 2020

How about I merge this one and you start working on a new PR updating the example for using different metrics?

@mfeurer That works for me. You can merge this PR. I will open a new PR for the example in a couple of days.

@mfeurer mfeurer merged commit eab0ddb into automl:development Dec 2, 2020
@rabsr rabsr mentioned this pull request Dec 29, 2020