
[Custom Metrics] Enabled support for logging of numerical metrics #5389

Merged: 12 commits into mlflow:master on Feb 25, 2022

Conversation

@MarkYHZhang (Contributor) commented Feb 18, 2022

Signed-off-by: Mark Zhang <mark.zhang@databricks.com>

What changes are proposed in this pull request?

Added support for passing custom metric functions into mlflow.evaluate. This PR is part of a series of PRs for custom metrics support; specifically, it enables logging of numerical metrics generated by user-defined custom metric functions into MLflow.
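To make the intended usage concrete, here is a minimal, hypothetical sketch of how a custom metric function might be passed to mlflow.evaluate once this feature lands. It assumes the custom_metrics parameter added in this PR accepts a list of functions with the signature fn(eval_df, builtin_metrics) returning a dict that maps metric names to numeric values (as suggested by the commit messages in this conversation); the exact parameter names and signature may differ in the merged implementation.

```python
import mlflow
import pandas as pd
from sklearn.datasets import make_regression
from sklearn.linear_model import LinearRegression


def squared_diff_plus_one(eval_df, builtin_metrics):
    # Hypothetical custom metric function: assumes eval_df carries
    # "prediction" and "target" columns and that the returned dict of
    # metric name -> numeric value is what this PR logs to MLflow.
    return {
        "squared_diff_plus_one": float(
            ((eval_df["prediction"] - eval_df["target"]) ** 2 + 1).mean()
        )
    }


# Train a small regressor and build an evaluation DataFrame.
X, y = make_regression(n_samples=200, n_features=4, random_state=42)
model = LinearRegression().fit(X, y)
eval_data = pd.DataFrame(X, columns=[f"feature_{i}" for i in range(4)])
eval_data["target"] = y

with mlflow.start_run():
    model_info = mlflow.sklearn.log_model(model, "model")
    result = mlflow.evaluate(
        model=model_info.model_uri,
        data=eval_data,
        targets="target",
        model_type="regressor",
        evaluators="default",
        custom_metrics=[squared_diff_plus_one],  # parameter added in this PR
    )
    # Custom numeric metrics are expected to appear alongside built-in ones.
    print(result.metrics)
```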

How is this patch tested?

Wrote unit tests located in mlflow/tests/models/test_default_evaluator.py

Does this PR change the documentation?

Documentation will be updated once the full custom metrics feature is complete.

  • No. You can skip the rest of this section.
  • Yes. Make sure the changed pages / sections render correctly by following the steps below.
  1. Check the status of the ci/circleci: build_doc check. If it's successful, proceed to the
    next step, otherwise fix it.
  2. Click Details on the right to open the job page of CircleCI.
  3. Click the Artifacts tab.
  4. Click docs/build/html/index.html.
  5. Find the changed pages / sections and make sure they render correctly.

Release Notes

Is this a user-facing change?

  • No. You can skip the rest of this section.
  • Yes. Give a description of this change to be included in the release notes for MLflow users.

Enables MLflow tracking of custom metrics produced by user-provided metric functions.


What component(s), interfaces, languages, and integrations does this PR affect?

Components

  • area/artifacts: Artifact stores and artifact logging
  • area/build: Build and test infrastructure for MLflow
  • area/docs: MLflow documentation pages
  • area/examples: Example code
  • area/model-registry: Model Registry service, APIs, and the fluent client calls for Model Registry
  • area/models: MLmodel format, model serialization/deserialization, flavors
  • area/projects: MLproject format, project running backends
  • area/scoring: MLflow Model server, model deployment tools, Spark UDFs
  • area/server-infra: MLflow Tracking server backend
  • area/tracking: Tracking Service, tracking client APIs, autologging

Interface

  • area/uiux: Front-end, user experience, plotting, JavaScript, JavaScript dev server
  • area/docker: Docker use across MLflow's components, such as MLflow Projects and MLflow Models
  • area/sqlalchemy: Use of SQLAlchemy in the Tracking Service or Model Registry
  • area/windows: Windows support

Language

  • language/r: R APIs and clients
  • language/java: Java APIs and clients
  • language/new: Proposals for new client languages

Integrations

  • integrations/azure: Azure and Azure ML integrations
  • integrations/sagemaker: SageMaker integrations
  • integrations/databricks: Databricks integrations

How should the PR be classified in the release notes? Choose one:

  • rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
  • rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
  • rn/feature - A new user-facing feature worth mentioning in the release notes
  • rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
  • rn/documentation - A user-facing documentation change worth mentioning in the release notes

Signed-off-by: Mark Zhang <mark.zhang@databricks.com>
@github-actions bot added labels area/tracking (Tracking service, tracking client APIs, autologging) and rn/feature (Mention under Features in Changelogs) on Feb 18, 2022
@MarkYHZhang (Contributor, Author) commented:

Hey @apurvakoti @dbczumar @jinzhang21 @WeichenXu123, can you all take a look? It seems I don't have permission to add reviewers, so I'm tagging you all here. Thanks!

@jinzhang21 (Collaborator) left a comment:

Added some initial comments on the API. I'll review the rest later today.

Review comments on: mlflow/models/evaluation/base.py
Signed-off-by: Mark Zhang <mark.zhang@databricks.com>
Signed-off-by: Mark Zhang <mark.zhang@databricks.com>
@apurva-koti (Collaborator) left a comment:

Some small comments; I'll do a full review again later.

Review comments on: mlflow/models/evaluation/base.py, mlflow/models/evaluation/default_evaluator.py
Signed-off-by: Mark Zhang <mark.zhang@databricks.com>
Signed-off-by: Mark Zhang <mark.zhang@databricks.com>
…. Since DefaultEvaluator saves the DataFrame as CSV, and CSV files can also be viewed in the MLflow UI

Signed-off-by: Mark Zhang <mark.zhang@databricks.com>
@apurva-koti (Collaborator) left a comment:

Looks good! Let's use [] as the default argument for custom_metrics rather than None.

Review comments on: mlflow/models/evaluation/base.py, mlflow/models/evaluation/default_evaluator.py
@MarkYHZhang (Contributor, Author) commented Feb 23, 2022:

Looks good! Let's use [] as the default argument for custom_metrics rather than None.

Thanks! I think we should avoid using a mutable default argument in Python due to the issues described here. I could initialize it to a list inside the constructor, but considering that other default arguments like

feature_names: list = None,

are not initialized that way, I think it's a good idea just to leave it as is.
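For illustration, the mutable-default pitfall being referred to is the standard Python behavior shown below; this is a generic sketch, not code from this PR:

```python
def add_metric(name, metrics=[]):
    # The [] default is evaluated once, when the function is defined,
    # so the same list object is shared across all calls.
    metrics.append(name)
    return metrics


print(add_metric("rmse"))  # ['rmse']
print(add_metric("mae"))   # ['rmse', 'mae']  <- state leaked from the first call


def add_metric_safe(name, metrics=None):
    # The conventional fix: default to None and create a fresh list per call.
    if metrics is None:
        metrics = []
    metrics.append(name)
    return metrics


print(add_metric_safe("rmse"))  # ['rmse']
print(add_metric_safe("mae"))   # ['mae']
```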

@apurva-koti (Collaborator) commented:

Ah, good catch @MarkYHZhang! Let's keep it None then.

Signed-off-by: Mark Zhang <mark.zhang@databricks.com>
@apurva-koti (Collaborator) left a comment:

LGTM once all comments are addressed!

Review comments on: mlflow/models/evaluation/default_evaluator.py, tests/models/test_default_evaluator.py
Signed-off-by: Mark Zhang <mark.zhang@databricks.com>
@dbczumar (Collaborator) left a comment:

@MarkYHZhang This looks great! Left a few small comments - should be ready to merge soon!

…ate_custom_metric function. Modified docstring to only include metrics support

Signed-off-by: Mark Zhang <markzhang.inbox@gmail.com>
@dbczumar (Collaborator) left a comment:

@MarkYHZhang Do we plan to add an example to the examples section or extend one of the existing examples? Can we do that as part of this PR?

@dbczumar (Collaborator) left a comment:

LGTM once the small docs nits have been addressed and an example has been included. Thanks @MarkYHZhang! Awesome stuff!

Review comments on: mlflow/models/evaluation/base.py, mlflow/models/evaluation/default_evaluator.py
…urn validation. Plus a few minor stylistic changes

Signed-off-by: Mark Zhang <markzhang.inbox@gmail.com>
Review comments on: mlflow/models/evaluation/base.py, mlflow/models/evaluation/default_evaluator.py, tests/models/test_default_evaluator.py
…builtin_metrics per custom metric function

Signed-off-by: Mark Zhang <markzhang.inbox@gmail.com>
@MarkYHZhang MarkYHZhang merged commit cb7b361 into mlflow:master Feb 25, 2022
Labels: area/tracking (Tracking service, tracking client APIs, autologging), rn/feature (Mention under Features in Changelogs)
Projects: None yet
Linked issues: None yet
Participants: 5