Add EarlyStopping integration to TensorFlow.Keras autologging #2301

juntai-zheng · 2020-01-13T23:21:39Z

What changes are proposed in this pull request?

Mirrors work on #2219 by adding EarlyStopping callback integration to TensorFlow.Keras (both 1.X and 2.X). Does not include fit_generator() support.

How is this patch tested?

Unit tests written in tests/tensorflow_autolog/test_tensorflow_autolog.py and tests/tensorflow_autolog/test_tensorflow2_autolog.py

Release Notes

Is this a user-facing change?

No. You can skip the rest of this section.
Yes. Give a description of this change to be included in the release notes for MLflow users.

Same as #2219 , but this is for TensorFlow.Keras.

What component(s) does this PR affect?

How should the PR be classified in the release notes? Choose one:

rn/breaking-change - The PR will be mentioned in the "Breaking Changes" section
rn/none - No description will be included. The PR will be mentioned only by the PR number in the "Small Bugfixes and Documentation Updates" section
rn/feature - A new user-facing feature worth mentioning in the release notes
rn/bug-fix - A user-facing bug fix worth mentioning in the release notes
rn/documentation - A user-facing documentation change worth mentioning in the release notes

smurching

Mostly looks good. We should decide what to do regarding the tradeoff of:

Log both loss & epoch_loss, accuracy & epoch_acc metrics, including in the non-early stopping case (the current behavior - IMO this might confuse users). We could make this better by only logging the extra loss & acc metrics in the early-stopping case
Remove Tensorboard callback from Keras model to avoid generating epoch_loss, epoch_acc metrics. This has the downside of us no longer generating Tensorboard logs as an artifact, & if the user does include a TensorBoard callback we'll still log epoch_loss etc.
Provide special handling when logging metrics of the restored model after early stopping - log loss & accuracy with the epoch_ prefix, & don't log an extra step of other metrics

Out of these options I think I like 3 the most, if it's not too complicated - let me know what you think.

smurching

LGTM

juntai-zheng · 2020-01-15T22:39:56Z

We decided it was best to stick with option number 3 for simplicity. Note that the callback logs epoch_loss and epoch_acc for TF 1.X, but loss and acc for TF 2.X.

* Add autodetection of job environments to R client (mlflow#2272) * R client detection * Efficiency * Simplify * Return tags * Add test cases * Lint * Tweak test name * EarlyStopping Callback support for Keras Autologging (mlflow#2219) Keras autologging now supports the EarlyStopping callback. If EarlyStopping.restore_best_weights==True, then the metrics of the restored model will be logged as an extra step. * Add REPL-aware listener for Spark datasource autologging (mlflow#2249) Add REPL-aware listener for Spark datasource autologging (mlflow#2249) * Fix XGBoost and LightGBM flavor tests (mlflow#2244) Add objective and num_class to xgb.train() and lgb.train() because they try to solve a regression task by default, but the iris dataset is a dataset for a classification task. * Add status in RunView and ExperimentRunsTable (mlflow#1816) * Changes needed to support the sqlplugin (mlflow#2285) * Changes needed to support the sqlplugin * Edited plugin name in setup file * Updated plugin name in setup file Co-authored-by: dbczumar <39497902+dbczumar@users.noreply.github.com> * Elevate how to load mleap flavor (mlflow#2211) * Elevate how to load mleap flavor. * Load using loadPipeline. * Get rid of extra char. Co-authored-by: dbczumar <39497902+dbczumar@users.noreply.github.com> * pillow fix (mlflow#2307) * Add EarlyStopping integration to TensorFlow.Keras autologging (mlflow#2301) Merging TF.Keras EarlyStopping integration * Document MLflow plugin system (mlflow#2270) Adds an mlflow.org doc page explaining how to write & use MLflow plugins. Co-authored-by: dbczumar <39497902+dbczumar@users.noreply.github.com> Co-authored-by: juntai-zheng <39497939+juntai-zheng@users.noreply.github.com> Co-authored-by: Siddharth Murching <smurching@gmail.com> Co-authored-by: Harutaka Kawamura <hkawamura0130@gmail.com> Co-authored-by: Nicolas Laille <nlaille@users.noreply.github.com> Co-authored-by: Avrilia Floratou <avflor@microsoft.com> Co-authored-by: Stephanie Bodoff <stephanie.bodoff@databricks.com>

…#2301) Merging TF.Keras EarlyStopping integration

juntai-zheng added 7 commits January 10, 2020 15:48

WIP

e46a9e8

WIP

ef26060

added tf2_keras tests

a8bc160

rearranged test

f27921d

added tests for tf1

6fd59c1

linting

0c33020

variable rename

f4e19e3

smurching reviewed Jan 14, 2020

View reviewed changes

log 'loss' and' 'acc' with 'epoch_' prefix

13be10c

smurching approved these changes Jan 15, 2020

View reviewed changes

restored incorrectly deleted on_epoch_end

d94e934

juntai-zheng merged commit 3bec7c1 into mlflow:master Jan 15, 2020

juntai-zheng added the rn/feature Mention under Features in Changelogs. label Jan 15, 2020

avflor pushed a commit to avflor/mlflow that referenced this pull request Aug 22, 2020

Add EarlyStopping integration to TensorFlow.Keras autologging (mlflow…

800b7ed

…#2301) Merging TF.Keras EarlyStopping integration

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add EarlyStopping integration to TensorFlow.Keras autologging #2301

Add EarlyStopping integration to TensorFlow.Keras autologging #2301

juntai-zheng commented Jan 13, 2020

smurching left a comment •

edited

smurching left a comment

juntai-zheng commented Jan 15, 2020

Add EarlyStopping integration to TensorFlow.Keras autologging #2301

Add EarlyStopping integration to TensorFlow.Keras autologging #2301

Conversation

juntai-zheng commented Jan 13, 2020

What changes are proposed in this pull request?

How is this patch tested?

Release Notes

Is this a user-facing change?

What component(s) does this PR affect?

How should the PR be classified in the release notes? Choose one:

smurching left a comment • edited

Choose a reason for hiding this comment

smurching left a comment

Choose a reason for hiding this comment

juntai-zheng commented Jan 15, 2020

smurching left a comment •

edited