
Simplify logging behavior #2645

Merged: 41 commits merged into main on Sep 9, 2021

Conversation

@eccabay (Contributor) commented on Aug 17, 2021

Closes #1174 - design doc here

@codecov bot commented on Aug 23, 2021

Codecov Report

Merging #2645 (8e3ac71) into main (4dffcfa) will decrease coverage by 0.1%.
The diff coverage is 100.0%.

Impacted file tree graph

@@           Coverage Diff           @@
##            main   #2645     +/-   ##
=======================================
- Coverage   99.9%   99.9%   -0.0%     
=======================================
  Files        301     301             
  Lines      27851   27751    -100     
=======================================
- Hits       27802   27702    -100     
  Misses        49      49             
Impacted Files Coverage Δ
evalml/data_checks/no_variance_data_check.py 100.0% <ø> (ø)
...nts/transformers/preprocessing/text_transformer.py 100.0% <ø> (ø)
evalml/utils/logger.py 100.0% <ø> (ø)
evalml/automl/automl_search.py 99.9% <100.0%> (+0.1%) ⬆️
evalml/automl/callbacks.py 100.0% <100.0%> (ø)
...derstanding/prediction_explanations/_algorithms.py 99.0% <100.0%> (ø)
evalml/objectives/sensitivity_low_alert.py 100.0% <100.0%> (ø)
evalml/pipelines/component_graph.py 99.8% <100.0%> (+0.1%) ⬆️
evalml/pipelines/components/component_base.py 100.0% <100.0%> (ø)
evalml/pipelines/components/utils.py 100.0% <100.0%> (ø)
... and 9 more

Continue to review full report at Codecov.

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4dffcfa...8e3ac71.

@eccabay eccabay self-assigned this Aug 24, 2021
@eccabay eccabay changed the title 1174 logging Simplify logging behavior Aug 24, 2021
@eccabay eccabay marked this pull request as ready for review August 24, 2021 19:41
Review thread on docs/source/start.ipynb (outdated, resolved)
logger.info("Exiting AutoMLSearch.")
self._activate_logger
self.logger.info("Exiting AutoMLSearch.")
self._deactivate_logger
@dsherry (Contributor) commented:

Warning: missing parentheses to invoke the methods here

I'm surprised our lint didn't catch this! I wonder if there's a way / file an issue to have lint fail if we try to invoke a non-property as a property. Tagging @angela97lin since she's been messing with linting / style recently

@angela97lin (Contributor) replied:

That's a really good question @dsherry! I'm not sure I've run into anything that would have caught this--probably tricky to do since calling this without the parentheses still returns the method and is "valid" 🤔 . Will keep an eye out though!
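
For illustration, a minimal standalone sketch (a hypothetical class, not the evalml code) of why this slips past lint: a bare method reference is a perfectly legal expression that evaluates to a bound method object, so nothing flags it, it just never runs.

```python
class Search:
    def _activate_logger(self):
        print("logger activated")


s = Search()
s._activate_logger    # legal but a no-op: evaluates to a bound method object and discards it
s._activate_logger()  # the parentheses are what actually invoke the method
```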

"AutoMLSearch.search() has already been run and will not run again on the same instance. Re-initialize AutoMLSearch to search again."
)
self._deactivate_logger()
Contributor:

What would happen if we deleted _deactivate_logger entirely from this PR?

@eccabay (Contributor, Author) replied:

Deleting _deactivate_logger would mean we can no longer run AutoML search with verbose=False after running with verbose=True, since our get_logger function (run when verbose is True) modifies the same logger that's grabbed by logging.getLogger when verbose is False.

Just now, in reading your comments, I had a thought: the logger used when verbose is True doesn't have to share a name with the one used when verbose is False. Instead, we could create another sub-logger for the verbose=True case (like I did with pipeline.describe() and others) so that any user-configured logging behavior is maintained without needing to be reset with _deactivate_logger at the end of each method that uses it. This would also resolve your question above about setting the logging level back to logging.WARNING in _deactivate_logger, as that would no longer be necessary. Thoughts?
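
A minimal sketch of that sub-logger idea, assuming an "evalml" parent logger and an illustrative "evalml.verbose" child; this is not the code in the PR, just the shape of the approach. Verbose output goes through a dedicated child logger with its own handler, so the parent logger (and whatever the user configured on it) never needs to be reset afterwards.

```python
import logging
import sys


def get_verbose_logger(name="evalml"):
    # configure a dedicated child logger for verbose output;
    # the parent "evalml" logger and any user configuration on it stay untouched
    verbose_logger = logging.getLogger(f"{name}.verbose")
    if not verbose_logger.handlers:  # avoid stacking duplicate handlers on repeated calls
        handler = logging.StreamHandler(sys.stdout)
        handler.setFormatter(logging.Formatter("%(message)s"))
        verbose_logger.addHandler(handler)
    verbose_logger.setLevel(logging.INFO)
    verbose_logger.propagate = False  # keep verbose records out of the parent's handlers
    return verbose_logger


# The verbose=True path logs through the child; the verbose=False path keeps using
# logging.getLogger("evalml"), which still defaults to WARNING, so no reset is needed.
get_verbose_logger().info("Exiting AutoMLSearch.")
```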

@dsherry dsherry dismissed their stale review September 1, 2021 20:09

out of date

@bchen1116 (Contributor) left a comment:

Left a question and a few nits, but this was great work!

Review thread on docs/source/user_guide/utilities.ipynb (outdated, resolved)
@@ -1358,7 +1369,7 @@ def _check_for_high_variance(self, pipeline, cv_scores, threshold=0.5):
return high_variance_cv
cv_range = max(cv_scores) - min(cv_scores)
if cv_range >= threshold * allowed_range:
logger.warning(
self.logger.warning(
Contributor:

Not sure why this is happening, but seems like I get warnings about high variance when I set verbose to False, but I don't get the same warning when verbose is True:
[screenshot of the search output]

Would you know why this is happening?

@eccabay (Contributor, Author) replied:

The warning is still there in both cases, it just appears in different ways! When verbose=False, the default logging configuration is used, so warnings and higher are printed in the default format with red highlighting. When verbose=True, we override the default formatting settings, so while warnings are still printed to stdout (along with all the info-level information), they are no longer highlighted in red.

You can see the same warning in the verbose=True case, in the report of the Decision Tree cv scores.
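
A rough standalone illustration of the difference (the logger name, messages, and handler setup below are illustrative, not the PR's actual configuration): with no handlers configured, a WARNING falls through to logging's last-resort stderr handler, which notebooks render with red highlighting; once a stdout handler at INFO level is attached, the same warning prints as plain text among the info output.

```python
import logging
import sys

logger = logging.getLogger("demo")

# "verbose=False"-style behavior: no handlers configured, so the record falls
# through to logging's last-resort stderr handler (rendered in red in notebooks).
logger.warning("High variance detected in cross-validation scores.")

# "verbose=True"-style behavior: a stdout handler at INFO level prints the same
# warning as plain text, interleaved with the info-level search output.
handler = logging.StreamHandler(sys.stdout)
handler.setFormatter(logging.Formatter("%(levelname)s: %(message)s"))
logger.addHandler(handler)
logger.setLevel(logging.INFO)
logger.info("Decision Tree Classifier: starting cross validation")
logger.warning("High variance detected in cross-validation scores.")
```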

Review threads on evalml/pipelines/utils.py (resolved) and evalml/tests/automl_tests/test_automl.py (outdated, resolved)
@chukarsten (Contributor) left a comment:

This is a great PR. I like what's going on here and the simplicity of it.

I think the one thing we should address before merge is the testing strategy. Does it make sense to test this feature by running a variety of tests with verbose on and off? I think this is OK for tests that don't increase the test runtime much (i.e. ones that don't run automl.search). What are your thoughts on keeping the tests as written before this PR, adding verbose=True to all of them, and then having just a single test that verifies nothing is logged when verbose=False? It seems that a single test should capture the verbose=False case entirely.

Let me know your thoughts. If you don't want to do that, I think it would be sufficient to show the runtime of the tests isn't drastically affected!

Another nice to have would be if you could silence the dask/cfengine stuff in the docs. Out of scope, so feel free to tell me to pound sand.
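
For reference, one way the third-party log noise could be quieted in the docs notebooks, as a sketch only; the logger names below are guesses at where the noise originates, not something this PR changes:

```python
import logging

# raise the threshold on noisy third-party loggers so the docs output stays readable
for name in ("distributed", "distributed.worker", "distributed.scheduler"):
    logging.getLogger(name).setLevel(logging.ERROR)
```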

Review threads on docs/source/user_guide/automl.ipynb (outdated, resolved), evalml/automl/automl_search.py (outdated, resolved), and evalml/tests/automl_tests/test_automl.py (resolved)
@@ -337,10 +360,17 @@ def test_pipeline_fit_raises(AutoMLTestEnv, X_y_binary, caplog):
assert np.isnan(score)


def test_pipeline_score_raises(AutoMLTestEnv, X_y_binary, caplog):
@pytest.mark.parametrize("verbose", [True, False])
Contributor:

Ok, do we need to run this test for both verbosities if we're not checking for anything different?

@@ -362,7 +392,9 @@ def test_pipeline_score_raises(AutoMLTestEnv, X_y_binary, caplog):


@patch("evalml.objectives.AUC.score")
def test_objective_score_raises(mock_score, X_y_binary, caplog):
@pytest.mark.parametrize("verbose", [True, False])
Contributor:

Same question here, except I think now we're also going to pay a penalty of running search. I worry that all the parametrization of verbosity doesn't really provide much testing benefit while increasing the runtime of tests.

@@ -2564,7 +2607,9 @@ def test_early_stopping(caplog, logistic_regression_binary_pipeline_class, X_y_b

assert not automl._should_continue()
out = caplog.text
assert "2 iterations without improvement. Stopping search early." in out
assert (
Contributor:

I think this test makes sense and can test both verbosities without much penalty in runtime.

@eccabay (Contributor, Author) commented on Sep 8, 2021

@chukarsten thanks for the testing suggestions! I took another look at the tests with parameterizations of verbose values, and I agree, a lot of them weren't necessary. I've filtered them according to a few criteria:

  • Any tests that didn't change behavior whether verbose was True or False (i.e. the message logged was either warning or error level) don't need parameterization. If they pass when verbose is False, they will pass when verbose is True by definition.
  • I removed some redundant tests, i.e. tests of verbose vs. non-verbose behavior in methods that other tests already covered. For example, parameterizing verbose=[True, False] in test_pipeline_limits made sure the distinction between the values worked correctly in both __init__ and search. Therefore, we didn't need to parameterize verbose in test_automl_drop_unknown_columns, since the logging differences there only occur in __init__.

At this point, every test that includes @pytest.mark.parametrize("verbose", [True, False]) should be testing something unique with respect to the verbose flag's functionality and behavior.
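
As a hedged sketch of the resulting pattern (the helper, logger name, and messages below are stand-ins, not the PR's actual tests): parametrize verbose only where the output differs, and use caplog to assert that nothing is logged when verbose=False.

```python
import logging

import pytest

logger = logging.getLogger("evalml")


def run_search(verbose):
    # stand-in for a small AutoMLSearch run; only logs when verbose is requested
    if verbose:
        logger.info("Search finished")


@pytest.mark.parametrize("verbose", [True, False])
def test_search_logging(verbose, caplog):
    caplog.set_level(logging.INFO)
    run_search(verbose=verbose)
    if verbose:
        assert "Search finished" in caplog.text
    else:
        assert caplog.text == ""
```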

@eccabay eccabay merged commit adb844b into main Sep 9, 2021
@chukarsten chukarsten mentioned this pull request Sep 10, 2021
@eccabay eccabay deleted the 1174_logging branch March 10, 2022 15:34