
Apply flake8-logging-format changes to providers #24933

Closed

Conversation

@josh-fell josh-fell commented Jul 8, 2022

Related: #23597 #24910

This is pre-work for hopefully including the flake8-logging-format extension in CI. There are roughly 88 file changes, so I'm splitting them into smaller chunks.

After the initial sweeps are completed we can do a final pass when formally implementing the extension in CI.

I'm not entirely certain whether useful information will be lost from what log handlers ingest, but I would love feedback on the general patterns. A core PR will follow based on that feedback.

  • Fix failing tests


@@ -107,9 +106,8 @@ def test_hook_raises(self):

mock_error.assert_called_once_with(
'Could not create an S3Hook with connection id "%s". Please make '
'sure that apache-airflow[aws] is installed and the S3 connection exists. Exception : "%s"',
@josh-fell josh-fell commented Aug 1, 2022

This is an example of where I'm unsure whether accounting for the G200 error ("Logging statements should not include the exception in logged string [use exception or exc_info=True]") will negatively affect what's logged and, perhaps more importantly, what the log handlers will ingest.

Here "Exception:" is part of the logging line directly but disallowed by the flake8 plugin. Does this matter when exc_info=True is used, or when a raise follows? Is any information really lost from the logs? Maybe there is a slick workaround that I'm missing?
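To make the trade-off concrete, here is a minimal, self-contained sketch (the `connect`/`capture` helpers and logger names are hypothetical, not from the PR) comparing the pattern G200 flags against the exc_info=True form it recommends:

```python
import io
import logging


def capture(log_call):
    """Run log_call against a logger wired to a StringIO and return what was logged."""
    stream = io.StringIO()
    handler = logging.StreamHandler(stream)
    logger = logging.getLogger("g200_demo")
    logger.addHandler(handler)
    logger.setLevel(logging.ERROR)
    try:
        log_call(logger)
    finally:
        logger.removeHandler(handler)
    return stream.getvalue()


def connect():
    raise ConnectionError("Failed to connect")


def old_style(log):
    # Pattern flagged by G200: the exception is interpolated into the message string.
    try:
        connect()
    except ConnectionError as e:
        log.error('Could not create hook. Exception: "%s"', e)


def g200_style(log):
    # G200-compliant alternative: exc_info=True attaches the exception type,
    # message, and full traceback to the log record.
    try:
        connect()
    except ConnectionError:
        log.error("Could not create hook", exc_info=True)
```

Both variants surface "Failed to connect" in the log output; the exc_info form additionally carries the traceback, so dropping the inline "Exception:" text loses nothing as long as handlers render exc_info (the default formatter does).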

josh-fell (Contributor, Author)

An option could be ignoring the G200 error as well as part of flake8-logging-format should we want to introduce it. @kaxil Do you have an opinion here since you'd taken a crack at using this flake8 plugin elsewhere?

@@ -172,7 +172,7 @@ def test_failed_write_to_remote_on_close(self, mock_blob, mock_client, mock_cred
(
"airflow.providers.google.cloud.log.gcs_task_handler.GCSTaskHandler",
logging.ERROR,
"Could not write logs to gs://bucket/remote/log/location/1.log: Failed to connect",
josh-fell (Contributor, Author)

Another example where log information could potentially go missing; I'm unsure whether it's actually an issue.

@josh-fell josh-fell marked this pull request as ready for review August 1, 2022 19:36
@josh-fell josh-fell force-pushed the flake8-logging-format-providers branch from 4c80aab to b141bb3 Compare August 1, 2022 19:36
-        self.log.error(e)
-        raise AirflowException(f"Errors when deleting: {key}")
+        self.log.error("Errors when deleting %s", key)
+        raise AirflowException(e)
Member

Since this is a provider, I wonder if we should just switch to re-raising the original exception. Coercing the error to AirflowException offers no benefits at all.

Member

Agree :). We used that pattern in the past, but unless you use a dedicated AirflowSkipException or AirflowFailException it's better to raise the original exception.

josh-fell (Contributor, Author)

Ah yes, this totally makes sense. Agreed. I'll do a clean sweep of all the provider files I'm touching for this as well.
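A minimal sketch of the re-raise pattern being suggested. To keep it runnable without Airflow installed, `AirflowException` is a local stand-in class, and the `delete_*` functions and fake client are hypothetical illustrations, not code from the PR:

```python
class AirflowException(Exception):
    """Local stand-in for airflow.exceptions.AirflowException."""


def delete_wrapping(client, key):
    # Old pattern: the caller can no longer catch the original exception
    # type, since it has been coerced into AirflowException.
    try:
        client.delete(key)
    except Exception as e:
        raise AirflowException(f"Errors when deleting: {key}") from e


def delete_reraise(client, key, log):
    # Preferred pattern: log the useful context, then re-raise the original
    # exception unchanged with a bare `raise`, preserving type and traceback.
    try:
        client.delete(key)
    except Exception:
        log.error("Errors when deleting %s", key)
        raise
```

With the bare `raise`, callers can still catch the specific underlying exception (e.g. a permissions error), which is the benefit the reviewers point out.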

-        except Exception as general_error:
-            self.log.error("Failed to create aws glue job, error: %s", general_error)
+        except Exception:
+            self.log.error("Failed to create aws glue job.")
@uranusjr uranusjr (Member) commented Aug 2, 2022

Do we want to use exception here? (same for many below)

@potiuk potiuk (Member) commented Aug 2, 2022

Yep, sounds strange. We should catch the specific exceptions we know about and let everything else bubble up directly. There is no point in extra logging here unless we can provide "specific" information that helps the user react to a known exception. Logging a meaningless line adds nothing: the user already knows the Glue job failed to be created, and repeating that without any actual help or instructions on what to do is borderline harassing the user ("Hello, you already know we failed, so let us repeat it here").

josh-fell (Contributor, Author)

> Do we want to use exception here? (same for many below)

Generally, if there was a raise, I stuck to using error so the traceback wasn't logged twice.

Oof yeah @potiuk there are a lot of generic exceptions in these files. I'll clean them up.
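Combining the two points above (catch only the specific exception; use error rather than exception before a re-raise), the agreed pattern might look like the sketch below. `ClientError` is a local stand-in for a boto-style error class, and `create_glue_job` with its fake client are hypothetical, not the provider's actual code:

```python
import logging

log = logging.getLogger(__name__)


class ClientError(Exception):
    """Local stand-in for a known, specific AWS client error class."""


def create_glue_job(client, config):
    # Catch only the specific, known exception and add actionable context.
    # Any other exception bubbles up directly with its own traceback,
    # unlogged, per the review feedback.
    try:
        return client.create_job(**config)
    except ClientError:
        # log.error rather than log.exception: the bare `raise` below means
        # the traceback would otherwise be emitted twice.
        log.error(
            "Failed to create AWS Glue job %r; check the job parameters.",
            config.get("Name"),
        )
        raise
```

The known error gets a contextual log line and is re-raised intact, while unexpected errors (e.g. a TypeError from bad arguments) propagate untouched.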

@josh-fell josh-fell (Contributor, Author)

Not stale. Just need to find some time to tackle #24933 first. There are similar patterns, and I want to make sure there is consensus and consistency.

github-actions (bot)

This pull request has been automatically marked as stale because it has not had recent activity. It will be closed in 5 days if no further activity occurs. Thank you for your contributions.

@github-actions github-actions bot added the stale Stale PRs per the .github/workflows/stale.yml policy file label Oct 14, 2022
@github-actions github-actions bot closed this Oct 20, 2022
Labels
area:logging, area:providers, provider:amazon-aws (AWS/Amazon related issues), provider:cncf-kubernetes (Kubernetes provider related issues), provider:google (Google, including GCP, related issues), stale (Stale PRs per the .github/workflows/stale.yml policy file)
3 participants