[AIRFLOW-922] Update PrestoHook to enable synchronous execution by patrickmckenna · Pull Request #2206 · apache/airflow

patrickmckenna · 2017-03-31T18:45:49Z

JIRA

https://issues.apache.org/jira/browse/AIRFLOW-922

Description

This updates PrestoHook so that it can block until a statement finishes executing. Currently, PrestoHook.run returns as soon as it sends a statement, so Operators that use it won't run synchronously.

There are other, minor changes too that seemed worth making (hence the separate commits), though they're unrelated to the primary goal of this PR. Of course if you'd rather jettison those, or put them in a separate PR, that's fine by me.

Tests

I added some under tests/contrib/hooks, though I wasn't sure if that was the proper location. (PrestoHook is built in, not user-contributed, but AFAICT there are no existing tests for it.)

Commits

I haven't squashed commits yet, because I wanted to keep the history easy to see until the code is in a satisfactory state. Once it is, I'm happy to rewrite the commit history (unless you plan to just squash and merge to take care of that?).

mention-bot · 2017-03-31T18:45:50Z

@patrickmckenna, thanks for your PR! By analyzing the history of the files in this pull request, we identified @mistercrunch, @artwr and @smarden1 to be potential reviewers.

patrickmckenna · 2017-04-28T22:55:22Z

Sorry for letting this PR languish with broken tests! I had assumed using pytest was acceptable, since it was employed in an existing test. However, it looks like that test has been modified to no longer use pytest. I'll make similar updates to this PR.

The previous version of this hook included a fair amount of error message reformatting. But there was no logic to actually recover from the errors; they were just re-raised with the original error hidden. I thought it might be better to remove that reformatting (in ead2f33). However, I see that 6dd4b3b added more error message reformatting (though the new method doesn't appear to be called anywhere).

@artwr @mistercrunch Would you prefer, then, that I revert ead2f33b9c6051e65c147f8d34c7ec92c11d4544 and incorporate 6dd4b3b?

patrickmckenna · 2017-04-29T18:01:49Z

The tests are passing.

SamWildmo · 2018-11-19T19:13:04Z

@patrickmckenna are you still working on this PR?

Rotemlofer · 2019-01-02T11:45:42Z

As a Presto user I would love to see this one in. I have several use cases for this.
I wonder why this didn't get any comments/code review?

patrickmckenna · 2019-01-25T22:24:44Z

I think giving PrestoHook.run the ability to execute synchronously (and possibly even doing so by default) is still desired, missing functionality. If that isn't the case, please LMK and I'll close this PR.

For now, I've updated it to incorporate (most of) the latest commits on apache:master, and removed some of the earlier, stylistic changes that weren't strictly related to AIRFLOW-922.

I'm a bit confused by the partial test failures, which occur only on some Python 2 builds. Anyone else have insights here? (The test suite does seem a bit flaky: e.g. this is a recent successful build of apache/master, but a different build of the same commit partially failed. I'm unsure if that's the root cause, though.)

/cc @SamWildmo @Rotemlofer (as interested users)
/cc @Fokko @kaxil @feng-tao (as maintainers who've touched presto_hook the most recently)

mik-laj · 2019-01-25T22:39:35Z

airflow/hooks/presto_hook.py

+            return cursor.poll() is None
+        except Exception as ex:
+            msg = "Couldn't determine statement execution status: ".format(ex)
+            self.log.error(msg)


You should pass unformatted text to the logger.

Suggested change

- self.log.error(msg)
+ self.log.error("Couldn't determine statement execution status: %s", ex)

patrickmckenna · 2019-02-01

@mik-laj ah, is that the agreed upon style preference? Happy to change it, just wasn't aware (didn't see anything in the docs or linting tests enforcing that, but may very well have missed it 😄).

mik-laj · 2019-02-02

This is not a matter of code style, but a principle of using loggers. If you format the message before passing it to the logger, you create an unnecessary object that the logger may discard when the level is too low. One of the ways to increase application performance is to reduce the number of logs collected.

In some cases, the logger does not format the text at all: it can store the message template and the data separately to make analysis easier. If you format the data before passing it to the logger, this functionality disappears.

These notes apply to any programming language.
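The difference described in this thread can be checked with the standard `logging` module alone. The following is a minimal sketch, not code from the PR: the `CountingError` class and the logger name are hypothetical, introduced only to count how often the exception is rendered to text. It also shows that a template without a `{}` placeholder, as in the quoted diff line, leaves the exception out of the message.

```python
import io
import logging


class CountingError(Exception):
    """Hypothetical exception that counts how often it is rendered to a string."""

    renders = 0

    def __str__(self):
        CountingError.renders += 1
        return "connection reset"


stream = io.StringIO()
log = logging.getLogger("presto_hook_logging_sketch")  # hypothetical logger name
log.addHandler(logging.StreamHandler(stream))
log.propagate = False
log.setLevel(logging.CRITICAL)  # ERROR records are filtered out at this level

ex = CountingError()

# Eager: the message is built before the logger decides whether to emit it.
log.error("Couldn't determine statement execution status: {}".format(ex))
eager_renders = CountingError.renders

# Lazy: the logger formats the arguments only if the record is emitted.
log.error("Couldn't determine statement execution status: %s", ex)
lazy_renders = CountingError.renders - eager_renders

# A template without a placeholder silently drops the exception text.
dropped = "Couldn't determine statement execution status: ".format(ex)

# Once ERROR is enabled, the lazy call renders the exception and emits it.
log.setLevel(logging.ERROR)
log.error("Couldn't determine statement execution status: %s", ex)
emitted = stream.getvalue()
```

With the level set to `CRITICAL`, the eager call still renders the exception once, while the lazy call does no rendering at all.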

Fokko · 2019-01-29T12:38:35Z

It looks like Python2 isn't passing on the tests:


======================================================================
66) FAIL: test_run_continues_polling_if_execution_status_unknown (tests.hooks.test_presto_hook.TestPrestoHook)
----------------------------------------------------------------------
   Traceback (most recent call last):
    .tox/py27-backend_sqlite-env_docker/lib/python2.7/site-packages/mock/mock.py line 1305 in patched
      return func(*args, **keywargs)
    tests/hooks/test_presto_hook.py line 87 in test_run_continues_polling_if_execution_status_unknown
      mock_sleep.assert_called_once_with(POLL_INTERVAL)
    .tox/py27-backend_sqlite-env_docker/lib/python2.7/site-packages/mock/mock.py line 947 in assert_called_once_with
      raise AssertionError(msg)
   AssertionError: Expected 'sleep' to be called once. Called 0 times.
   -------------------- >> begin captured stdout << ---------------------
   [2019-01-24 21:08:46,786] {base_hook.py:83} INFO - Using connection to: id: presto_default. Host: localhost, Port: 3400, Schema: hive, Login: None, Password: None, extra: {}
   
   --------------------- >> end captured stdout << ----------------------
   -------------------- >> begin captured logging << --------------------
   airflow.utils.log.logging_mixin.LoggingMixin: INFO: Using connection to: id: presto_default. Host: localhost, Port: 3400, Schema: hive, Login: None, Password: None, extra: {}
   --------------------- >> end captured logging << ---------------------
======================================================================
67) FAIL: test_run_optionally_blocks_while_statement_executes (tests.hooks.test_presto_hook.TestPrestoHook)
----------------------------------------------------------------------
   Traceback (most recent call last):
    .tox/py27-backend_sqlite-env_docker/lib/python2.7/site-packages/mock/mock.py line 1305 in patched
      return func(*args, **keywargs)
    tests/hooks/test_presto_hook.py line 74 in test_run_optionally_blocks_while_statement_executes
      mock_sleep.assert_called_once_with(POLL_INTERVAL)
    .tox/py27-backend_sqlite-env_docker/lib/python2.7/site-packages/mock/mock.py line 947 in assert_called_once_with
      raise AssertionError(msg)
   AssertionError: Expected 'sleep' to be called once. Called 0 times.
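Both failures assert that a patched `sleep` was called exactly once. One way to keep such an assertion independent of how and where `time.sleep` is patched is to pass the sleep callable into the polling code. The sketch below illustrates that idea under stated assumptions; it is not the approach taken in this PR, and `wait_until_finished` is a hypothetical helper. Only the `POLL_INTERVAL` name is borrowed from the traceback, and its value here is arbitrary.

```python
from unittest import mock

POLL_INTERVAL = 5  # seconds; arbitrary value for the sketch


def wait_until_finished(is_finished, poll_interval, sleep):
    """Call sleep(poll_interval) between checks until is_finished() is true."""
    while not is_finished():
        sleep(poll_interval)


# First check reports "still running", second reports "finished".
is_finished = mock.Mock(side_effect=[False, True])
fake_sleep = mock.Mock()

wait_until_finished(is_finished, POLL_INTERVAL, sleep=fake_sleep)

# The injected callable records the call, with no module-level patching.
fake_sleep.assert_called_once_with(POLL_INTERVAL)
```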

patrickmckenna · 2019-02-01T01:11:27Z

Apologies for my slow reply! All of the currently failing builds show the same 2 errors, which appear unrelated to this PR:

failing tests

======================================================================
84) FAIL: test_scheduler_sla_miss_callback_exception (tests.test_jobs.SchedulerJobTest)
----------------------------------------------------------------------
   Traceback (most recent call last):
    tests/test_jobs.py line 3025 in test_scheduler_sla_miss_callback_exception
      sla_callback.assert_called()
    .tox/py35-backend_postgres-env_docker/lib/python3.5/site-packages/mock/mock.py line 906 in assert_called
      raise AssertionError(msg)
   AssertionError: Expected 'None' to have been called.
   -------------------- >> begin captured stdout << ---------------------
   [2019-02-01 00:36:45,730] {test_task_view_type_check.py:52} INFO - class_instance type: <class 'unusual_prefix_61c0ab525b75d060ea41c0aa11a98c88efc72c26_test_task_view_type_check.CallableClass'>
   
   --------------------- >> end captured stdout << ----------------------
   -------------------- >> begin captured logging << --------------------
   root: INFO: class_instance type: <class 'unusual_prefix_61c0ab525b75d060ea41c0aa11a98c88efc72c26_test_task_view_type_check.CallableClass'>
   --------------------- >> end captured logging << ---------------------
======================================================================
85) FAIL: test_scheduler_sla_miss_email_exception (tests.test_jobs.SchedulerJobTest)
----------------------------------------------------------------------
   Traceback (most recent call last):
    /usr/lib/python3.5/unittest/mock.py line 1157 in patched
      return func(*args, **keywargs)
    tests/test_jobs.py line 3069 in test_scheduler_sla_miss_email_exception
      'test_sla_miss')
    .tox/py35-backend_postgres-env_docker/lib/python3.5/site-packages/mock/mock.py line 925 in assert_called_with
      raise AssertionError('Expected call: %s\nNot called' % (expected,))
   AssertionError: Expected call: exception('Could not send SLA Miss email notification for DAG %s', 'test_sla_miss')
   Not called
   -------------------- >> begin captured stdout << ---------------------
   [2019-02-01 00:36:45,820] {test_task_view_type_check.py:52} INFO - class_instance type: <class 'unusual_prefix_61c0ab525b75d060ea41c0aa11a98c88efc72c26_test_task_view_type_check.CallableClass'>
   
   --------------------- >> end captured stdout << ----------------------
   -------------------- >> begin captured logging << --------------------
   root: INFO: class_instance type: <class 'unusual_prefix_61c0ab525b75d060ea41c0aa11a98c88efc72c26_test_task_view_type_check.CallableClass'>
   --------------------- >> end captured logging << ---------------------

I'm not sure how to understand this (and am all the more confused because the latest build of apache/master is passing). I'll try rebasing this branch to generate the same tree as the currently failing build but trigger a new test...

This will allow PrestoHook to be used by Operators derived from BaseOperator, which (https://git.io/fhamQ) should perform or trigger certain tasks synchronously (wait for completion).

Notes on the other differences between this and DbApiHook.run:
- no need for utf-8 encoding (https://git.io/vD9LI) b/c PyHive does it automatically (https://git.io/vD9Lm)
- no closing/committing the cursor/conn (https://git.io/vD9Lc), because those are no-ops w/ PyHive (https://git.io/vD9L6, https://git.io/vD9L1)

presto.Cursor does have a _poll_interval attribute, but it has no public accessor, so it seemed safer to make that value a parameter to pass to PrestoHook.run.

codecov-io · 2019-02-01T01:58:45Z

Codecov Report

Merging #2206 into master will increase coverage by 0.03%.
The diff coverage is 95.65%.

@@            Coverage Diff             @@
##           master    #2206      +/-   ##
==========================================
+ Coverage    74.3%   74.34%   +0.03%     
==========================================
  Files         426      426              
  Lines       27867    27888      +21     
==========================================
+ Hits        20706    20732      +26     
+ Misses       7161     7156       -5

Impacted Files	Coverage Δ
airflow/hooks/presto_hook.py	`63.38% <95.65%> (+25.38%)`	⬆️

Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update ee5b8c2...bd1c4d6. Read the comment docs.

patrickmckenna · 2019-02-01T02:16:16Z

airflow/hooks/presto_hook.py

+            cursor.execute(stmt, parameters)
+
+            if poll_interval is not None:
+                while not self.execution_finished(cursor):


It's definitely A Bad Idea™️ to poll indefinitely, but I'm assuming all the timeout logic is expected to live in the operators using this hook. @Fokko please LMK if that assumption's inaccurate, and if there ought to be some minimal safeguards here, e.g. an upper bound on the number of pings to send or time to wait.

Fokko · 2019-02-01

In other operators there are similar constructions that define an upper bound and throw an exception when the maximum time is exceeded: https://github.com/apache/airflow/blob/master/airflow/hooks/druid_hook.py#L90
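An upper bound of the kind discussed here could look roughly like the sketch below. It is a sketch under stated assumptions, not the PR's code and not a copy of the Druid hook: `FakeCursor`, `wait_for_statement`, and the `max_wait` parameter are hypothetical names. The only behavior taken from the diff is that `cursor.poll()` returning `None` is treated as "finished".

```python
import time


class FakeCursor:
    """Hypothetical stand-in for a cursor: poll() returns None once the statement is done."""

    def __init__(self, polls_until_done):
        self._remaining = polls_until_done

    def poll(self):
        if self._remaining <= 0:
            return None
        self._remaining -= 1
        return {"state": "RUNNING"}  # illustrative payload only


def wait_for_statement(cursor, poll_interval, max_wait,
                       sleep=time.sleep, clock=time.monotonic):
    """Block until cursor.poll() returns None, or raise after max_wait seconds."""
    deadline = clock() + max_wait
    while cursor.poll() is not None:
        if clock() >= deadline:
            raise TimeoutError("statement still running after %s seconds" % max_wait)
        sleep(poll_interval)


# Usage: record the requested sleeps instead of actually waiting.
naps = []
wait_for_statement(FakeCursor(3), poll_interval=0.01, max_wait=60, sleep=naps.append)
```

Taking `sleep` and `clock` as parameters is a design choice of the sketch: it keeps the timeout path testable without real delays.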

Fokko · 2019-02-12T09:26:20Z

@patrickmckenna Can you pick up the latest suggestions from the PR?

r39132 · 2019-04-23T02:07:03Z

@patrickmckenna Closing this for lack of activity.

HaloKo4 · 2019-06-20T18:42:33Z

@patrickmckenna is there a chance you will continue to work on that? It's a shame that this amazing work will go to waste. This PR is important: without it we cannot schedule Presto jobs on Airflow, as everything is considered a success and we cannot set dependencies.

patrickmckenna force-pushed the update-presto-hook branch from a106623 to 62edae7 Compare January 24, 2019 20:52

mik-laj reviewed Jan 25, 2019

patrickmckenna added 8 commits January 31, 2019 17:11

add minimal unit tests

d00fa5f

remove unused imports

e14e93e

handle errors while checking execution status

cb1e75d

make names, setup a bit more explicit

1579d66

more better exception handling, str processing

cf98234

Catch only network-related exceptions when polling Presto. And make str handling work in Python 2, too.

consolidate mock setup in setUp

b8e621c

avoid mocking time.sleep directly

bd1c4d6

patrickmckenna force-pushed the update-presto-hook branch from fffe2aa to bd1c4d6 Compare February 1, 2019 01:12

patrickmckenna commented Feb 1, 2019

r39132 closed this Apr 23, 2019

eladkal mentioned this pull request Jul 2, 2019

[AIRFLOW-4879] add poll_interval and schema to PrestoHook #5515

Closed

1 task

potiuk mentioned this pull request Mar 17, 2022

Add description on the vendoring process we use #22204

Merged

9 participants
