Emit "logs not found" message when ES logs appear to be missing #21261

dstandish · 2022-02-01T23:07:07Z

Current ES log handler will wait up to 5 minutes for logs to appear (or for more logs to appear since last log message was emitted). This produces undesirable behavior when the log message has been deleted from the elasticsearch cluster. A user may wait a long time thinking that the logs are coming when they are not.

To resolve this, if no logs whatsoever have been retrieved after 5 seconds of trying, we give up and emit a "logs not found" message.

If the task has only just started, this may be a "false negative", and we guide the user to refresh if they think that might be the case.

airflow/providers/elasticsearch/log/es_task_handler.py

tests/providers/elasticsearch/log/test_es_task_handler.py

airflow/providers/elasticsearch/log/es_task_handler.py

uranusjr · 2022-02-03T04:26:50Z

Logic lgtm.

Current ES log handler will wait up to 5 minutes for logs to appear (or for _more_ logs to appear since last log message was emitted). This produces undesirable behavior when the log message has been deleted from the elasticsearch cluster. A user may wait a long time thinking that the logs are coming when they are not. To resolve this, if no logs whatsoever have been retrieved after 5 seconds of trying, we give up and emit a "logs not found" message. If the task has only just started, this may be a "false negative", and we guide the user to refresh if they think that might be the case.

Co-authored-by: Jed Cunningham <66968678+jedcunningham@users.noreply.github.com>

uranusjr · 2022-02-03T07:46:10Z

tests/providers/elasticsearch/log/test_es_task_handler.py

+def get_ti(dag_id, task_id, execution_date, create_task_instance):
+    ti = create_task_instance(
+        dag_id=dag_id,
+        task_id=task_id,
+        execution_date=execution_date,
+        dagrun_state=DagRunState.RUNNING,
+        state=TaskInstanceState.RUNNING,
+    )
+    ti.try_number = 1
+    ti.raw = False
+    return ti


How about turning this into a fixture?

@pytest.fixture() def create_running_task_instance(create_task_instance): def _create_ti(**kwargs): ti = create_task_instance( dagrun_state=DagRunState.RUNNING, state=TaskInstanceState.RUNNING, **kwargs, ) ti.try_number = 1 ti.raw = False return ti return _create_ti

it was a fixture but i pulled it out so i could make a TI with diff params...

but i guess you can parametirize a fixture like so?

You can but it shouldn’t be generally be needed, it’s easier to make the fixture return a function that takes arguments instead (like this one here). I searched your changes and this implementation seems to be good enough for the usages in this PR. You’d do

@pytest.fixture() def ti(self, create_running_task_instance): yield create_running_task_instance( dag_id=self.DAG_ID, task_id=self.TASK_ID, execution_date=self.EXECUTION_DATE, ) clear_db_runs() clear_db_dags()

i'm confused @uranusjr
i'm not seeing how this fixture helps me.
there is already a fixture here like this now. i just pulled out a portion of it (and still use it in the existing fixture) but i just want to be able to specify a different execution date in my specific test than the one used by the fixture.

if you are saying i should just change the fixture so that it returns a create_ti(execution_date) funciton and update all the other tests call that function (insstead of just using a returned TI) then i can do -- lemme know

github-actions · 2022-02-07T01:56:13Z

The PR is likely OK to be merged with just subset of tests for default Python and Database versions without running the full matrix of tests, because it does not modify the core of Airflow. If the committers decide that the full tests matrix is needed, they will add the label 'full tests needed'. Then you should rebase to the latest main or amend the last commit of the PR, and push it with --force-with-lease.

dstandish requested a review from jedcunningham February 1, 2022 23:07

boring-cyborg bot added area:logging area:providers labels Feb 1, 2022

dstandish force-pushed the missing-es-logs branch from 31f79e7 to b0f1a4b Compare February 1, 2022 23:07

jedcunningham reviewed Feb 1, 2022

View reviewed changes

airflow/providers/elasticsearch/log/es_task_handler.py Outdated Show resolved Hide resolved

airflow/providers/elasticsearch/log/es_task_handler.py Outdated Show resolved Hide resolved

tests/providers/elasticsearch/log/test_es_task_handler.py Outdated Show resolved Hide resolved

jedcunningham requested a review from ashb February 1, 2022 23:19

dstandish force-pushed the missing-es-logs branch from 3e422bd to 8b6545f Compare February 1, 2022 23:51

uranusjr reviewed Feb 3, 2022

View reviewed changes

airflow/providers/elasticsearch/log/es_task_handler.py Outdated Show resolved Hide resolved

dstandish and others added 6 commits February 2, 2022 23:40

Apply suggestions from code review

17a3137

Co-authored-by: Jed Cunningham <66968678+jedcunningham@users.noreply.github.com>

fixup! Apply suggestions from code review

bc1285e

fixup! Apply suggestions from code review

c90c537

Update es_task_handler.py

6aadc0f

fix readabilitya

075f220

dstandish force-pushed the missing-es-logs branch from 124cadd to 075f220 Compare February 3, 2022 07:40

uranusjr reviewed Feb 3, 2022

View reviewed changes

fixup! fix readabilitya

cf8d5df

jedcunningham approved these changes Feb 7, 2022

View reviewed changes

github-actions bot added the okay to merge It's ok to merge this PR as it does not require more tests label Feb 7, 2022

potiuk approved these changes Feb 7, 2022

View reviewed changes

potiuk merged commit 6184fac into apache:main Feb 7, 2022

dstandish mentioned this pull request Feb 9, 2022

Status of testing Providers that were prepared on February 09, 2022 #21443

Closed

74 tasks

potiuk mentioned this pull request Feb 14, 2022

Status of testing Providers that were prepared on February 14, 2022 #21557

Closed

23 tasks

ashb deleted the missing-es-logs branch February 15, 2022 17:47

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Emit "logs not found" message when ES logs appear to be missing #21261

Emit "logs not found" message when ES logs appear to be missing #21261

dstandish commented Feb 1, 2022

uranusjr commented Feb 3, 2022

uranusjr Feb 3, 2022

dstandish Feb 3, 2022

uranusjr Feb 3, 2022

dstandish Feb 3, 2022

dstandish Feb 3, 2022

github-actions bot commented Feb 7, 2022

Emit "logs not found" message when ES logs appear to be missing #21261

Emit "logs not found" message when ES logs appear to be missing #21261

Conversation

dstandish commented Feb 1, 2022

uranusjr commented Feb 3, 2022

uranusjr Feb 3, 2022

Choose a reason for hiding this comment

dstandish Feb 3, 2022

Choose a reason for hiding this comment

uranusjr Feb 3, 2022

Choose a reason for hiding this comment

dstandish Feb 3, 2022

Choose a reason for hiding this comment

dstandish Feb 3, 2022

Choose a reason for hiding this comment

github-actions bot commented Feb 7, 2022