Conversation
I think this one should finally fix the flaky test. The issue was that in pytest, parsing a test might happen a long time before executing it, depending on how many tests run in between and how slow the test run is. The original test calculated the reference time at parsing time and assumed that one hour ahead (with minutes zeroed) was enough for the "deferral" to happen. But if parsing (and thus the reference time calculation) happened before the end of an hour and execution happened after that hour ended, then the +1 hour from the beginning of the reference hour had already passed. The fix is to move the reference time calculation inside the test, so that it is not computed long before the test runs. Test runs on public runners are generally slower "per container", which is why this happened more frequently there.
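For illustration, here is a minimal, hypothetical sketch of the pattern (not the actual Airflow test; the names `next_full_hour`, `DEFER_UNTIL`, and the test functions are made up for this example). The flaky variant computes the reference time at module level, i.e. when pytest collects the test, while the fixed variant computes it inside the test body, so the collection-to-execution delay can no longer push it past an hour boundary:

```python
from datetime import datetime, timedelta, timezone


def next_full_hour(now: datetime) -> datetime:
    """Start of the hour after `now`, with minutes/seconds zeroed."""
    return (now + timedelta(hours=1)).replace(minute=0, second=0, microsecond=0)


# Flaky: evaluated at import/collection time. If enough slow tests run before
# this one and the clock crosses into the next hour, DEFER_UNTIL is already
# in the past by the time the test finally executes.
DEFER_UNTIL = next_full_hour(datetime.now(timezone.utc))


def test_defer_time_flaky():
    assert DEFER_UNTIL > datetime.now(timezone.utc)


def test_defer_time_fixed():
    # Fixed: compute the reference time right before using it, so it is
    # always ahead of "now" regardless of how long collection took.
    defer_until = next_full_hour(datetime.now(timezone.utc))
    assert defer_until > datetime.now(timezone.utc)
```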
I believe I finally found out why the async parse test was flaky @hussein-awala ?
hussein-awala left a comment
I had this hypothesis yesterday, but the date looked good in the failed CI job. What you describe completes my analysis, though.
Yeah. The tests in "Other" run for ~11 minutes and this one sat somewhere around the 90% mark, so the chances of hitting it were quite significant. Plus, when it happened, it happened in most of the tests, because they were starting around the same time. And it seems that for the nightly scheduled build it happened every time, because the workflow starts at 0:40 and it takes about 10 minutes from the start of the CI workflow to start the tests. 🤯
And I think this is what confused me in my initial attempt. But in this case it was the TIME that shifted 😱