Fix Issue #604 in django-celery-beat: Celery Beat Crashing at the End of Daylight Savings #7901

polarmt · 2022-11-11T19:02:49Z

Dependency

Note: The two PRs are codependent and need to be merged together.

Description

These changes are necessary to fix celery/django-celery-beat#604 and are a dependency for celery/django-celery-beat#605. Both have more detailed information about the problem and how the changes were tested.

celery/utils/time.py

auvipy · 2022-11-15T07:52:20Z

can you rebase on top of master again please?

polarmt · 2022-11-15T17:22:47Z

> can you rebase on top of master again please?

Done rebasing.

auvipy · 2022-11-17T13:24:00Z

I am not sure why builds are failing, can you check please? are the tests passing locally?

polarmt · 2022-11-17T20:00:11Z

I am a little confused as to what the behavior should be for Celery beat. Currently, the only unit test that is failing is from https://github.com/celery/celery/blob/master/t/unit/app/test_schedules.py#L441. This unit test was added to deal with #1604.

This is where the confusion lies. At the end of DST, my understanding is that we do want the tasks to run twice. For example, a Celery task would run at both 1:30 PDT and 1:30 PST. From our business perspective, this makes the most sense. However, the problem in #1604 seems to be that they do not want the tasks to run twice.

What should be the expected behavior? Should we add a configuration parameter to control this? If so, what should the default be?

auvipy · 2022-11-29T06:45:01Z

I will come back to this soon

auvipy · 2022-12-14T11:28:43Z

can you please elaborate why a task should run twice?

auvipy · 2023-03-02T10:14:34Z

> I am a little confused as to what the behavior should be for Celery beat. Currently, the only unit test that is failing is from https://github.com/celery/celery/blob/master/t/unit/app/test_schedules.py#L441. This unit test was added to deal with #1604.
>
> This is where the confusion lies. At the end of DST, my understanding is that we do want the tasks to run twice. For example, a Celery task would run at both 1:30 PDT and 1:30 PST. From our business perspective, this makes the most sense. However, the problem in #1604 seems to be that they do not want the tasks to run twice.
>
> What should be the expected behavior? Should we add a configuration parameter to control this? If so, what should the default be?

can you please revisit this? I want to know more of your thoughts

polarmt · 2023-06-30T16:54:25Z

For instance, let us assume that we have a task that we want to run every hour. An example of such an important task is sending a timely email every hour at the 30-minute mark. Then, we would want this task to run at both 1:30 AM PDT and 1:30 AM PST.

In our business, we send such timely emails for many of our features hourly at different minute marks and having one hour of downtime is detrimental.
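To make the repeated hour concrete, here is a minimal sketch using only the standard library (`zoneinfo`, Python 3.9+). The `America/Los_Angeles` zone and the 2022-11-06 transition date are assumptions chosen for illustration, and none of this is Celery code.

```python
from datetime import datetime, timedelta, timezone
from zoneinfo import ZoneInfo

pacific = ZoneInfo("America/Los_Angeles")  # assumed zone, for illustration only

# Four consecutive hourly ticks in UTC that straddle the end of DST.
first_tick = datetime(2022, 11, 6, 7, 30, tzinfo=timezone.utc)
ticks = [first_tick + timedelta(hours=n) for n in range(4)]

# Convert each tick to local wall-clock time plus its zone abbreviation.
labels = [tick.astimezone(pacific).strftime("%H:%M %Z") for tick in ticks]

for label in labels:
    print(label)
```

The wall-clock time 01:30 appears twice, once as PDT and once as PST, so a schedule that fires every real hour has to run at both.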

auvipy · 2023-07-02T07:46:40Z

> For instance, let us assume that we have a task that we want to run every hour. An example of such an important task is sending a timely email every hour at the 30-minute mark. Then, we would want this task to run at both 1:30 AM PDT and 1:30 AM PST.
>
> In our business, we send such timely emails for many of our features hourly at different minute marks and having one hour of downtime is detrimental.

I think running a task multiple times in different timezones should be allowed.

polarmt · 2023-07-03T16:27:24Z

Thanks, @auvipy ! In that case, keep us posted on the progress of the review.

auvipy · 2023-07-06T04:46:56Z

can't review it as the builds are failing and there are merge conflicts. we need those two resolved before moving further.

polarmt · 2023-07-06T18:40:55Z

Can you run the builds again?

auvipy · 2023-07-08T03:37:43Z

restarted, and the builds are failing

auvipy · 2023-11-06T17:16:39Z

can you restart this?

polarmt · 2023-11-07T19:39:43Z

Yes. Yesterday, we discovered a potential bug with this change that was exposed by the latest DST transition. I will summarize it and fix it when I have time.

polarmt · 2023-11-07T21:57:35Z

Root Cause of Bug

In the existing Celery logic, the remaining function converts the timezone of start to the timezone of now (PDT to PST) and then computes the difference between start and now. If we compare 1:15 AM PST and 1:15 AM PDT, the two times should be one hour apart. However, with the unnecessary conversion, the remaining function would return 0 minutes. To fix this issue, we skipped the conversion of start from PDT to PST before computing the expected end time.
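The zero-minute result can be reproduced with plain `datetime` arithmetic. The sketch below is not Celery's code; it only assumes the `America/Los_Angeles` zone and relies on the documented rule that subtracting two aware datetimes sharing the same `tzinfo` ignores their UTC offsets.

```python
from datetime import datetime, timezone
from zoneinfo import ZoneInfo

pacific = ZoneInfo("America/Los_Angeles")  # assumed zone, for illustration only

# The same wall-clock time on either side of the transition.
before = datetime(2022, 11, 6, 1, 15, tzinfo=pacific, fold=0)  # PDT, UTC-7
after = datetime(2022, 11, 6, 1, 15, tzinfo=pacific, fold=1)   # PST, UTC-8

# Same tzinfo on both sides: the subtraction compares wall-clock values only.
wall_clock_gap = after - before

# Converting to UTC first gives the time that actually elapsed.
elapsed = after.astimezone(timezone.utc) - before.astimezone(timezone.utc)

print(wall_clock_gap, elapsed)
```

Forcing both values onto one offset before subtracting loses the hour in the same way, which is why the elapsed time has to be computed from the original offsets.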

This works for periodic tasks that run every X minutes, but the logic seems to be incorrect if the beat task uses a crontab. time.remaining takes in three main parameters: now, ends_in, and start. The ends_in represents when we expect the task to run next. This value can either be a timedelta object or an ffwd object.

An ffwd object does not represent the time difference between two times. Instead, "adding" an ffwd object will replace the current datetime object with the times specified in the ffwd object. For instance, if we add ffwd(hour=7, minute=0, weekday=6) to 1:15 AM on Saturday, the resulting time would become 7:00 AM on the following Sunday.
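A rough way to picture the difference from a timedelta is the hypothetical helper below. `fast_forward` is an illustrative stand-in written from the description above, not Celery's `ffwd` class, and it assumes Python's weekday numbering (Monday is 0, Sunday is 6).

```python
from datetime import datetime, timedelta


def fast_forward(moment, *, hour=None, minute=None, weekday=None):
    """Hypothetical stand-in: overwrite fields, then advance to a weekday."""
    fields = {}
    if hour is not None:
        fields["hour"] = hour
    if minute is not None:
        fields["minute"] = minute
    result = moment.replace(**fields)
    if weekday is not None:
        # Move forward (never backward) to the requested weekday.
        result += timedelta(days=(weekday - result.weekday()) % 7)
    return result


saturday = datetime(2022, 11, 5, 1, 15)  # a Saturday, 1:15 AM

replaced = fast_forward(saturday, hour=7, minute=0, weekday=6)
shifted = saturday + timedelta(hours=7)  # a real duration, for contrast

print(replaced)  # Sunday 7:00 AM
print(shifted)   # Saturday 8:15 AM
```

The first result depends only on the target fields, while the second depends on how much time is added, which is why the two kinds of ends_in behave differently around a DST transition.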

If the Saturday in the example was prior to the transition and the Sunday was afterwards, then we expect the start (1:15 AM) to be in PDT and end_date (7:00 AM) to be PST. Because we skip the conversion of start in the new logic, the end_date will be incorrectly in PDT even though we have transitioned to PST. For ffwd, the changes will not work.

Symptoms of Bug

The first run of the Celery beat task happens an hour earlier than the scheduled time. The task then runs again at the expected time, so it runs one extra time unnecessarily.
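The one-hour shift can be illustrated by pinning the 7:00 AM target to the old offset. In this sketch the stale offset is modelled as a fixed UTC-7 timezone, which is an assumption made for illustration and not the code path Celery takes.

```python
from datetime import datetime, timedelta, timezone
from zoneinfo import ZoneInfo

pacific = ZoneInfo("America/Los_Angeles")         # assumed zone
stale_pdt = timezone(timedelta(hours=-7), "PDT")  # offset that no longer applies

# 7:00 AM on the Sunday after the transition, in the correct zone.
intended = datetime(2022, 11, 6, 7, 0, tzinfo=pacific)

# The same wall-clock fields, but still carrying the pre-transition offset.
shifted = datetime(2022, 11, 6, 7, 0, tzinfo=stale_pdt)

early_by = intended - shifted
local_fire_time = shifted.astimezone(pacific).strftime("%H:%M %Z")

print(early_by, local_fire_time)
```

Under these assumptions the task would fire at 06:00 PST, one hour before the intended 07:00 PST, matching the symptom described above.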

Proposed Fix

We can check whether ends_in is an ffwd object before deciding whether to convert start:

    if str(start.tzinfo) == str(now.tzinfo) and now.utcoffset() != start.utcoffset():
        if now.utcoffset() > start.utcoffset() or isinstance(ends_in, ffwd):
            # DST started, or ends_in is an ffwd (crontab schedule)
            start = start.replace(tzinfo=now.tzinfo)
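To see which cases this condition selects, the check can be pulled into a small standalone function. `should_realign_start` and its `ends_in_is_ffwd` flag are hypothetical names used only here; the flag stands in for `isinstance(ends_in, ffwd)` so that the sketch runs without Celery installed.

```python
from datetime import datetime
from zoneinfo import ZoneInfo


def should_realign_start(start, now, ends_in_is_ffwd):
    """Hypothetical helper mirroring the condition in the snippet above."""
    same_zone = str(start.tzinfo) == str(now.tzinfo)
    if not (same_zone and now.utcoffset() != start.utcoffset()):
        return False
    return now.utcoffset() > start.utcoffset() or ends_in_is_ffwd


pacific = ZoneInfo("America/Los_Angeles")  # assumed zone, for illustration only

# DST ended on 2022-11-06 (UTC-7 to UTC-8) and started on 2022-03-13.
before_end = datetime(2022, 11, 5, 12, 0, tzinfo=pacific)
after_end = datetime(2022, 11, 7, 12, 0, tzinfo=pacific)
before_start = datetime(2022, 3, 12, 12, 0, tzinfo=pacific)
after_start = datetime(2022, 3, 14, 12, 0, tzinfo=pacific)

decisions = {
    "DST end, timedelta": should_realign_start(before_end, after_end, False),
    "DST end, ffwd": should_realign_start(before_end, after_end, True),
    "DST start, timedelta": should_realign_start(before_start, after_start, False),
    "DST start, ffwd": should_realign_start(before_start, after_start, True),
}
print(decisions)
```

Only the combination of DST ending with a timedelta skips the conversion, which is the interval case the original change was meant to protect.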

polarmt · 2023-11-08T23:36:57Z

I think I fixed the issue. We can see whether the CI passes.

celery/utils/time.py

auvipy

The following test fails according to the CI report. We need to fix it to make everything green again.

=================================== FAILURES ===================================
________________________________ test_remaining ________________________________

  def test_remaining():
      # Relative
      remaining(datetime.utcnow(), timedelta(hours=1), relative=True)
  
      """
      The upcoming cases check whether the next run is calculated correctly
      """
      eastern_tz = ZoneInfo("US/Eastern")
      tokyo_tz = ZoneInfo("Asia/Tokyo")
      eastern_tz_pytz = pytz.timezone("US/Eastern")
      tokyo_tz_pytz = pytz.timezone("Asia/Tokyo")
  
      # Case 1: `start` in UTC and `now` in other timezone
      start = datetime.now(ZoneInfo("UTC"))
      now = datetime.now(eastern_tz)
      delta = timedelta(hours=1)
      assert str(start.tzinfo) == str(ZoneInfo("UTC"))
      assert str(now.tzinfo) == str(eastern_tz)
      rem_secs = remaining(start, delta, now).total_seconds()
      # assert remaining time is approximately equal to delta
      assert rem_secs == pytest.approx(delta.total_seconds(), abs=1)
  
      # Case 2: `start` and `now` in different timezones (other than UTC)
      start = datetime.now(eastern_tz)
      now = datetime.now(tokyo_tz)
      delta = timedelta(hours=1)
      assert str(start.tzinfo) == str(eastern_tz)
      assert str(now.tzinfo) == str(tokyo_tz)
      rem_secs = remaining(start, delta, now).total_seconds()
      assert rem_secs == pytest.approx(delta.total_seconds(), abs=1)
  
      """
      Case 3: DST check
      Suppose start (which is last_run_time) is in EST while next_run is in EDT,
      then check whether the `next_run` is actually the time specified in the
      start (i.e. there is not an hour diff due to DST).
      In 2019, DST starts on March 10
      """
      start = datetime(day=9, month=3, year=2019, hour=10, minute=0, tzinfo=eastern_tz)         # EST
      now = datetime(day=11, month=3, year=2019, hour=1, minute=0, tzinfo=eastern_tz)           # EDT
      delta = ffwd(hour=10, year=2019, microsecond=0, minute=0, second=0, day=11, weeks=0, month=3)
      # `next_actual_time` is the next time to run (derived from delta)
      next_actual_time = datetime(
          day=11, month=3, year=2019, hour=10, minute=0, tzinfo=eastern_tz)  # EDT
      assert start.tzname() == "EST"
      assert now.tzname() == "EDT"
      assert next_actual_time.tzname() == "EDT"
      rem_time = remaining(start, delta, now)
      next_run = now + rem_time
      assert next_run == next_actual_time
  
      """
      Case 4: DST check (ZoneInfo, timedelta)
      Suppose start (which is last_run_time) is in PDT while next_run is in PST,
      Check whether there is an hour added to the time between now and the `next_run`
      In 2022, DST ends on Nov 6
      """
      start = datetime(day=6, month=11, year=2022, hour=1, minute=15, tzinfo=eastern_tz, fold=0)
      now = datetime(day=6, month=11, year=2022, hour=1, minute=34, tzinfo=eastern_tz, fold=1)
      ends_in = timedelta(minutes=80)
      next_actual_time = datetime(day=6, month=11, year=2022, hour=1, minute=35, tzinfo=eastern_tz, fold=1)
      assert start.tzname() == "EDT"
      assert now.tzname() == "EST"
      assert next_actual_time.tzname() == "EST"
      rem_time = remaining(start, ends_in, now)
      print(start + ends_in - now)
      next_run = now + rem_time
>     assert next_run == next_actual_time
E     AssertionError: assert datetime.datetime(2022, 11, 6, 2, 35, tzinfo=ZoneInfo(key='US/Eastern')) == datetime.datetime(2022, 11, 6, 1, 35, tzinfo=ZoneInfo(key='US/Eastern'), fold=1)

t/unit/utils/test_time.py:193: AssertionError

t/unit/utils/test_time.py


codecov · 2023-11-09T17:45:11Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Comparison is base (7a27725) 87.24% compared to head (f079f86) 87.33%.

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #7901      +/-   ##
==========================================
+ Coverage   87.24%   87.33%   +0.09%     
==========================================
  Files         148      148              
  Lines       18637    18526     -111     
  Branches     3199     3167      -32     
==========================================
- Hits        16260    16180      -80     
+ Misses       2080     2060      -20     
+ Partials      297      286      -11

| Flag | Coverage Δ |
|---|---|
| unittests | `87.30% <100.00%> (+0.09%)` ⬆️ |


auvipy · 2023-11-09T17:55:57Z

It seems unit tests are now passing. I restarted the failing pypy build as it seems to be a network issue

auvipy

You need to check/fix the lint issues as well. Thanks for all your efforts here! I will come back to this tomorrow morning to check whether it is mergeable for the 5.3.5 release.

auvipy · 2023-11-09T18:07:46Z

Integration tests are now passing

auvipy · 2023-11-09T18:10:11Z

t/unit/utils/test_time.py:136:5: F841 local variable 'tokyo_tz_pytz' is assigned to but never used
t/unit/utils/test_time.py:193:1: W293 blank line contains whitespace
t/unit/utils/test_time.py:223:118: E501 line too long (118 > 117 characters)
t/unit/utils/test_time.py:230:1: W293 blank line contains whitespace

auvipy · 2023-11-10T20:17:48Z

Will try to push lint fixes

auvipy · 2023-11-12T16:39:54Z

I might consider this for 5.3.6 if it does not break backward compatibility

auvipy · 2023-11-13T13:41:02Z

@Nusnus can we consider this for a patch release?

Nusnus · 2023-11-21T19:17:09Z

@auvipy

> @Nusnus can we consider this for a patch release?

I haven't fully reviewed the entire PR, but from the original description:

> celery/django-celery-beat#605
>
> Note: The two PRs are codependent and need to be merged together.

It looks like the other PR isn't ready, besides this PR having failures in CI.

That being said, if this work can be baked enough, I support merging it, but we'll target v5.4 if it is ready by then.

auvipy · 2023-11-21T19:18:33Z

Yeah it's better off for 5.4

cclauss · 2023-12-14T08:47:48Z

celery/utils/time.py

@@ -218,12 +218,15 @@ def remaining(
        ~datetime.timedelta: Remaining time.
    """
    now = now or datetime.utcnow()


`datetime.utcnow()` is deprecated as of Python 3.12; see https://docs.python.org/3/library/datetime.html#datetime.datetime.utcnow
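The replacement the Python documentation recommends is an aware timestamp from `datetime.now(timezone.utc)`. The sketch below shows it in isolation; whether `remaining` should default to an aware or a naive value is not settled in this thread, so the naive variant is included only as an assumption about what callers might expect.

```python
from datetime import datetime, timezone

# Aware "now" in UTC, the documented replacement for datetime.utcnow().
now = datetime.now(timezone.utc)

# If surrounding code still expects a naive UTC value, the tzinfo can be
# dropped explicitly instead of relying on the deprecated call.
naive_equivalent = now.replace(tzinfo=None)

print(now.isoformat())
print(naive_equivalent.isoformat())
```

Mixing the two forms in one subtraction raises `TypeError`, so a change like this needs every caller of the default to agree on one of them.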

polarmt changed the title ~~Remove logic to convert one timestamp to be the same as other~~ Fix Issue #604 in django-celery-beat: Celery Beat Crashing at the End of Daylight Savings Nov 11, 2022

This was referenced Nov 11, 2022

Fix Issue #604: Celery Beat Crashing at the End of Daylight Savings celery/django-celery-beat#605

Open

Celery Beat Crashing at the End of Daylight Savings celery/django-celery-beat#604

Open

auvipy requested changes Nov 12, 2022

celery/utils/time.py (outdated)

polarmt force-pushed the master branch from 0927350 to 9873bc1 Compare November 14, 2022 18:28

auvipy added this to the 5.3 milestone Nov 15, 2022

auvipy added Component: Celerybeat PR Type: Bugfix labels Nov 15, 2022

polarmt force-pushed the master branch from 9f41875 to 9f5400b Compare November 15, 2022 17:21

polarmt force-pushed the master branch 2 times, most recently from 36c6fc3 to 405ddfd Compare November 17, 2022 19:46

auvipy requested a review from a team November 18, 2022 05:11

Nusnus removed this from the 5.3 milestone Feb 19, 2023

polarmt force-pushed the master branch from 405ddfd to 3cac9f7 Compare July 6, 2023 16:54

Andrew Yoo added 5 commits November 8, 2023 09:21

update timedelta to minutes

3115d7f

Rebased

f9f9e83

remove unnecessary files

1b99fbb

Add one more blank space

0656799

fix crontab issue

b5f8fa7

polarmt force-pushed the master branch from 21ddbee to b5f8fa7 Compare November 8, 2023 17:23

create separate cases for pytz and ZoneInfo

6b83bb4

auvipy requested changes Nov 9, 2023

celery/utils/time.py (outdated)

auvipy requested changes Nov 9, 2023

t/unit/utils/test_time.py (outdated)

Remove print

9ae143a

polarmt force-pushed the master branch from 7f38def to 9ae143a Compare November 9, 2023 17:40

[pre-commit.ci] auto fixes from pre-commit.com hooks

5eba574

for more information, see https://pre-commit.ci

auvipy reviewed Nov 9, 2023

auvipy requested a review from a team November 9, 2023 18:10

auvipy mentioned this pull request Nov 10, 2023

added 2 debian package for better stability in Docker #8629

Merged

auvipy mentioned this pull request Nov 12, 2023

Celery beat stops sending tasks on DST changes #6438

Open

18 tasks

auvipy added this to the 5.4 milestone Nov 12, 2023

cclauss reviewed Dec 14, 2023

Merge branch 'main' into master

f079f86
