Make the timezone aware triggers resilient to delays #86

PeterJCLaw · 2019-08-30T16:14:12Z

The cron processing includes doing the work of each scheduled job inline at the point they detect they need to run. This works well for the jobs where the scheduling is entirely handled by the scheduler, because they set a next time to run and are then run at some point after that time occurs.

However for the timezone aware triggers we're somewhat abusing that mechanism to do our own timezone awareness filtering. As a result, our reliance on being run at exactly (within a minute of)
the right time meant that the triggers were missing a significant proportion of the actual times at which they should have run their processing.

This change makes the triggers stateful -- aware of the last real time at which they ran and thus able to account for those delays. They now check all the minutes between when they last ran and the current time, thus ensuring that delays still result in the trigger running.

There is a side effect of this approach -- that delays may cause a trigger to only run once where it might ideally run several times. This is a worthwhile trade-off for now and is certainly better than not running at all.

The cron processing includes doing the work of each scheduled job inline at the point they detect they need to run. This works well for the jobs where the scheduling is entirely handled by the scheduler, because they set a next time to run and are then run at some point after that time occurs. However for the timezone aware triggers we're somewhat abusing that mechanism to do our own timezone awareness filtering. As a result, our reliance on being run at exactly (within a minute of) the right time meant that the triggers were missing a significant proportion of the actual times at which they should have run their processing. This change makes the triggers stateful -- aware of the last real time at which they ran and thus able to account for those delays. They now check all the minutes between when they last ran and the current time, thus ensuring that delays still result in the trigger running. There is a side effect of this approach -- that delays may cause a trigger to only run once where it might ideally run several times. This is a worthwhile trade-off for now and is certainly better than not running at all.

coveralls · 2019-08-30T16:21:19Z

Coverage increased (+0.003%) to 99.457% when pulling 2492f58 on fix-timezone-aware-trigger-delay-handling into 490a9b6 on master.

routemaster/cron_processors.py

Use a timezone aware range comparison rather than evaluating every minute within the range. This should be faster while still being correct.

This copies the same time-range approach used to optimise the TimezoneAwareProcessor into the MetadataTimezoneAwareProcessor. The performance increase here is less stark (I suspect because iterating over all the timezones dominates), however this is likely to still be an improvement. In any case it's beneficial to have both processors working in the same way.

routemaster/tests/test_cron_processors.py

It's not really possible for the processor to run twice at the same instant, so change the tests not to do that.

Internal errors would break the overall cron processing, so we want to avoid that.

PeterJCLaw added the Work in Progress label Aug 30, 2019

danpalmer approved these changes Sep 2, 2019

View reviewed changes

routemaster/cron_processors.py Outdated Show resolved Hide resolved

PeterJCLaw added 4 commits September 2, 2019 17:40

Optimise the TimezoneAwareProcessor

bade457

Use a timezone aware range comparison rather than evaluating every minute within the range. This should be faster while still being correct.

Clarify docstring

bd5c7b5

Remove now redundant timezone instants util

4613bfe

PeterJCLaw requested a review from danpalmer September 2, 2019 16:58

PeterJCLaw removed the Work in Progress label Sep 2, 2019

danpalmer approved these changes Sep 2, 2019

View reviewed changes

routemaster/tests/test_cron_processors.py Show resolved Hide resolved

PeterJCLaw added 3 commits September 3, 2019 10:31

Fix the logic of these tests

e8cf2b4

It's not really possible for the processor to run twice at the same instant, so change the tests not to do that.

Clarify rejection of invalid range inputs

111a634

Ensure the cron processors don't bubble internal errors

2492f58

Internal errors would break the overall cron processing, so we want to avoid that.

PeterJCLaw merged commit 2492f58 into master Sep 3, 2019

PeterJCLaw deleted the fix-timezone-aware-trigger-delay-handling branch September 3, 2019 09:59

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make the timezone aware triggers resilient to delays #86

Make the timezone aware triggers resilient to delays #86

PeterJCLaw commented Aug 30, 2019

coveralls commented Aug 30, 2019 •

edited

Make the timezone aware triggers resilient to delays #86

Make the timezone aware triggers resilient to delays #86

Conversation

PeterJCLaw commented Aug 30, 2019

coveralls commented Aug 30, 2019 • edited

coveralls commented Aug 30, 2019 •

edited