Local Retries + OneTimeSchedule #680

cicdw · 2019-02-24T01:03:22Z

Thanks for contributing to Prefect!

Please describe your work and make sure your PR:

adds new tests (if appropriate)
updates CHANGELOG.md (if appropriate)
updates docstrings for any new functions or function arguments, including docs/outline.toml for API reference docs (if appropriate)

What does this PR change?

This PR adds two new features (which are related in my mind):

the ability for tasks to Retry when executing locally with flow.run(on_schedule=True)
a OneTimeSchedule for one-off, scheduled execution

I imagine the OneTimeSchedule to mainly be used during local hacking, for testing retries, etc. (since on_schedule=True requires the Flow to actually have a schedule)

Why is this PR important?

The ability for tasks to Retry in local execution is important for open source users, as well as Cloud users who are hacking locally on their Flows and want to test Retries. This doesn't bring Core into feature parity with Core + Cloud, but it does make Core immensely more useful as a standalone tool, which is still important.

jlowin

👍 love the functionality this adds, have a few comments about simplifying the implementation just to reduce complexity.

jlowin · 2019-02-24T19:02:48Z

src/prefect/core/flow.py

+
+        while True:  # run indefinitely
+            end = self.schedule.next(1)[0]
+            while True:


Feels like there's some complexity in this and the subsequent white/sleep blocks -- why not just sleep until the next schedule time, or the earliest task time?

I've read things that suggest time.sleep isn't very accurate and can easily end too soon / too late depending on other system activity, so this implementation let me not worry too much about that.

Happy to implement this as a utility long_sleep or something if you'd prefer

Gotcha. I'm tempted to say that in the name of simplicity: just time.sleep() until the appointed time, and repeat the sleep if we wake up too early or just go on if we wake up too late. I think guarantees are generally much looser in this method since it's for primarily for local testing

You know, an alternate implementation could begin the Flow with a Scheduled state, and our hooks would prevent it from running prior to the set start_time. That might simplify things quite a bit

Good feedback --> I updated the implementation; should be much simpler to reason about now.

jlowin · 2019-02-24T19:04:51Z

src/prefect/core/flow.py

+
+        kwargs["return_tasks"] = self.tasks
+
+        while True:  # run indefinitely


It feels like the break condition for this while loop is the same as that of the while loop on 876. I think you might be able to remove one loop entirely by creating the flow_state before entering the while loop, and then just saying while not flow_state.is_finished().

That plus the adjustment in my previous comment would go from (3 while loops that depend on a break, and one that has a terminal condition) to just (a single while loop with a terminal condition).

No, the break for the loop on 876 is that a single flow run has complete; the break condition for this while loop is that there are no more flow runs left to schedule.

Gotcha -- I didn't realize. Would you mind just expanding the comment a little bit? I thought "run indefinitely" implicitly mean "...until the break below stops it".

Yup, will do

Updated implementation, with a few comments. Should be much simpler now.

jlowin · 2019-02-24T19:06:11Z

src/prefect/schedules.py

+            raise TypeError("`start_date` must be a datetime.")
+        super().__init__(start_date=start_date, end_date=start_date)
+
+    def next(self, n: int, after: datetime = None) -> List[datetime]:


I see the use case for a one-time schedule! However I think you could get to the same place with less code by subclassing IntervalSchedule and setting end_date = start_date in your init, with any interval (say, one day). You wouldn't need a next method since the IntervalSchedule logic would do the right thing.

Ah interesting, I'll give that a shot!

jlowin · 2019-02-24T19:07:01Z

tests/test_schedules.py

+        s = schedules.OneTimeSchedule(start_date=start_date)
+        assert s.next(0) == []
+
+    def test_onetime_schedule_n_equals_0(self):


Small thing -- this test has the same name as the previous test, so it shadows it.

jlowin · 2019-02-24T21:52:17Z

Cool! LGTM -- I don't have a strong opinion on the while loop it just "feels" complicated and potentially an issue (also for some reason the potential - though impossible reality - of moving closer to the target by 50% and never reaching it is scary)

jlowin

This is great! Thanks for simplifying -- I think it will save future headache without compromising functionality.

Collect task run inputs to subflows

cicdw added 4 commits February 23, 2019 16:29

Implement retries in local execution

e112b08

Implementation of a OneTimeSchedule

ccffcde

Add serializer for OneTimeSchedule

026a3fe

Update Changelog

ef850ae

cicdw requested review from jlowin and joshmeek as code owners February 24, 2019 01:03

cicdw mentioned this pull request Feb 24, 2019

Improve ShellTask #681

Merged

3 tasks

jlowin requested changes Feb 24, 2019

View reviewed changes

Simplify implementation of OneTimeSchedule per feedback

964150b

Simplify implementation of local scheduled execution

249cef4

jlowin approved these changes Feb 24, 2019

View reviewed changes

cicdw merged commit b68c539 into master Feb 24, 2019

cicdw deleted the local-retries branch February 24, 2019 22:13

cicdw added a commit that referenced this pull request Dec 9, 2021

Merge pull request #680 from PrefectHQ/subflow-task-inputs

3f6ce15

Collect task run inputs to subflows

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Local Retries + OneTimeSchedule #680

Local Retries + OneTimeSchedule #680

cicdw commented Feb 24, 2019

jlowin left a comment

jlowin Feb 24, 2019 •

edited

Loading

cicdw Feb 24, 2019

cicdw Feb 24, 2019

jlowin Feb 24, 2019

cicdw Feb 24, 2019 •

edited

Loading

cicdw Feb 24, 2019

jlowin Feb 24, 2019

cicdw Feb 24, 2019

jlowin Feb 24, 2019

cicdw Feb 24, 2019

cicdw Feb 24, 2019

jlowin Feb 24, 2019

cicdw Feb 24, 2019

cicdw Feb 24, 2019

jlowin Feb 24, 2019

cicdw Feb 24, 2019

cicdw Feb 24, 2019

jlowin commented Feb 24, 2019

jlowin left a comment


		kwargs["return_tasks"] = self.tasks

		while True: # run indefinitely

Local Retries + OneTimeSchedule #680

Local Retries + OneTimeSchedule #680

Conversation

cicdw commented Feb 24, 2019

What does this PR change?

Why is this PR important?

jlowin left a comment

Choose a reason for hiding this comment

jlowin Feb 24, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

cicdw Feb 24, 2019 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

jlowin commented Feb 24, 2019

jlowin left a comment

Choose a reason for hiding this comment

jlowin Feb 24, 2019 •

edited

Loading

cicdw Feb 24, 2019 •

edited

Loading