
Enhance Transient Error handling in Async.start() method #1

Merged (6 commits), Jan 22, 2015

Conversation

@markshaule-wf (Owner)

@beaulyddon-wf @robertkluin-wf @johnlockwood-wf @tannermiller-wf @rosshendrickson-wf @jasonaguilon-wf @tylertreat-wf

FYI: @erikpetersen-wf @macleodbroad-wf

Async Changes:
Async.start() now sleeps before attempting to re-add the task on a TransientError.
Added an option, retry_transient_errors, to override the retry behaviour in Async.start(). Pass False to simply re-raise the TransientError and not attempt a retry.

Context Changes:
Context _insert_tasks now re-raises TransientError if the retry option has been set to False.
Renamed the parameter 'retry_errors' to 'retry_transient_errors' in the _insert_tasks function.
_insert_tasks now passes the retry_transient_errors parameter on to recursive calls correctly.

Notes on compatibility:
Since I've renamed the parameter for _insert_tasks, this could potentially break someone's custom implementation of that function.
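
For reviewers skimming, here's a rough usage sketch of the behaviour described above. The retry_transient_errors name comes from this PR; the task target and the exact way the option is passed to start() are my assumptions:

```python
from furious.async import Async


def my_task():
    pass  # hypothetical task target


# Default: on TransientError, start() sleeps and then re-adds the task once.
Async(target=my_task).start()

# Opt out: re-raise the TransientError immediately, no retry attempt.
Async(target=my_task).start(retry_transient_errors=False)
```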

…o retry_transient_errors False.

Renamed parameter in _insert_tasks 'retry_errors' to 'retry_transient_errors' for clarity.
Pass retry_transient_errors parameter onto recursive calls in _insert_tasks when we are splitting and retrying errors.
… to re-add a task in Async.start() method.

Add option to override retry of TransientErrors in the Async.start() method
@beaulyddon-wf

+1

if not retry_transient:
    raise

import time


Personally I hate inline imports

@markshaule-wf (Owner, Author)


Yes - me too, but that is the pattern in this file unfortunately.


I'm not sure why with a stdlib one. I would have no issue with this moving to the top of the file. And I agree in general with inline imports; I understand in libs like this, where often things in the same file will not execute in the same context. But meh.

@markshaule-wf (Owner, Author)


OK - I'll move the inline time imports in this file up top. We've still got some other inline stdlib imports in this file, but I'll restrict my changes to the imports I've added.

@macleodbroad-wf

+1

@tannermiller-wf

+1

@tylertreat-wf

+1

@johnlockwood-wf

Adding a 4-second sleep to something called Async seems wrong.

@johnlockwood-wf

What would happen with the existing method when a TransientError popped?

@markshaule-wf (Owner, Author)

@johnlockwood-wf - With the existing method, Async.start() would immediately attempt a retry on a TransientError. (Which, from my understanding, is not what we want; in the past it was recommended to me to have a delay before the retry.)

For the context _insert_tasks() method raising a TransientError, I suspect one would be able to detect the failure of the insert using the insert_failed @property. I also suspect that most users don't actually check that, and the insert would just silently fail.
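
For illustration, a sketch of what checking that property could look like. context.new() and ctx.add() follow furious's Context API; the assumption here is that insert_failed is truthy when any insert failed:

```python
import logging

from furious import context


def my_task(n):
    pass  # hypothetical task target


with context.new() as ctx:
    for i in range(10):
        ctx.add(target=my_task, args=[i])

# Inserts happen when the context exits; without this check, a
# TransientError during insert would go unnoticed.
if ctx.insert_failed:
    logging.warning("Some task inserts failed for this context.")
```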

@markshaule-wf (Owner, Author)

Since we've had a few suggestions from @macleodbroad-wf, @tylertreat-wf and @johnlockwood-wf about the delay, let me add a new option, retry_delay, that will allow the caller to override the default 4-second delay on the retry.
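
Continuing the sketch from the description above, the caller-facing knobs would then look something like this; the retry_delay keyword and the call shape are assumptions based on this thread:

```python
from furious.async import Async


def my_task():
    pass  # hypothetical task target


# Default: sleep ~4 seconds before the single retry on a TransientError.
Async(target=my_task).start()

# Caller override: a shorter 1-second delay before the retry.
Async(target=my_task).start(retry_delay=1)
```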

@tylertreat-wf

+1

@johnlockwood-wf

@markshaule-wf do you have any empirical evidence that 4 seconds is a good magical number?

@johnlockwood-wf

I see recommendations to use a backoff, where it retries after a very short time, then if that fails, retry after a little longer, etc.
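
For reference, the backoff pattern described here might look roughly like the sketch below. This is not what the PR implements (the PR does a single delayed retry); insert_with_backoff is a hypothetical helper around App Engine's taskqueue.TransientError:

```python
import time

from google.appengine.api.taskqueue import TransientError


def insert_with_backoff(insert, attempts=4, initial_delay=0.5, factor=2):
    """Call insert(); on TransientError wait 0.5s, 1s, 2s between tries."""
    delay = initial_delay
    for attempt in range(attempts):
        try:
            return insert()
        except TransientError:
            if attempt == attempts - 1:
                raise  # out of attempts; surface the error
            time.sleep(delay)
            delay *= factor
```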

@markshaule-wf (Owner, Author)

No empirical evidence that 4 seconds is a good magical number. At the time of our discussion, 'around 5 seconds' was recommended by @beaulyddon-wf, which was the Google recommendation. We've got the ability to override the value now, so callers who want a different value can use one.

We only retry once; if a TransientError (or other error) occurs on the retry, the exception is raised.

@beaulyddon-wf

From Google: "Ideally you would not immediately retry, as it can cause more issues" is the general statement. However, that's mostly to keep people from killing them. They recognize that's not ideal for critical-path operations, so for those they say a single immediate retry is OK. After that, however, they recommend a backoff of at least 5 seconds, as these types of failures/errors/contention tend to clean themselves up after a couple of seconds.

@beaulyddon-wf

+1

@tylertreat-wf

+1

@tannermiller-wf

+1

@macleodbroad-wf

+1

@johnlockwood-wf

+1

…ansactional=True has been specified in context, or Async.start()
@markshaule-wf (Owner, Author)

@beaulyddon-wf @tannermiller-wf @macleodbroad-wf @tylertreat-wf - The latest commit will now always re-raise a TransientError when transactional=True, regardless of the new option.
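
As I read that change, the guard behaves something like this sketch; the insert_task helper below is hypothetical and only mirrors the described behaviour, not the PR's actual _insert_tasks code:

```python
import time

from google.appengine.api import taskqueue


def insert_task(task, queue_name, transactional=False,
                retry_transient_errors=True, retry_delay=4):
    queue = taskqueue.Queue(queue_name)
    try:
        queue.add(task, transactional=transactional)
    except taskqueue.TransientError:
        # Transactional inserts always re-raise: retrying outside the
        # original transaction could violate its guarantees.
        if transactional or not retry_transient_errors:
            raise
        time.sleep(retry_delay)
        queue.add(task, transactional=transactional)
```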

@beaulyddon-wf

+1

@tannermiller-wf

+1

@tylertreat-wf

+1

@macleodbroad-wf

+1

@rosshendrickson-wf

+1
