Dynamic delay #111

Tembrel · 2017-09-30T21:37:27Z

This PR is in response to #110. It adds a delay function property to RetryPolicy that is used, if set, to compute the next delay from the previous result or exception.

There are some awkward bits here:

I used net.jodah.failsafe.util.Duration in the signature of the delay function, though I really wanted to use java.time.Duration.
Combining a delay factor other than 1 with a delay function would be meaningless, but I've done nothing to make them mutually exclusive.
The one included test is pretty crude: it passes if the actual delay is within a window of the requested delay.

Wrap checked exceptions as unchecked in applying delay function.

coveralls · 2017-09-30T21:39:11Z

Coverage decreased (-0.2%) to 84.035% when pulling 4f3b60d on Tembrel:dynamic-delay into 031f362 on jhalterman:master.

coveralls · 2017-09-30T22:20:28Z

Coverage increased (+0.1%) to 84.298% when pulling 05180e1 on Tembrel:dynamic-delay into 031f362 on jhalterman:master.

whiskeysierra · 2017-10-01T22:11:24Z

src/main/java/net/jodah/failsafe/AbstractExecution.java

+            if (dynamicDelay != null && dynamicDelay.toNanos() >= 0)
+                delayNanos = dynamicDelay.toNanos();
+        } catch (Exception ex) {
+            if (ex instanceof RuntimeException)


Could use a separate catch clause to get rid of the instanceof check:

} catch (RuntimeException e) {
    throw e;
} catch (Exception e) {
    throw new RuntimeException("..", e);
}

whiskeysierra · 2017-10-01T22:13:18Z

src/main/java/net/jodah/failsafe/RetryPolicy.java

+   * Returns the function that determines the next delay given
+   * a failed attempt with the given {@link Throwable}.
+   */
+  public CheckedBiFunction<Object, Throwable, Duration> getDelayFunction() {


Have you considered giving this function its own interface? E.g.

@FunctionalInterface
public interface DelayFunction {
    @Nullable
    Duration calculateDelay(@Nullable Object result, @Nullable Throwable exception);
}

Is the idea here that DelayFunction is more friendly for Java 6/7 users who can't use lambdas than writing CheckedBiFunction<Object, Throwable, Duration>?

Yes, that's a good reason, but also: I can't see a compelling need for a delay function to be able to throw a checked exception. I only initially used CheckedBiFunction because there was no BiFunction in net.jodah.failsafe.function. When @whiskeysierra pointed out that rolling a separate @FunctionalInterface would give us an unchecked signature, along with the benefits of a more specific method name and a place to put documentation, I embraced that right away.
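To make the trade-off concrete, here is a minimal sketch of what such a dedicated interface could look like and how both lambda and pre-lambda callers would use it. It assumes java.time.Duration purely to stay self-contained (the PR itself uses net.jodah.failsafe.util.Duration), and DelayFunctionDemo is a hypothetical holder class, not part of the PR.

```java
import java.time.Duration;

// Sketch of a dedicated delay-function interface with an unchecked signature.
// Assumption: java.time.Duration stands in for net.jodah.failsafe.util.Duration.
@FunctionalInterface
interface DelayFunction {
  /**
   * Returns the delay to wait before the next attempt, or null if this
   * function has nothing to say. Declares no checked exceptions.
   */
  Duration calculateDelay(Object result, Throwable exception);
}

// Hypothetical holder class used only for illustration.
final class DelayFunctionDemo {
  // Java 8 callers can pass a lambda.
  static final DelayFunction LAMBDA =
      (result, exception) -> exception == null ? null : Duration.ofSeconds(1);

  // Java 6/7 style callers can write an anonymous class with a readable method name.
  static final DelayFunction ANONYMOUS = new DelayFunction() {
    @Override
    public Duration calculateDelay(Object result, Throwable exception) {
      return exception == null ? null : Duration.ofSeconds(1);
    }
  };
}
```

Both forms behave the same; the named interface mainly buys a descriptive method name and a place for Javadoc.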

Tembrel · 2017-10-01T23:04:00Z

All good comments. Waiting to hear from others whether this PR is really worth it. If so, will integrate these suggestions.

Removed test of throwing checked exception from delay function, since delay functions no longer throw checked exceptions.

coveralls · 2017-10-03T19:03:15Z

Coverage increased (+0.1%) to 84.317% when pulling 5fc8b8a on Tembrel:dynamic-delay into 031f362 on jhalterman:master.

coveralls · 2017-10-03T19:07:08Z

Coverage increased (+0.1%) to 84.317% when pulling 5377904 on Tembrel:dynamic-delay into 031f362 on jhalterman:master.

Tembrel · 2017-10-03T20:24:14Z

@whiskeysierra - all your suggestions incorporated (although the second one obviated the need for the first).

duergner · 2017-10-04T05:37:56Z

Would love to have that PR merged. I see the most obvious use case for doing "correct" retry based on rate limiting response headers.

whiskeysierra · 2017-10-12T05:43:48Z

I'd argue that a proper interface should be the default from a design perspective. It gives you speaking names, for the interface and the method as well as a place where you can put documentation.

jhalterman · 2017-10-12T05:49:25Z

This is good stuff. I like the use case and the simple solution.

I used net.jodah.failsafe.util.Duration in signature of the delay function, though
I really wanted to use java.time.Duration

I know :) One of these days we can sever all pre-Java 8 compatibility (at which point we can replace some of the anonymous classes with lambdas), but we have to stay friendly for our pre-Java 8 users.

A few other comments:

Can (/should) we just call it RetryPolicy.withDelay? I'm a fan of short names and the parameters indicate that the user must compute the delay here.
Do we want to be strict and throw IllegalStateException if a delayFunction is already configured and a user calls withDelay or withBackoff, and vice versa? Otherwise we should Javadoc which is used if a delayFunction is configured along with a delay value.
How should we handle a computeDelay result of <= 0?
Since we're handing execution over to users, how should we handle potential exceptions in computeDelay (that happen for whatever reason)?
Is there any use case where a DelayFunction might also need to accept an ExecutionContext (to base a delay on the number of executions so far, for example)?
Right now, Failsafe is used in Java 6/7 applications. The way this works is that 6/7 users simply shouldn't use any of the APIs that would attempt to load a Java 8 class such as the .future stuff, otherwise everything else just works. I am wondering if the inclusion of the @FunctionalInterface annotation would cause any problems for 6/7 users.

whiskeysierra · 2017-10-12T05:55:51Z

It would be awesome if we could allow a delay function and backoff to be used in conjunction, e.g. by falling back to backoff if no delay was calculated.

jhalterman · 2017-10-12T05:58:25Z

src/main/java/net/jodah/failsafe/AbstractExecution.java

+        Duration dynamicDelay = delayFunction.calculateDelay(result, failure);
+        if (dynamicDelay != null && dynamicDelay.toNanos() >= 0)
+            delayNanos = dynamicDelay.toNanos();
+    }
    if (delayNanos == -1)


Do we want to allow the backoff and delayFunction features to be combined? I'm thinking no - we just let the delayFunction do all of the work, in which case I think this could be an else if.

Sounds good to me.

jhalterman · 2017-10-12T06:09:36Z

@whiskeysierra I just left a comment wondering about that. Are you thinking that if DelayFunction had a distinguished return value such as <= 0, we'd fall back to the configured delay value (which could include a backoff)?

It might be awkward for a delayFunction's result to be altered by a backoff adjustment - the two could wind up battling each other (backoff adjustment lowers the waitTime, delayFunction might increase it again) which has me wondering - is there a use case for delay + delayFunction?

whiskeysierra · 2017-10-12T13:16:43Z

@jhalterman My primary use case is the Retry-After header. We may get it as part of a response in some cases and not in others. That leaves me with the problem of how to calculate a delay in the second case. If I could easily fall back to exponential backoffs (or fixed delays) as configured and provided by failsafe directly, then I wouldn't have to worry about that case. I wouldn't alter the result, but define a special one (as you suggested, e.g. -1) to signal a fallback.
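A rough sketch of that use case, under stated assumptions: the response is represented by a plain header map, only the delta-seconds form of Retry-After is handled, and a negative Duration is the "fall back" signal being discussed. RetryAfterDelay and fromHeaders are hypothetical names for illustration.

```java
import java.time.Duration;
import java.util.Map;

// Hypothetical helper: turns a Retry-After header into a delay, or signals fallback.
final class RetryAfterDelay {
  // Negative sentinel meaning "no opinion, use the configured delay/backoff".
  static final Duration FALL_BACK = Duration.ofNanos(-1);

  // Assumption: headers is a simple map standing in for an HTTP response.
  // The HTTP-date form of Retry-After is not parsed and results in a fallback.
  static Duration fromHeaders(Map<String, String> headers) {
    String value = headers.get("Retry-After");
    if (value == null)
      return FALL_BACK; // header absent: let the policy decide
    try {
      long seconds = Long.parseLong(value.trim());
      return seconds >= 0 ? Duration.ofSeconds(seconds) : FALL_BACK;
    } catch (NumberFormatException e) {
      return FALL_BACK; // not a plain number of seconds
    }
  }
}
```

A delay function could delegate to fromHeaders whenever the result carries response headers, so that responses without the header fall through to the configured delay or backoff.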

Tembrel · 2017-10-12T13:39:36Z

@whiskeysierra @jhalterman - If I'm following correctly, it sounds like everyone is in favor of having dynamicDelay < 0 mean "fall through to regular delay/backoff behavior", i.e., as if no delay function had been provided. And it's hard to imagine a use for a calculated delay with the delay factor applied.

whiskeysierra · 2017-10-12T13:45:58Z

And it's hard to imagine a use for a calculated delay with the delay factor applied.

I agree. If the delay function calculates something, I believe it should be taken as is.

Tembrel · 2017-10-12T14:16:24Z

Can (/should) we just call it RetryPolicy.withDelay? I'm a fan of short names and the parameters indicate that the user must compute the delay here.

Funny, I was just talking to Josh Bloch (of Effective Java fame) yesterday, and he reminded me not to
be too clever with overloading. Short names are great, and maybe there's no danger in this particular
overload, but if the longer name would avoid confusion for even one user, it's worth the extra 8 chars.

Do we want to be strict and throw IllegalStateException if a delayFunction is already configured and a user calls withDelay or withBackoff, and vice versa? Otherwise we should Javadoc which is used if a delayFunction is configured along with a delay value.

I like your suggestion of using negative delay function return values as a signal to revert to regular delay/backoff.

How should we handle a computeDelay result of <= 0?

(You mean < 0, right? It's reasonable to return 0 as a delay.)

Answered by previous: Let negative computed delays mean "revert to regular delay/backoff".

To implement this, we would need a separate variable to hold the final delay value to use for this retry, so we don't trash the delayNanos field with earlier computeDelay results; we need that so that if/when we revert to delay/backoff logic we're not interfering with the sequence of backoff values.
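One way to picture the separate-variable idea is the sketch below. It is not the code in AbstractExecution; DelaySelector and its fields are hypothetical, and it assumes the backoff sequence only advances on attempts that actually fall back to it.

```java
// Hypothetical sketch: choose the wait time for one retry without letting
// computed delays disturb the backoff sequence.
final class DelaySelector {
  private final long baseDelayNanos;
  private final double delayFactor;
  private final long maxDelayNanos;
  // Last value produced by the static/backoff sequence; -1 until it starts.
  private long backoffNanos = -1;

  DelaySelector(long baseDelayNanos, double delayFactor, long maxDelayNanos) {
    this.baseDelayNanos = baseDelayNanos;
    this.delayFactor = delayFactor;
    this.maxDelayNanos = maxDelayNanos;
  }

  /**
   * computedNanos is what the delay function returned for this attempt;
   * a negative value means "revert to regular delay/backoff".
   */
  long nextWaitNanos(long computedNanos) {
    if (computedNanos >= 0)
      return computedNanos; // used as is; backoff state is left untouched
    backoffNanos = backoffNanos == -1
        ? baseDelayNanos
        : Math.min((long) (backoffNanos * delayFactor), maxDelayNanos);
    return backoffNanos;
  }
}
```

With a base of 100 ns, a factor of 2 and a cap of 1000 ns, the inputs -1, 50, -1 yield waits of 100, 50 and 200: the computed 50 is used as is, and the backoff sequence continues from where it left off.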

Since we're handing execution over to users, how should we handle potential exceptions in computeDelay (that happen for whatever reason)?

Elsewhere I said that I can't find a compelling reason for computeDelay to throw checked exceptions,
so I think we're just talking about unchecked exceptions. Furthermore, since the programmer is free to handle any expected exceptions in the implementation of computeDelay, we're really only talking about unexpected exceptions/errors.

I think the right behavior is to let unexpected exceptions break the whole Failsafe computation by
propagating unhandled out of complete(obj, ex, checkArgs).

For example, if my attempt to read the Retry-After header throws an OutOfMemoryError, I wouldn't want the fact that memory is exhausted to get swept under the rug. It's not part of the computation I'm running under Failsafe, it's part of the surrounding machinery. When something unexpected happens in that machinery, I want to hear about it fast.

Is there any use case where a DelayFunction might also need to accept an ExecutionContext (to base a delay on the number of executions so far, for example)?

Oh, yes, that does make sense. Another argument to computeDelay?

(If so, that's more grist for the custom functional interface mill.)

Right now, Failsafe is used in Java 6/7 applications. The way this works is that 6/7 users simply shouldn't use any of the APIs that would attempt to load a Java 8 class such as the .future stuff, otherwise everything else just works. I am wondering if the inclusion of the @FunctionalInterface annotation would cause any problems for 6/7 users.

I would have thought it wouldn't be a problem, but I notice that @FunctionalInterface has RUNTIME retention.

I'll find out.

whiskeysierra · 2017-10-12T14:20:45Z

I would have thought it wouldn't be a problem, but I notice that @FunctionalInterface has RUNTIME retention.

I believe that the JVM will just silently ignore annotations that are not on the classpath. We should be fine here.

Tembrel · 2017-10-12T14:58:29Z

@whiskeysierra - I think you're right. Here's some supporting evidence: https://stackoverflow.com/a/3567969

jhalterman · 2017-10-12T17:37:52Z

Just realized, DelayFunction's result needs to use a type parameter in order to avoid a cast when used in a lambda:

withDelayFunction((HttpRequest req, Throwable failure) -> ...)

Short names are great, and maybe there's no danger in this particular
overload, but if the longer name would avoid confusion for even one user, it's worth the extra 8 chars.

Yea, this one is borderline, but we already overload other parts of the API with functional things (retryOn, abortOn), so I'd prefer the overload here and just call it withDelay.

And it's hard to imagine a use for a calculated delay with the delay factor applied.

Cool - let's document < 0 as a distinguished return value that falls back to the withDelay configured value, else no delay. Since backoffs may not play nice with a DelayFunction, let's disallow them by throwing IllegalStateException in .withDelay or .withBackoff if the other is already configured. Backoff adjustments should not be made to DelayFunction provided values.
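As a sketch of the proposal at this point in the thread (not necessarily what was finally merged), the mutual exclusion could be enforced in the builder-style setters. ExclusiveDelayPolicy is a hypothetical stand-in for RetryPolicy, the exception messages are made up, and java.time.Duration is assumed only for self-containment.

```java
import java.time.Duration;

// Hypothetical stand-in for RetryPolicy showing only the exclusivity checks.
final class ExclusiveDelayPolicy {
  @FunctionalInterface
  interface DelayFunction {
    Duration computeDelay(Object result, Throwable failure);
  }

  private DelayFunction delayFunction;
  private boolean backoffConfigured;

  ExclusiveDelayPolicy withBackoff(Duration delay, Duration maxDelay) {
    if (delayFunction != null)
      throw new IllegalStateException("Delay function has already been set");
    // Validation and storage of delay/maxDelay are omitted in this sketch.
    backoffConfigured = true;
    return this;
  }

  ExclusiveDelayPolicy withDelay(DelayFunction function) {
    if (function == null)
      throw new NullPointerException("function");
    if (backoffConfigured)
      throw new IllegalStateException("Backoff delays have already been set");
    delayFunction = function;
    return this;
  }
}
```

Whichever of the two is configured first wins, and the second call fails fast at configuration time rather than producing surprising delays at run time.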

I think the right behavior is to let unexpected exceptions break the whole Failsafe computation by propagating unhandled out of complete(obj, ex, checkArgs).
For example, if my attempt to read the Retry-After header throws an OutOfMemoryError, I wouldn't want the fact that memory is exhausted to get swept under the rug. It's not part of the computation I'm running under Failsafe, it's part of the surrounding machinery. When something unexpected happens in that machinery, I want to hear about it fast.

Good point re: fail fast. We could still fail on errors such as OutOfMemoryError, just ignore exceptions. My argument for ignoring is that I wouldn't want something minor like a retry policy delay computation to ruin a Failsafe execution. Counter-argument would be as you say, what if it's not minor :) Not sure...

Either way, I think we can add a throws Exception to the DelayFunction's method clause just to be more friendly to code that may throw a checked exception.

I think the right behavior is to let unexpected exceptions break the whole Failsafe computation by propagating unhandled

If we do propagate instead of ignore, we'll want to test that the propagation works for async executions as well.

Another argument to computeDelay

Yea - we should include ExecutionContext because the attempt count could be useful for people who want to compute their own backoff. We do actually have a type for this, ContextualResultListener, but it probably makes sense to define a new one for computing a delay.
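For illustration, a context-aware delay function that computes its own backoff from the attempt count might look like the sketch below. AttemptContext is a hypothetical stand-in for ExecutionContext that exposes only an execution count, and ContextualDelayFunction and OwnBackoff are made-up names.

```java
import java.time.Duration;

// Hypothetical stand-in for ExecutionContext; only an attempt count is assumed.
interface AttemptContext {
  int getExecutions();
}

// Hypothetical three-argument delay function that also receives the context.
@FunctionalInterface
interface ContextualDelayFunction<R, F extends Throwable> {
  Duration computeDelay(R result, F failure, AttemptContext context);
}

final class OwnBackoff {
  // Doubles a 100 ms base delay for each execution after the first, capped at 5 s.
  static final ContextualDelayFunction<Object, Throwable> EXPONENTIAL =
      (result, failure, context) -> {
        // Clamp the shift so the left shift below cannot overflow.
        int shift = Math.min(Math.max(context.getExecutions() - 1, 0), 16);
        long millis = Math.min(100L << shift, 5_000L);
        return Duration.ofMillis(millis);
      };
}
```

Here the result and failure are ignored entirely; a real function would more likely combine them with the attempt count.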

Tembrel · 2017-10-15T16:48:06Z

Either way, I think we can add a throws Exception to the DelayFunction's method clause just to be more friendly to code that may throw a checked exception.

The whole point of checked exceptions is to make it hard for the programmer to forget about handling a predictable exceptional condition. If we added throws Exception to computeDelay we'd be asking for trouble. Better to force them to think about the checked exceptions thrown by the code they use to implement computeDelay. Anyone who really wants to avoid thinking can catch Exception and wrap as unchecked, but at least then the code will have an obvious bad smell.
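A small sketch of what this looks like from the implementor's side, assuming an unchecked computeDelay signature: the checked exception has to be dealt with explicitly inside the function. HintSource is a hypothetical collaborator that can fail with IOException, and UncheckedDelayFunction is a stand-in for the PR's DelayFunction.

```java
import java.io.IOException;
import java.io.UncheckedIOException;
import java.time.Duration;

// Stand-in for a delay function whose method declares no checked exceptions.
@FunctionalInterface
interface UncheckedDelayFunction {
  Duration computeDelay(Object result, Throwable failure);
}

final class HandledCheckedException {
  // Hypothetical source of a delay hint that can fail with a checked exception.
  interface HintSource {
    long readDelayMillis() throws IOException;
  }

  static UncheckedDelayFunction from(HintSource source) {
    return (result, failure) -> {
      try {
        return Duration.ofMillis(source.readDelayMillis());
      } catch (IOException e) {
        // The compiler forces a decision here. Wrapping is one option;
        // returning a fallback value would be another.
        throw new UncheckedIOException(e);
      }
    };
  }
}
```

Because the interface method has no throws clause, forgetting the catch block is a compile error rather than a silent runtime surprise.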

Tembrel · 2017-10-29T23:46:19Z

On the subject of overloading withDelay: We have to provide an accessor that returns the currently configured DelayFunction. We can't overload the existing getDelay, so this accessor has to be called getDelayFunction. But if we have withDelay(long, TimeUnit) and withDelay(DelayFunction) that makes for an awkward non-parallel with getDelay and getDelayFunction. (It's already a little strange that withDelay takes a long and a TimeUnit, but getDelay returns a Duration, but that one is understandable.)

Tembrel · 2017-10-30T02:33:10Z

@jhalterman wrote:

Just realized, DelayFunction's result needs to use a type parameter in order to avoid a cast when using in a lambda:
withDelayFunction((HttpRequest req, Throwable failure) -> ...)

It's not so easy to do this, because the type of the delay function field on the RetryPolicy has to be something like DelayFunction<?>. RetryPolicies don't know what contexts the delay function will be called in.

That means that AbstractExecution.complete(Object result, Throwable failure, boolean checkArgs) has no way of knowing whether it can legitimately (i.e., in a type-safe way) call its retry policy's delay function on the result argument.

The best we can do is provide a <T>-parameterized overloading of withDelayFunction (or withDelay -- still waiting for clarity on this) that takes an additional Class<T> result type argument along with a DelayFunction<T> argument and stores it in a field of the RetryPolicy. Then we can check in AbstractExecution.complete whether the provided result is either null or an instance of the provided result type.

Similarly with the failure type: We'd need another overloading to include a restriction on what kinds of failures the delay function is interested in.

I'm going to check in modifications to include these overloadings, for now without changing the method names from withDelayFunction to withDelay. I've also included the logic that makes backoff delays and dynamic delays mutually exclusive, and provided tests for all of this.
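The type check described above can be sketched roughly as follows. This is an illustration under assumptions rather than the PR's code: TypedDelay, canApply and computeIfApplicable are hypothetical names, the ExecutionContext argument is left out, and java.time.Duration is used for self-containment.

```java
import java.time.Duration;

// Simplified two-argument delay function with result and failure type parameters.
@FunctionalInterface
interface TypedDelayFunction<R, F extends Throwable> {
  Duration computeDelay(R result, F failure);
}

// Hypothetical holder pairing a delay function with the types it expects.
final class TypedDelay<R, F extends Throwable> {
  private final TypedDelayFunction<R, F> function;
  private final Class<R> resultType;
  private final Class<F> failureType;

  TypedDelay(TypedDelayFunction<R, F> function, Class<R> resultType, Class<F> failureType) {
    this.function = function;
    this.resultType = resultType;
    this.failureType = failureType;
  }

  /** True if each argument is either null or an instance of the declared type. */
  boolean canApply(Object result, Throwable failure) {
    return (result == null || resultType.isInstance(result))
        && (failure == null || failureType.isInstance(failure));
  }

  /** Returns the computed delay, or null if the function does not apply. */
  Duration computeIfApplicable(Object result, Throwable failure) {
    if (!canApply(result, failure))
      return null;
    // Class.cast is safe here because canApply has verified both arguments.
    return function.computeDelay(resultType.cast(result), failureType.cast(failure));
  }
}
```

The Class tokens are what make the call type-safe at run time, since the policy itself only ever sees an Object result and a Throwable failure.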

coveralls · 2017-10-30T03:21:29Z

Coverage increased (+0.6%) to 84.769% when pulling a4d519d on Tembrel:dynamic-delay into 031f362 on jhalterman:master.

add overload with delay function and failure type.

coveralls · 2017-10-30T16:35:31Z

Coverage increased (+0.6%) to 84.796% when pulling 1833068 on Tembrel:dynamic-delay into 031f362 on jhalterman:master.

Tembrel · 2017-10-30T17:04:52Z

I did the method renaming that @jhalterman asked for, realizing that the get... methods are already not parallel with the with... methods.

I don't think there's a need for variants that just take a result type or a failure type, since the typical use will have to mention the expected types explicitly anyway in the lambda, e.g.,

RetryPolicy retryPolicy = new RetryPolicy()
    .withDelay((SpecificResult result, SpecificException failure, ExecutionContext context) -> ...,
        SpecificResult.class, SpecificException.class)
    ...

So I removed those, leaving just

RetryPolicy withDelay(DelayFunction<?, ? extends Throwable> delayFunction)
// and
<R, F extends Throwable> RetryPolicy withDelay(
    DelayFunction<R, F> delayFunction, Class<R> resultType, Class<F> failureType)

coveralls · 2017-10-30T17:06:27Z

Coverage increased (+0.6%) to 84.769% when pulling 57073c2 on Tembrel:dynamic-delay into 031f362 on jhalterman:master.

jhalterman · 2018-03-30T19:36:04Z

src/main/java/net/jodah/failsafe/RetryPolicy.java

+   *     {@code failureType} is null
+   * @throws IllegalStateException if backoff delays have already been set
+   */
+  public <R, F extends Throwable> RetryPolicy withDelay(DelayFunction<R, F> delayFunction,


Coming back to this PR after too long (apologies) and I like it overall except for this method signature, which seems a bit too prescriptive for certain cases and awkward for others, e.g.:

rp.withDelay(delayFn, Object.class, TooManyRequestsException.class);

Foremost, I'm wondering about the use case for this, where we'd care about the result and exception types together. Most likely we only care about one or the other. So with that in mind, along with consistency with the rest of the API, how about:

rp.withDelayOn(delayFunction, TooManyRequestsException.class); // delay on specific failure
// or
rp.withDelayWhen(delayFunction, 500); // delay on specific result

...where the use of the "On" and "When" wording indicates that the delay is conditional. Of course all of these are just convenience methods since withDelay(DelayFunction) can always be used by itself.

Thoughts?

As part of this, we may also consider whether to expose the delay result and failure type, or whether we should just add a canApplyDelayFunction to RetryPolicy which, similar to canRetry and canAbort, would evaluate the conditionals internally for some result/failure.

Looks like you went ahead and merged. I haven't had a chance to respond to your last comment, though.

No worries. I wasn't sure if you were tied up (which I can sympathize with) so I decided to just merge the PR as is and make the tweaks I had in mind above afterwards. Feel free to share your thoughts on that whenever you can.

All seems reasonable, and I see you've already started the renaming in #128.

+1 for canApplyDelayFn, if it's straightforward to implement.

Cool. #128 was just me testing something against Travis, but the commits for this are already in master. I also added the ability to fall back to static or backoff delays if the DelayFunction returns a negative value. Supporting both seemed more consistent.

Tembrel added 2 commits September 30, 2017 17:13

Dynamic delay computation

b0f41d2

Move dynamic delay test to own package.

4f3b60d

Wrap checked exceptions as unchecked in applying delay function.

Tembrel mentioned this pull request Sep 30, 2017

Support dynamic retry delays #110

Closed

More test coverage

05180e1

whiskeysierra mentioned this pull request Oct 1, 2017

Added failsafe support zalando/riptide#220

Merged

whiskeysierra reviewed Oct 1, 2017

Use functional interface DelayFunction instead of CheckedBiFunction.

5fc8b8a

Removed test of throwing checked exception from delay function, since delay functions no longer throw checked exceptions.

PR failsafe-lib#111: remove unused import

5377904

jhalterman reviewed Oct 12, 2017

PR failsafe-lib#111: type-specific delay functions

a4d519d

PR failsafe-lib#111: rename withDelayFunction to withDelay, add overload with delay function and failure type.

1833068

PR failsafe-lib#111: no two-arg variants of withDelay(DelayFunction...)

57073c2

jhalterman reviewed Mar 30, 2018

jhalterman merged commit e5d274d into failsafe-lib:master Apr 5, 2018

Tembrel deleted the dynamic-delay branch April 6, 2018 01:45
