Add more retries to more components #423

skandragon · 2018-11-13T04:00:38Z

While I would like a more general approach, and this is not quite as clean as I'd like, it's a step toward getting things closer to sane.

I don't yet have metrics wired up, but will add this eventually; I may also split out the "call this method until it stops throwing exceptions" type of call now into a more sane context (http, atlas) aware retry system. For now, this will work better than what was there.

kayenta-core/src/main/java/com/netflix/kayenta/util/Retry.java

kayenta-atlas/src/main/java/com/netflix/kayenta/atlas/metrics/AtlasMetricsService.java

kayenta-core/src/test/groovy/com/netflix/kayenta/util/RetrySpec.groovy

kayenta-core/src/main/java/com/netflix/kayenta/util/Retry.java

duftler · 2018-11-13T13:15:22Z

...e-configbin/src/main/java/com/netflix/kayenta/configbin/storage/ConfigBinStorageService.java

@@ -29,7 +29,7 @@
 import com.netflix.kayenta.security.AccountCredentialsRepository;
 import com.netflix.kayenta.storage.ObjectType;
 import com.netflix.kayenta.storage.StorageService;
-import com.netflix.spinnaker.kork.core.RetrySupport;
+import com.netflix.kayenta.util.Retry;


It occurred to me after I finished reviewing the PR that I had seen this logic before. I search kayenta on github and ended up back at this file. Wouldn't it make more sense to just add the new logic to the existing RetrySupport class in kork?

This is a temporary step along the path to make things more context aware, so I'd prefer to keep it inside kayenta for now.

Aloren · 2018-11-21T20:33:56Z

kayenta-core/src/main/java/com/netflix/kayenta/util/Retry.java

+
+import java.util.function.Supplier;
+
+public class Retry {


Please have a look at spring-retry.
It might be a good solution in case there is a need for more sophisticated retry policies: like configuring which types of exceptions should be retried, adding retry metrics or providing different backoff policies.

* feat(retry): add a retry util * bug(retry): retry Atlas actions * bug(retry): improve retry in ConfigBin * bug(retry): retry some s3 calls that seem prone to failure

skandragon added 3 commits November 12, 2018 19:51

feat(retry): add a retry util

e195802

bug(retry): retry Atlas actions

fadfd73

bug(retry): improve retry in ConfigBin

88a06de

skandragon added bug 2018Q4 labels Nov 13, 2018

skandragon self-assigned this Nov 13, 2018

skandragon requested a review from duftler November 13, 2018 04:00

duftler reviewed Nov 13, 2018

View reviewed changes

bug(retry): retry some s3 calls that seem prone to failure

2749c08

skandragon merged commit 4375fe7 into spinnaker:master Nov 14, 2018

skandragon deleted the retries-take-two branch November 14, 2018 22:11

spinnakerbot added the target-release/1.11 label Nov 14, 2018

Aloren reviewed Nov 21, 2018

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add more retries to more components #423

Add more retries to more components #423

skandragon commented Nov 13, 2018

duftler Nov 13, 2018

skandragon Nov 14, 2018

Aloren Nov 21, 2018

Add more retries to more components #423

Add more retries to more components #423

Conversation

skandragon commented Nov 13, 2018

duftler Nov 13, 2018

Choose a reason for hiding this comment

skandragon Nov 14, 2018

Choose a reason for hiding this comment

Aloren Nov 21, 2018

Choose a reason for hiding this comment