Feature request: retry Resource Options #7932

gordonbondon · 2021-09-09T11:47:19Z

Similar to customTimeouts add customRetries options to resource.

In ideal world resource providers would handle retrying common errors (similar to what terraform does), but this probably won't happen for autogenerated providers.

So this is a feature request to allow providing custom retry logic to resources. Basic usage can look like this:

const key = new azure.dbforpostgresql.ServerKey("key", {/*...*/},
  {
    customRetries: {
      create: {
        maxAttempts: 5,
        delay: 10,
        retriableErrors: [
          "AzureKeyVaultMissingPermissions",
        ],
      }
    },
  },
);

Upon errors on creation/update/etc pulumi would compare error code with provided config, and if there is a match - retry with given settings. If no retriableErrors is provided - just retry on any error.

Workarounds

In some cases it is possible to add custom retry checker in output.apply that will wait for the condition to be resolved (see workaround here pulumi/pulumi-aws#673 (comment)). But it's not always possible, so a more general solution with retriable errors would be nice.

Related issues

#3715 requested the same, but was closed by author.

pulumi/pulumi-azure#1084, pulumi/pulumi-aws#673, pulumi/pulumi-azure-native#903, https://stackoverflow.com/questions/69085796/how-to-wait-for-group-permission-to-have-been-applied - issues where resource creation fails because of eventual consistency in infrastructure, where retrying on some specific error would help.

The text was updated successfully, but these errors were encountered:

lkt82 · 2021-12-17T12:15:59Z

This is a problem I have with certain Azure resources as well. Where the only workaround at the moment is to retry the pulumi up operation

richard-fairthorne · 2022-12-25T14:26:38Z

My provider has eventual consistency in their API. Sometimes a resource I created takes a few seconds to be available. Retrying "pulumi up" results in a second 40 minute provisioning wait. Retying at the provider level would reduce that to an additional 5 seconds or so.

The most basic version of this could implement "attempts" and "delay" and probably satisfy the most urgent use cases.

jameswoodley · 2023-01-10T20:43:19Z

I could definitely do with this. Creating an app service with a custom domain on Azure requires a TXT record to be created, I guess sometimes DNS takes a while to work, so having the retry would be super useful

RobbieMcKinstry · 2023-01-27T17:53:44Z

This reminds me of the AWS Lambda bug, where querying to see if the Lambda caches "no" for upwards of 5+ seconds, but waiting a short amount of time before querying returns "yes" much sooner.

andrewdibiasio6 · 2023-05-11T15:48:15Z

our app is very large. This would save us ~30-40 mins.

alextricity25 · 2023-06-15T18:39:22Z

+1 I would LOVE to see something like this :)

JiriKovar · 2023-11-06T10:17:28Z

We are facing this very same issue with a rather large states in the terraform and I did hope that this is one of the things Pulumi would help us with - being forced to run the whole terraform apply / pulumi up because of transient network error is quite frustrating. I do get that it would probably need to be implemented in the providers and therefor its not an easy thing to achieve, but it would be an awesome competitive advantage.

hellt · 2024-03-04T10:55:02Z

so I understand it that a regular error handling inside the pulumi program won't cut it? The pulumi program will exit anyhow, even if we have smth like try/except block?

criskurtin · 2024-05-16T12:06:55Z

I have a case where this would help right now. I'm creating an Aurora serverless v2 instance, and while the resource is created successfully, it sometimes takes a bit for the hostname to be advertised on DNS. Because of this, next step (which is creating a database) usually fails and I have to re-run the job for it to finish up creating the rest of resources. Having a retry mechanism on the database resource would resolve this issue and minimize the need of manual intervention.

gordonbondon added the kind/enhancement Improvements or new features label Sep 9, 2021

gordonbondon mentioned this issue Sep 9, 2021

Feature Request: add a retry Resource Option #3715

Closed

emiliza added kind/enhancement Improvements or new features and removed kind/enhancement Improvements or new features labels Sep 10, 2021

lukehoban mentioned this issue May 18, 2023

Providers: Support basic retry functionality for failed provider requests #11654

Closed

rquitales mentioned this issue Oct 18, 2023

Bucket mixin callback functions should have a retry pulumi/pulumi-gcp#1277

Open

justinvp added the area/resource-options label Nov 2, 2023

RaicuRobert mentioned this issue Jan 10, 2024

Waiting for custom condition to be met pulumi/pulumi-azure#1595

Open

Frassle mentioned this issue May 8, 2024

Retry provider operations #16141

Closed

danielrbradley mentioned this issue May 8, 2024

Race Condition Issue with User Assigned Managed Identity's PrincipalId and SqlResourceSqlRoleAssignment pulumi/pulumi-azure-native#2816

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request: retry Resource Options #7932

Feature request: retry Resource Options #7932

gordonbondon commented Sep 9, 2021 •

edited

lkt82 commented Dec 17, 2021

richard-fairthorne commented Dec 25, 2022 •

edited

jameswoodley commented Jan 10, 2023

RobbieMcKinstry commented Jan 27, 2023

andrewdibiasio6 commented May 11, 2023 •

edited

alextricity25 commented Jun 15, 2023

JiriKovar commented Nov 6, 2023 •

edited

hellt commented Mar 4, 2024

criskurtin commented May 16, 2024

Feature request: retry Resource Options #7932

Feature request: retry Resource Options #7932

Comments

gordonbondon commented Sep 9, 2021 • edited

Workarounds

Related issues

lkt82 commented Dec 17, 2021

richard-fairthorne commented Dec 25, 2022 • edited

jameswoodley commented Jan 10, 2023

RobbieMcKinstry commented Jan 27, 2023

andrewdibiasio6 commented May 11, 2023 • edited

alextricity25 commented Jun 15, 2023

JiriKovar commented Nov 6, 2023 • edited

hellt commented Mar 4, 2024

criskurtin commented May 16, 2024

gordonbondon commented Sep 9, 2021 •

edited

richard-fairthorne commented Dec 25, 2022 •

edited

andrewdibiasio6 commented May 11, 2023 •

edited

JiriKovar commented Nov 6, 2023 •

edited