Add webhook to rerun failed (or terminated) orchestrations #1243

thdotnet · 2020-02-28T15:25:55Z

At this moment, there are a few operations / endpoints when using Durable Functions (IDurableOrchestrationContext):

statusQueryGetUri
sendEventPostUri
terminatePostUri
purgeHistoryDeleteUri

I would like the possibility to re-execute a workflow, and the reason is as I'm not the caller of the Starter function so I cannot provide the same input parameters. I think this feature could be useful specially when due some activity exceeds the retry count, but after some time, there are no problems anymore and I need to reprocess.

ConnorMcMahon · 2020-02-28T23:34:19Z

It sounds like you want a retryOrchestration endpoint. I could potentially see the use in that.

@cgillum, thoughts?

EDIT: removed my workaround, as Chris had a far better idea for how to get this behavior today.

cgillum · 2020-02-29T00:20:28Z

It's an interesting feature idea for sure. The closest thing we have today is the Rewind API, which is designed to re-run only the most recent logic after a failure occurs. However, this is still in preview because there are a lot of edge cases where it doesn't work.

A restart API is interesting because it's conceptually very simple and would probably be easy to implement. Basically, we just need to query the input from the existing orchestration and then create a start message to restart it.

But given this, could you implement it yourself as well? For example:

[FunctionName("RestartOrchestration")]
public static async Task<HttpResponseMessage> RestartOrchestration(
    [HttpTrigger(AuthorizationLevel.Function, methods: "post", Route = "orchestrations/{instanceId}/restart")] HttpRequestMessage req,
    [DurableClient] IDurableClient client,
    string instanceId)
{
    DurableOrchestrationStatus status = await client.GetStatusAsync(
        instanceId,
        showHistory: false,
        showHistoryOutput: false,
        showInput: true);

    // TODO: Check the runtime status to make sure it's in a restartable state

    await client.StartNewAsync(
        orchestratorFunctionName: status.Name,
        instanceId: status.InstanceId,
        status.Input);

    return client.CreateCheckStatusResponse(req, instanceId);
}

cgillum · 2020-02-29T17:19:08Z

The more I think about this, the more I think we should have this built in. It would be a great tool to help folks recover from problems, including stuck orchestrations, without necessarily needing to wait on support tickets. It would be interesting to see whether or how this might work for sub-orchestrations too.

/cc @anthonychu

thdotnet · 2020-02-29T19:33:42Z

Sorry, I'm late for the discussion... but I can provide more details if needed.

The rewind API is almost the same idea, however I would like the ability to reprocess any execution.

anthonychu · 2020-03-01T00:08:41Z

This is a good idea. What's the advantage of restarting the same orchestration, vs starting a new one with the same input copied from the original orchestration? I'm not opposed to restarting but it does feel weird from an event sourcing perspective to delete history and start over.

thdotnet · 2020-03-01T01:03:18Z

Both would work. In fact, I think it should start a new one with some kind of traceability

raffi1965 · 2021-03-05T13:52:41Z

It's an interesting feature idea for sure. The closest thing we have today is the Rewind API, which is designed to re-run only the most recent logic after a failure occurs. However, this is still in preview because there are a lot of edge cases where it doesn't work.

A restart API is interesting because it's conceptually very simple and would probably be easy to implement. Basically, we just need to query the input from the existing orchestration and then create a start message to restart it.

But given this, could you implement it yourself as well? For example:
[FunctionName("RestartOrchestration")]
public static async Task<HttpResponseMessage> RestartOrchestration(
    [HttpTrigger(AuthorizationLevel.Function, methods: "post", Route = "orchestrations/{instanceId}/restart")] HttpRequestMessage req,
    [DurableClient] IDurableClient client,
    string instanceId)
{
    DurableOrchestrationStatus status = await client.GetStatusAsync(
        instanceId,
        showHistory: false,
        showHistoryOutput: false,
        showInput: true);

    // TODO: Check the runtime status to make sure it's in a restartable state

    await client.StartNewAsync(
        orchestratorFunctionName: status.Name,
        instanceId: status.InstanceId,
        status.Input);

    return client.CreateCheckStatusResponse(req, instanceId);
}

client.GetStatusAsync always returns NULL

ghost added the Needs: Triage 🔍 label Feb 28, 2020

ConnorMcMahon added Enhancement Feature requests. needs-discussion and removed Needs: Triage 🔍 labels Feb 28, 2020

ConnorMcMahon changed the title ~~rerun workflow~~ Add webhook to rerun failed (or terminated) orchestrations Feb 28, 2020

cgillum added this to the Extension vNext milestone Aug 27, 2020

cgillum removed the needs-discussion label Aug 27, 2020

amdeel self-assigned this Aug 27, 2020

ConnorMcMahon modified the milestones: Extension v2.3.1, Extension v.2.4.0 Oct 2, 2020

amdeel mentioned this issue Nov 3, 2020

Added RestartAsync API to rerun existing orchestrator instances #1545

Merged

amdeel closed this as completed in #1545 Nov 25, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add webhook to rerun failed (or terminated) orchestrations #1243

Add webhook to rerun failed (or terminated) orchestrations #1243

thdotnet commented Feb 28, 2020

ConnorMcMahon commented Feb 28, 2020 •

edited

Loading

cgillum commented Feb 29, 2020

cgillum commented Feb 29, 2020

thdotnet commented Feb 29, 2020

anthonychu commented Mar 1, 2020

thdotnet commented Mar 1, 2020

raffi1965 commented Mar 5, 2021

Add webhook to rerun failed (or terminated) orchestrations #1243

Add webhook to rerun failed (or terminated) orchestrations #1243

Comments

thdotnet commented Feb 28, 2020

ConnorMcMahon commented Feb 28, 2020 • edited Loading

cgillum commented Feb 29, 2020

cgillum commented Feb 29, 2020

thdotnet commented Feb 29, 2020

anthonychu commented Mar 1, 2020

thdotnet commented Mar 1, 2020

raffi1965 commented Mar 5, 2021

ConnorMcMahon commented Feb 28, 2020 •

edited

Loading