Fail durable tasks immediately for non-retryable errors #66

Aaron1011 · 2026-01-29T19:19:48Z

Currently, we classify only a few error types (including errors from user steps) as retryable. Everything else is non-retryable and causes the task to fail immediately, without any retries.

Note

Medium Risk
Changes task failure/retry semantics across both Rust worker logic and the durable.fail_run Postgres function, which can alter how many attempts are created and when tasks become terminal. Moderate risk due to workflow correctness implications if error classification is wrong or migration rollout is incomplete.

Overview
Non-retryable errors now fail tasks immediately instead of scheduling retries. The durable.fail_run stored procedure gains a p_force_fail flag; when set, it skips retry-time computation and run creation and marks the task terminal.
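The effect of the flag can be sketched as a small decision function. This is only an illustrative Rust model of the behavior described above, not the stored procedure itself: `FailOutcome`, `fail_run_outcome`, and the `max_attempts` limit are hypothetical names introduced here, and the real `durable.fail_run` logic lives in Postgres and may differ in detail.

```rust
/// Hypothetical model of what happens to a task after a failed run.
#[derive(Debug, PartialEq)]
enum FailOutcome {
    /// A new run is created for a later attempt.
    RetryScheduled { next_attempt: u32 },
    /// The task is marked terminal; no further runs are created.
    TaskFailed,
}

/// Sketch of the decision described for `durable.fail_run`.
/// `max_attempts` is an assumed retry limit (None = unlimited).
fn fail_run_outcome(attempt: u32, max_attempts: Option<u32>, force_fail: bool) -> FailOutcome {
    // With the force-fail flag set, skip retry computation entirely.
    if force_fail {
        return FailOutcome::TaskFailed;
    }
    match max_attempts {
        // Attempts exhausted: the task becomes terminal.
        Some(max) if attempt >= max => FailOutcome::TaskFailed,
        // Otherwise schedule another attempt.
        _ => FailOutcome::RetryScheduled { next_attempt: attempt + 1 },
    }
}
```

Under this model, a first attempt that fails with `force_fail = true` is terminal even if the assumed limit would otherwise allow more attempts.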

Error classification and propagation were tightened in Rust. TaskError replaces the generic internal-error variant with Step and TaskPanicked, adds retryable() to drive retry decisions, and removes the blanket From<sqlx::Error> impl in favor of from_sqlx_error. TaskContext now wraps user step failures as TaskError::Step, and the worker passes force_fail = !error.retryable() when calling durable.fail_run.
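A minimal sketch of how such a classification might look follows. The variant payloads, the `User` variant's shape, and the helper `force_fail_for` are assumptions made for illustration; the summary does not show the real `TaskError` definition. Treating only `Step` as retryable follows the PR description (user step errors are retryable) and the new test (User errors are not retried); the classification of `TaskPanicked` here is a guess.

```rust
/// Hypothetical, simplified stand-in for the crate's `TaskError`.
#[derive(Debug)]
enum TaskError {
    /// A failure raised from inside a user step (wrapped by the task context).
    Step(String),
    /// The task body panicked.
    TaskPanicked(String),
    /// A user-level task error outside of a step.
    User(String),
}

impl TaskError {
    /// Returns true if a failed run should be retried.
    /// Assumption: only step failures are retryable in this sketch.
    fn retryable(&self) -> bool {
        matches!(self, TaskError::Step(_))
    }
}

/// Computes the flag the worker would pass when reporting a failed run.
fn force_fail_for(error: &TaskError) -> bool {
    !error.retryable()
}
```

Putting the decision in one method on the error type keeps the worker's call site to a single negation, so adding a variant only requires revisiting `retryable()`.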

Tests were updated/added to reflect the new semantics. Existing retry/checkpoint tests were adjusted for an extra checkpointed maybe_fail step, and a new retry test asserts User errors are not retried even when a retry strategy is configured.

Written by Cursor Bugbot for commit 7e69d63.


Aaron1011 added 2 commits January 29, 2026 14:09

Fail durable tasks immediately for non-retryable errors

6f21e76

Currently, we classify only a few error types (including errors from user steps) as retryable. Everything else is non-retryable and causes the task to fail immediately, without any retries.

Run fmt

7e69d63

virajmehta approved these changes Jan 29, 2026


virajmehta added this pull request to the merge queue Jan 29, 2026

Merged via the queue into main with commit 8248424 Jan 29, 2026
2 checks passed
