Should model retry behavior be configurable? #195
FredKSchott
started this conversation in
Feature Request
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Flue currently retries clearly transient model-generation failures automatically using internal defaults: retries are enabled, with up to 3 retries and bounded jittered exponential backoff. Provider SDK retry settings remain separate.\n\nShould Flue expose configuration for this behavior? A possible shape could allow applications to disable Flue retries or tune retry count and delay, but we would like to understand real use cases before committing to a public API.\n\nFeedback that would help:\n\n- Do the current defaults cause a problem for your workload?\n- Do you need to disable retries, tune retry count, tune delays, or customize which errors retry?\n- Should configuration apply per agent profile, per operation, or globally?\n- How do you expect Flue retry behavior to interact with provider SDK retry configuration?\n\nIf you have a concrete example, please share the provider, failure mode, latency or cost constraint, and desired behavior.
Beta Was this translation helpful? Give feedback.
All reactions