feat: exponential retry decorator #88

a-klos · 2025-09-02T08:32:55Z

This pull request introduces a robust, configurable retry decorator with exponential backoff and rate-limit handling, and integrates it across the RAG stack for both the embedder and summarizer components. The retry behavior is now centrally managed, with clear support for both global and per-component overrides via environment variables and Helm chart values. The documentation has been updated to explain configuration and usage, and the Helm templates and values have been extended to support the new settings.

Retry decorator integration and configuration:

* Added a shared retry decorator (`retry_with_backoff`) in `rag-core-lib`, with support for both sync and async callables, rate-limit awareness, and extensive configuration via environment variables or Helm values. Documentation in `libs/README.md` details usage, configuration, and advanced features.
* Updated Helm chart templates and values to define and inject retry-related settings for both backend and admin-backend deployments. This includes new ConfigMaps, environment variable wiring, and appropriate value structure in `infrastructure/rag/values.yaml` and related templates. [1] [2] [3] [4] [5] [6] [7]
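
For readers who have not opened the diff, here is a minimal sketch of how one decorator can serve both sync and async callables with exponential backoff. It is not the code from this PR: the name `retry_sketch`, its parameters, and the defaults are hypothetical, and rate-limit handling is left out.

```python
import asyncio
import functools
import inspect
import time


def retry_sketch(max_retries=3, base_delay=0.01, backoff_factor=2.0, exceptions=(Exception,)):
    """Retry a sync or async callable with exponential backoff (illustrative only)."""

    def decorator(func):
        if inspect.iscoroutinefunction(func):

            @functools.wraps(func)
            async def async_wrapper(*args, **kwargs):
                for attempt in range(max_retries + 1):
                    try:
                        return await func(*args, **kwargs)
                    except exceptions:
                        if attempt == max_retries:
                            raise  # retries exhausted, surface the last error
                        # Non-blocking wait: base_delay * backoff_factor^attempt
                        await asyncio.sleep(base_delay * backoff_factor**attempt)

            return async_wrapper

        @functools.wraps(func)
        def sync_wrapper(*args, **kwargs):
            for attempt in range(max_retries + 1):
                try:
                    return func(*args, **kwargs)
                except exceptions:
                    if attempt == max_retries:
                        raise  # retries exhausted, surface the last error
                    # Blocking wait with the same schedule as the async path
                    time.sleep(base_delay * backoff_factor**attempt)

        return sync_wrapper

    return decorator
```

Choosing the wrapper once at decoration time keeps the async path free of blocking `time.sleep` calls.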

Embedder and summarizer retry logic:

* The `StackitEmbedder` (backend) and `LangchainSummarizer` (admin-backend) now both use the shared retry decorator, with per-component settings overriding global defaults as needed. This is documented in detail in `libs/README.md` and supported by new environment variable keys and Helm values. [1] [2]
* The dependency injection container for the admin API library (`DependencyContainer`) now wires the new `retry_decorator_settings` and passes it to the summarizer implementation, ensuring the retry logic is properly configured at runtime. [1] [2] [3]
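
The "per-component settings overriding global defaults" rule can be illustrated with a small sketch. It assumes that an unset component field is represented as `None`. The class names, the helper `resolve_retry_settings`, and the default values are hypothetical; only the field names appear in this PR.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass(frozen=True)
class GlobalRetryDefaults:
    # Hypothetical defaults, standing in for the shared retry settings.
    max_retries: int = 5
    retry_base_delay: float = 0.5
    retry_max_delay: float = 600.0


@dataclass(frozen=True)
class ComponentRetryOverrides:
    # None means "not set for this component, use the global value".
    max_retries: Optional[int] = None
    retry_base_delay: Optional[float] = None
    retry_max_delay: Optional[float] = None


def resolve_retry_settings(overrides, defaults):
    """Return effective settings: component value if set, else the global default."""
    resolved = {}
    for field in fields(defaults):
        override = getattr(overrides, field.name, None)
        # Compare against None explicitly so that an override of 0 is honored.
        resolved[field.name] = getattr(defaults, field.name) if override is None else override
    return GlobalRetryDefaults(**resolved)
```

For example, `resolve_retry_settings(ComponentRetryOverrides(max_retries=2), GlobalRetryDefaults())` keeps the global delays and changes only `max_retries`.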

Documentation improvements:

* Expanded `libs/README.md` to include new sections describing the retry decorator, its configuration (including environment variables and Helm usage), and how the embedder and summarizer resolve their retry settings. [1] [2] [3] [4]
* Minor documentation clarifications and code formatting improvements in `libs/README.md`. [1] [2] [3] [4] [5]

Settings and type improvements:

Extended SummarizerSettings to support optional retry-related fields, aligning with the new decorator's configuration model.

These changes centralize and standardize retry logic across the stack, making it easier to tune reliability and rate-limiting behavior per environment and per component.


Copilot

Pull Request Overview

This PR introduces a robust retry decorator with exponential backoff and rate-limit handling for both synchronous and asynchronous functions, enabling configurable retry behavior through environment variables and Kubernetes configuration.

* Implements a configurable retry decorator with exponential backoff, jitter, and rate-limit awareness
* Integrates retry settings into infrastructure configuration via Helm values and ConfigMaps
* Adds comprehensive test coverage for retry scenarios including rate-limit handling
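
As a rough illustration of "exponential backoff, jitter", the sketch below computes the wait before each retry. The function name, parameter names, and defaults are assumptions, and so is the choice to add the jitter after capping. The PR may combine these values differently.

```python
import random


def backoff_delay(attempt, base_delay=0.5, backoff_factor=2.0, max_delay=60.0,
                  jitter_min=0.0, jitter_max=0.25):
    """Seconds to wait before retry number `attempt` (0-based); illustrative only."""
    # Exponential growth, capped so late attempts do not wait unboundedly long.
    exponential = min(max_delay, base_delay * backoff_factor ** attempt)
    # Random jitter spreads out clients that failed at the same moment.
    # It is added after the cap, so the result can exceed max_delay by up to jitter_max.
    return exponential + random.uniform(jitter_min, jitter_max)
```

With jitter disabled (`jitter_min=jitter_max=0`), the base schedule for these defaults is 0.5 s, 1 s, 2 s, 4 s and so on, up to the 60 s cap.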

Reviewed Changes

Copilot reviewed 11 out of 12 changed files in this pull request and generated 2 comments.


| File | Description |
| --- | --- |
| `retry_decorator_test.py` | Comprehensive test suite covering sync/async retry scenarios and rate-limit handling |
| `utils.py` | Utility functions for parsing rate-limit headers and extracting exception metadata |
| `retry_decorator.py` | Core retry decorator implementation with exponential backoff and rate-limit awareness |
| `retry_decorator_settings.py` | Pydantic settings model for configuring retry behavior via environment variables |
| `pyproject.toml` | Adds pytest-asyncio dependency for async test support |
| `README.md` | Documentation for retry decorator usage and configuration |
| `values.yaml` | Default retry configuration values for Helm deployment |
| `configmap.yaml` | ConfigMap template for retry decorator environment variables |
| `deployment.yaml` files | Integration of retry decorator ConfigMap into backend deployments |
| `_helpers.tpl` | Helm template helper for retry decorator ConfigMap naming |
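
The summary above describes `retry_decorator_settings.py` as a Pydantic settings model populated from environment variables. To stay dependency-free, the sketch below shows the same idea with a standard-library dataclass instead of Pydantic. The class name, the `RETRY_SKETCH_` prefix, and the defaults are hypothetical and do not reflect the real variable names.

```python
import os
from dataclasses import dataclass, fields


@dataclass(frozen=True)
class RetrySettingsSketch:
    max_retries: int = 5            # hypothetical default
    retry_base_delay: float = 0.5   # hypothetical default, seconds
    retry_max_delay: float = 600.0  # hypothetical default, seconds

    @classmethod
    def from_env(cls, environ=None, prefix="RETRY_SKETCH_"):
        """Build settings from variables such as RETRY_SKETCH_MAX_RETRIES."""
        env = os.environ if environ is None else environ
        overrides = {}
        for field in fields(cls):
            raw = env.get(prefix + field.name.upper())
            if raw is not None and raw.strip() != "":
                # Cast the string using the type of the field's default value.
                overrides[field.name] = type(field.default)(raw)
        return cls(**overrides)
```

In a Helm deployment, such variables would typically reach the container through the ConfigMap listed in the table. Unset variables fall back to the defaults.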



This pull request introduces enhanced configurability and reliability to the summarization workflow by adding granular retry and concurrency settings, and refactoring the summarizer to use them. The changes allow for more robust handling of transient failures and better control over resource usage.

**Configuration enhancements:**

* Added new retry-related fields (e.g., `max_retries`, `retry_base_delay`, `retry_max_delay`, `backoff_factor`, `attempt_cap`, `jitter_min`, `jitter_max`) to the `SummarizerSettings` class, allowing fine-grained control over retry behavior for summarization tasks. [[1]](diffhunk://#diff-ceade27a403894bb34e6c3c94bca8739203875d1037ce8896348b9eeb377dbcbL15-R31) [[2]](diffhunk://#diff-ceade27a403894bb34e6c3c94bca8739203875d1037ce8896348b9eeb377dbcbL26-R82)
* Fixed a typo in the `SummarizerSettings` field name from `maximum_concurrreny` to `maximum_concurrency`. [[1]](diffhunk://#diff-ceade27a403894bb34e6c3c94bca8739203875d1037ce8896348b9eeb377dbcbL15-R31) [[2]](diffhunk://#diff-ceade27a403894bb34e6c3c94bca8739203875d1037ce8896348b9eeb377dbcbL26-R82)

**Dependency injection and wiring:**

* Registered `RetryDecoratorSettings` in the dependency container and passed both summarizer and global retry settings to the `LangchainSummarizer` instance, enabling summarizer-specific overrides. [[1]](diffhunk://#diff-8b7c1816cb3e0a40b7965721c550eefdc184c5d914ec023e36527255613381e7R67) [[2]](diffhunk://#diff-8b7c1816cb3e0a40b7965721c550eefdc184c5d914ec023e36527255613381e7R90) [[3]](diffhunk://#diff-8b7c1816cb3e0a40b7965721c550eefdc184c5d914ec023e36527255613381e7L139-R143)

**Summarizer logic refactoring:**

* Refactored the summarization logic in `LangchainSummarizer` to:
  - Use asynchronous chunk summarization with concurrency control via a semaphore.
  - Implement retry logic with exponential backoff and jitter for chunk summarization, using the new settings for configuration.
  - Clean up error handling and remove redundant retry code in favor of the new decorator-based approach. [[1]](diffhunk://#diff-9793b1081628436dd7d5a0e37abc9d79ee5e25af3f5e784f99379249809ed8dbR3-R21) [[2]](diffhunk://#diff-9793b1081628436dd7d5a0e37abc9d79ee5e25af3f5e784f99379249809ed8dbR39-R47) [[3]](diffhunk://#diff-9793b1081628436dd7d5a0e37abc9d79ee5e25af3f5e784f99379249809ed8dbL68-R161)

These changes collectively improve the reliability, configurability, and maintainability of the summarization pipeline.

**Partly fixes the following issue:** #87

---------

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
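
The semaphore-based concurrency control mentioned in the summarizer refactoring can be sketched as follows. This is an illustration under assumptions, not the `LangchainSummarizer` code: `summarize_chunks` and the `summarize_one` callback are hypothetical names, and only `maximum_concurrency` is taken from the settings described above.

```python
import asyncio


async def summarize_chunks(chunks, summarize_one, maximum_concurrency=2):
    """Summarize all chunks, running at most `maximum_concurrency` calls at once."""
    semaphore = asyncio.Semaphore(maximum_concurrency)

    async def guarded(chunk):
        # Each task waits here until one of the semaphore slots is free.
        async with semaphore:
            return await summarize_one(chunk)

    # gather returns results in the same order as the input chunks.
    return await asyncio.gather(*(guarded(chunk) for chunk in chunks))
```

In a design like this, `summarize_one` is where a retry decorator would be applied, so each chunk is retried independently while the semaphore bounds the number of in-flight calls.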

This pull request introduces configurable retry behavior for the `StackitEmbedder`, allowing for fine-grained control of retry and backoff parameters via environment variables, Helm chart values, or code. The changes ensure that retry settings can be overridden per embedder instance, falling back to shared defaults when not specified. Documentation and dependency injection are updated to reflect this new flexibility.

**Embedder Retry Configuration**

* Added new optional retry-related fields (`max_retries`, `retry_base_delay`, `retry_max_delay`, `backoff_factor`, `attempt_cap`, `jitter_min`, `jitter_max`) to the `StackitEmbedderSettings` model, allowing per-embedder overrides of retry/backoff parameters. [[1]](diffhunk://#diff-0e502aa8b53287c8b12f5f4d053e9ae904620403c6c502df29f9673f4ae88d09R21-R34) [[2]](diffhunk://#diff-0e502aa8b53287c8b12f5f4d053e9ae904620403c6c502df29f9673f4ae88d09R46-R86)
* Updated the `StackitEmbedder` implementation to use a shared retry decorator with exponential backoff, resolving settings from both `StackitEmbedderSettings` and fallback `RetryDecoratorSettings`. The retry logic now handles OpenAI API errors and rate limits robustly. [[1]](diffhunk://#diff-7ebf8bf6adafb79699aea6bcd32de76398f9734e795f1d3c53cf524f1d69a5a1L4-R38) [[2]](diffhunk://#diff-7ebf8bf6adafb79699aea6bcd32de76398f9734e795f1d3c53cf524f1d69a5a1R63-R73) [[3]](diffhunk://#diff-7ebf8bf6adafb79699aea6bcd32de76398f9734e795f1d3c53cf524f1d69a5a1L72-R138)

**Dependency Injection and Configuration**

* Modified the dependency container to inject both `StackitEmbedderSettings` and `RetryDecoratorSettings` into the `StackitEmbedder`, supporting the new configuration pattern. [[1]](diffhunk://#diff-483b37f4ebbc24c973c3b170542171d90c65f3c6b68f1a6d598ce8964a94be7bR66) [[2]](diffhunk://#diff-483b37f4ebbc24c973c3b170542171d90c65f3c6b68f1a6d598ce8964a94be7bR93) [[3]](diffhunk://#diff-483b37f4ebbc24c973c3b170542171d90c65f3c6b68f1a6d598ce8964a94be7bL101-R103)
* Added corresponding environment variable keys to the Helm chart (`values.yaml`), enabling retry configuration via deployment configuration for both backend and adminBackend services. [[1]](diffhunk://#diff-673dd2d3d4e66a8fd4e45f9c1c9900711313f946bf8b6a89e96c954988fc14f3R195-R202) [[2]](diffhunk://#diff-673dd2d3d4e66a8fd4e45f9c1c9900711313f946bf8b6a89e96c954988fc14f3R325)

**Documentation Updates**

* Documented the new retry configuration mechanism in `libs/README.md`, explaining how override and fallback resolution works, and how to configure via environment variables and Helm chart values. [[1]](diffhunk://#diff-34194a117b05d75d22ca968cdb7d540839dc7a0eb33960fbca668b5a6ade87cbR11) [[2]](diffhunk://#diff-34194a117b05d75d22ca968cdb7d540839dc7a0eb33960fbca668b5a6ade87cbR103-R128)

**Addresses the following issue:** #87
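
The embedder description mentions handling rate limits, but it does not say which response headers are consulted. As one plausible illustration, the sketch below reads the standard HTTP `Retry-After` header, which may carry either a number of seconds or an HTTP date. The helper name `retry_after_seconds` is hypothetical, and the PR's own utilities may rely on different headers.

```python
from datetime import datetime, timezone
from email.utils import parsedate_to_datetime


def retry_after_seconds(headers, now=None):
    """Return the wait in seconds requested by a Retry-After header, or None."""
    # Header names are case-insensitive, so compare in lower case.
    value = next((v for k, v in headers.items() if k.lower() == "retry-after"), None)
    if value is None:
        return None
    value = value.strip()
    try:
        return max(0.0, float(value))  # delta-seconds form, e.g. "2"
    except ValueError:
        pass
    try:
        target = parsedate_to_datetime(value)  # HTTP-date form
    except (TypeError, ValueError):
        return None  # unparseable value, caller falls back to normal backoff
    if target.tzinfo is None:
        target = target.replace(tzinfo=timezone.utc)
    now = now or datetime.now(timezone.utc)
    return max(0.0, (target - now).total_seconds())
```

A retry loop could use the returned value in place of its computed backoff whenever the server supplies one.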


a-klos added 7 commits September 2, 2025 07:59

feat: implement retry decorator with exponential backoff and rate limit handling

1db3fe7

refactor: improve retry decorator by separating async and sync logic; normalize dict items in utils

cbdd287

test: add comprehensive tests for retry decorator with async and sync handling

815ca87

feat: add retry decorator configuration and update deployment templates

cdb5445

docs: add documentation for retry decorator with exponential backoff and configuration details

72e2480

feat: add pytest-asyncio dependency for improved async testing support

bde7a8c

Merge branch 'main' into feat/exponential-retry-decorator

b65c696

huhn511 requested a review from Copilot September 23, 2025 15:13

Copilot AI reviewed Sep 23, 2025

libs/rag-core-lib/src/rag_core_lib/impl/settings/retry_decorator_settings.py (outdated, resolved)

libs/rag-core-lib/src/rag_core_lib/impl/utils/retry_decorator.py (outdated, resolved)

a-klos requested a review from manu-hoffmann September 24, 2025 09:44

a-klos and others added 4 commits September 24, 2025 12:08

docs: Update libs/rag-core-lib/src/rag_core_lib/impl/settings/retry_decorator_settings.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

657715c

chore: Update libs/rag-core-lib/src/rag_core_lib/impl/utils/retry_decorator.py

Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>

f94ab4e

chore: merge main

2dd24a0

Merge branch 'feat/exponential-retry-decorator' of github.com:stackitcloud/rag-template into feat/exponential-retry-decorator

ae1f8e1

manu-hoffmann approved these changes Oct 9, 2025

libs/rag-core-lib/src/rag_core_lib/impl/settings/retry_decorator_settings.py (outdated, resolved)

libs/rag-core-lib/src/rag_core_lib/impl/utils/retry_decorator.py (resolved)

a-klos and others added 8 commits October 9, 2025 09:45

refactor: remove redundant validation checks in RetryDecoratorSettings

2866593

chore: merge main

f87d9a0

refactor: clean up import statements and streamline retry decorator settings initialization

ba120c1

refactor: improve readability of settings initialization in create_retry_decorator_settings

63b9eae

refactor: enhance settings initialization and validation in Summarizer and StackitEmbedder settings

c317897

refactor: add validation for jitter_min and jitter_max in Summarizer and StackitEmbedder settings

b4f291b

a-klos merged commit 62883be into main Oct 9, 2025
12 checks passed

a-klos deleted the feat/exponential-retry-decorator branch October 9, 2025 10:33
