fix: timeouts with delays #80

TheGreatAlgo · 2025-07-29T14:52:03Z

Summary by CodeRabbit

New Features
- Added configurable HTTP retry with exponential backoff and jitter for save/load operations.
- Introduced options to set max retries, initial delay, and backoff factor.
- Improved timeout error messages; non-timeout HTTP errors no longer trigger retries.
Tests
- Added async tests for retry behavior, backoff timing, timeout exhaustion, and parameter validation.
- Updated gateway tests to disable retries where appropriate.

coderabbitai · 2025-07-29T14:52:10Z

Tip

🔌 Remote MCP (Model Context Protocol) integration is now available!

Pro plan users can now connect to remote MCP servers from the Integrations page. Connect with popular remote MCPs such as Notion and Linear to add more context to your reviews and chats.

✨ Finishing Touches

📝 Generate Docstrings

🧪 Generate unit tests

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch timeouts

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

Review comments: Directly reply to a review comment made by CodeRabbit. Example:
- I pushed a fix in commit <commit_id>, please review it.
- Open a follow-up GitHub issue for this discussion.
Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query.
PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
- @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
- @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.

Support

Need help? Create a ticket on our support page for assistance with any issues or questions.

CodeRabbit Commands (Invoked using PR/Issue comments)

Type @coderabbitai help to get the list of available commands.

Other keywords and placeholders

Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (`.coderabbit.yaml`)

You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
Please see the configuration documentation for more information.
If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Status, Documentation and Community

Visit our Status Page to check the current availability of CodeRabbit.
Visit our Documentation for detailed information on how to use CodeRabbit.
Join our Discord Community to get help, request features, and share feedback.
Follow us on X/Twitter for updates and announcements.

coderabbitai

Actionable comments posted: 3

🧹 Nitpick comments (1)

py_hamt/store_httpx.py (1)

309-347: Consider extracting retry logic to reduce code duplication

The retry logic is duplicated between save() and load() methods. This violates the DRY principle and makes maintenance harder.

Consider extracting the retry logic into a generic helper method:

async def _retry_with_backoff(self, operation_name: str, operation_callable):
    """Execute an operation with retry and exponential backoff."""
    retry_count = 0
    
    while retry_count <= self.max_retries:
        try:
            return await operation_callable()
        except (httpx.TimeoutException, httpx.RequestError) as e:
            retry_count += 1
            if retry_count > self.max_retries:
                raise httpx.TimeoutException(
                    f"Failed to {operation_name} after {self.max_retries} retries: {str(e)}",
                    request=e.request if isinstance(e, httpx.RequestError) else None,
                )
            
            # Calculate backoff delay
            delay = self.initial_delay * (self.backoff_factor ** (retry_count - 1))
            # Add jitter to prevent thundering herd
            jitter = delay * 0.1 * (random.random() - 0.5)
            await asyncio.sleep(delay + jitter)
        
        except httpx.HTTPStatusError:
            # Re-raise non-timeout HTTP errors immediately
            raise
    
    raise RuntimeError("Exited the retry loop unexpectedly.")

Then simplify the methods to use this helper.

Also applies to: 350-383

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 964496b and 6cd7673.

📒 Files selected for processing (3)

py_hamt/store_httpx.py (4 hunks)
tests/test_kubo_cas.py (2 hunks)
tests/test_public_gateway.py (3 hunks)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Create project environment, run all checks (3.12)

🔇 Additional comments (6)

py_hamt/store_httpx.py (1)

2-2: LGTM!

The addition of the random module is appropriate for implementing jitter in the retry backoff mechanism.

tests/test_public_gateway.py (1)

107-107: LGTM!

Setting max_retries=0 appropriately disables the retry mechanism for these public gateway tests, ensuring they test direct request handling without retry interference.

Also applies to: 138-138, 201-203

tests/test_kubo_cas.py (4)

171-260: LGTM!

Comprehensive test coverage for retry behavior, including success scenarios and max retry failures for both save and load operations.

262-305: LGTM!

Well-designed test that properly verifies exponential backoff timing with appropriate tolerances for jitter.

307-336: LGTM!

Correctly verifies that HTTP status errors bypass the retry mechanism and are immediately re-raised.

338-349: Tests may need update based on parameter validation

These tests use max_retries=-1 to trigger the RuntimeError. If parameter validation is added as suggested, these tests would fail during initialization with a ValueError instead.

Consider updating these tests to use a different approach to trigger the RuntimeError, or adjust them to test the parameter validation instead.

py_hamt/store_httpx.py

codecov-commenter · 2025-08-21T08:02:13Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 100.00%. Comparing base (45f276a) to head (8cca5fc).

Additional details and impacted files

@@            Coverage Diff            @@
##              main       #80   +/-   ##
=========================================
  Coverage   100.00%   100.00%           
=========================================
  Files            5         5           
  Lines          673       707   +34     
=========================================
+ Hits           673       707   +34

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

coderabbitai

Actionable comments posted: 0

♻️ Duplicate comments (2)

py_hamt/store_httpx.py (2)

231-240: Parameter validation implemented — resolves prior review

The validation for max_retries, initial_delay, and backoff_factor is in place and matches the earlier recommendation.

386-389: Jitter now centered around zero — thundering herd mitigation fixed

The jitter formula uses (random.random() - 0.5), correctly centering around zero. This resolves the earlier issue where all jitter was positive.

Also applies to: 422-425

🧹 Nitpick comments (5)

py_hamt/store_httpx.py (3)

115-131: Document the new retry/backoff parameters in the constructor docs

The “Parameters” section lists hasher/client/headers/auth/rpc_base_url/gateway_base_url/chunker but omits the new retry controls. Please add brief bullets for max_retries (int, retries after the first attempt), initial_delay (seconds), and backoff_factor (multiplier). This avoids ambiguity about whether “N retries” includes the initial attempt (it does not: attempts = 1 + max_retries).

     ### Parameters
     - **hasher** (str): multihash name (defaults to *blake3*).
@@
     - **chunker** (str): chunking algorithm specification for Kubo's `add`
       RPC. Accepted formats are `"size-<positive int>"`, `"rabin"`, or
       `"rabin-<min>-<avg>-<max>"`.
+    - **max_retries** (int): number of retry attempts on transient network errors
+      (timeouts/request errors). Total attempts = 1 + `max_retries`. Default: 3.
+    - **initial_delay** (float): base backoff delay in seconds before the first
+      retry. Default: 1.0.
+    - **backoff_factor** (float): multiplicative backoff factor applied per retry
+      (k-th delay = initial_delay * backoff_factor**(k-1)). Must be ≥ 1.0.

354-371: Deduplicate retry/backoff logic between save() and load()

The retry loops in save and load are nearly identical. Consider extracting a small helper (e.g., _request_with_retries) to centralize backoff and error wrapping, which reduces maintenance overhead and the chance of behavioral drift between the two paths.

# Example helper (add inside KuboCAS)
async def _request_with_retries(self, op_name: str, fn: callable):
    retry = 0
    while True:
        try:
            return await fn()
        except (httpx.TimeoutException, httpx.RequestError) as e:
            if retry >= self.max_retries:
                raise httpx.TimeoutException(
                    f"Failed to {op_name} data after {self.max_retries} retries: {e}",
                    request=e.request if isinstance(e, httpx.RequestError) else None,
                )
            delay = self.initial_delay * (self.backoff_factor ** retry)
            jitter = delay * 0.1 * (random.random() - 0.5)
            await asyncio.sleep(max(0.0, delay + jitter))
            retry += 1

# Usage sketch:
async with self._sem:
    client = self._loop_client()
    response = await self._request_with_retries(
        "save", lambda: client.post(self.rpc_url, files=files, timeout=60.0)
    )

Also applies to: 372-393

388-389: Harden sleep against rare negative values (future-proofing)

While current validation guarantees delay > 0 and ensures total (delay + jitter) ≥ 0.95*delay, adding a max(0.0, ...) guard makes this robust against future changes.

-                    await asyncio.sleep(delay + jitter)
+                    await asyncio.sleep(max(0.0, delay + jitter))

Also applies to: 424-425

tests/test_public_gateway.py (1)

178-182: Also disable retries for public-gateway loop to reduce flakiness

In the for-loop instantiations, consider passing max_retries=0 to avoid long backoffs if ipfs.io/dweb.link is slow. You’re already skipping when the gateway is down, so retries mainly add latency.
-        cas = KuboCAS(
+        cas = KuboCAS(
             rpc_base_url="http://127.0.0.1:5001",  # Keep local RPC for saves
             gateway_base_url=gateway_url,  # Use specified gateway for loads
-        )
+            max_retries=0,
+        )

tests/test_kubo_cas.py (1)

231-248: Minor: align failing request method with the patched client method

You reuse a single failing_method for both post and get, but it always constructs a “POST” request. This is harmless, but using method-appropriate dummy requests improves error messages and test readability.

-    async def failing_method(url, **kwargs):
-        dummy_request = httpx.Request(
-            "POST", url
-        )  # Create the dummy request
+    async def failing_post(url, **kwargs):
+        dummy_request = httpx.Request("POST", url)
         raise httpx.TimeoutException(
             "Simulated timeout", request=dummy_request
         )
@@
-                    with patch.object(
-                        httpx.AsyncClient,
-                        "post",
-                        new=AsyncMock(side_effect=failing_method),
-                    ):
-                        with patch.object(
-                            httpx.AsyncClient,
-                            "get",
-                            new=AsyncMock(side_effect=failing_method),
-                        ):
+                    with patch.object(
+                        httpx.AsyncClient,
+                        "post",
+                        new=AsyncMock(side_effect=failing_post),
+                    ):
+                        async def failing_get(url, **kwargs):
+                            dummy_request = httpx.Request("GET", url)
+                            raise httpx.TimeoutException(
+                                "Simulated timeout", request=dummy_request
+                            )
+                        with patch.object(
+                            httpx.AsyncClient, "get", new=AsyncMock(side_effect=failing_get)
+                        ):

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

💡 Knowledge Base configuration:

MCP integration is disabled by default for public repositories
Jira integration is disabled by default for public repositories
Linear integration is disabled by default for public repositories

You can enable these sources in your CodeRabbit configuration.

📥 Commits

Reviewing files that changed from the base of the PR and between 6cd7673 and 8cca5fc.

📒 Files selected for processing (3)

py_hamt/store_httpx.py (4 hunks)
tests/test_kubo_cas.py (2 hunks)
tests/test_public_gateway.py (3 hunks)

🧰 Additional context used

🧬 Code graph analysis (3)

tests/test_public_gateway.py (2)

py_hamt/store_httpx.py (2)

KuboCAS (76-429)

aclose (283-300)

tests/test_kubocas_auth.py (2)

test_user_supplied_client_auth (36-56)

test_internal_client_headers (27-32)

tests/test_kubo_cas.py (1)

py_hamt/store_httpx.py (7)

KuboCAS (76-429)

save (28-32)

save (49-52)

save (354-393)

load (35-36)

load (54-73)

load (395-429)

py_hamt/store_httpx.py (1)

py_hamt/hamt.py (4)

load (137-138)

load (158-167)

load (275-284)

get (592-599)

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)

GitHub Check: Create project environment, run all checks (3.12)

🔇 Additional comments (6)

py_hamt/store_httpx.py (1)

150-153: Retry/backoff knobs added as keyword-only — API change looks good

Adding max_retries, initial_delay, and backoff_factor at the end of the keyword-only section preserves backward compatibility and provides useful control for flaky networks. No concerns on the signature itself.

tests/test_public_gateway.py (1)

152-156: Setting max_retries=0 in gateway tests improves determinism and speed

Good call disabling retries where we’re asserting URL construction and local gateway behavior. This keeps CI fast and avoids backoff noise.

Also applies to: 218-223, 282-287, 295-298
tests/test_kubo_cas.py (4)
262-305: Backoff timing assertions are well-bounded given jitter

Nice bounds: they tolerate ±10% around the base delays, which safely encloses the ±5% jitter window.

320-336: No-retry on HTTP 5xx is verified correctly

The test ensures httpx.HTTPStatusError is surfaced immediately and that asyncio.sleep is not called. This matches the production behavior.

339-374: Constructor validation coverage is comprehensive

Edge cases and negative paths for max_retries, initial_delay, and backoff_factor are well covered; also validates the equality edges (0 retries, factor 1.0).

181-189: Fix construction of dummy httpx objects in mocks (invalid kwargs used)

Two issues will break these tests:

httpx.Request does not accept files=...

httpx.Response does not accept json=... (provide JSON via content and content-type)

Patch the mock to build valid httpx objects.
@@
-    async def mock_post(url, **kwargs):
+    async def mock_post(url, **kwargs):
         nonlocal timeout_count
-        # Manually create a dummy request object
-        dummy_request = httpx.Request("POST", url, files=kwargs.get("files"))
+        # Manually create a dummy request object
+        dummy_request = httpx.Request("POST", url)
         if timeout_count < successful_after:
             timeout_count += 1
             raise httpx.TimeoutException("Simulated timeout", request=dummy_request)
-        return httpx.Response(200, json={"Hash": test_cid}, request=dummy_request)
+        import json
+        payload = json.dumps({"Hash": test_cid}).encode("utf-8")
+        return httpx.Response(
+            200,
+            content=payload,
+            headers={"content-type": "application/json"},
+            request=dummy_request,
+        )
Additionally, ensure json is imported once at the top of the test module for clarity.
@@
-import asyncio
+import asyncio
+import json
Likely an incorrect or invalid review comment.

0xSwego

LGTM!

Faolain · 2025-08-27T05:55:48Z

lgtm as well

fix: timeouts with delays

6cd7673

coderabbitai bot reviewed Jul 29, 2025

View reviewed changes

py_hamt/store_httpx.py Show resolved Hide resolved

py_hamt/store_httpx.py Outdated Show resolved Hide resolved

py_hamt/store_httpx.py Outdated Show resolved Hide resolved

TheGreatAlgo added 8 commits August 21, 2025 03:33

Merge branch 'main' into timeouts

cf83466

fix: update indent

7dee663

fix: update test

ca3dbb9

fix: add max_retries

e63fb75

fix: jitter and coverage

3a95db7

fix: precommit

a8e1e83

fix: remove duplicate

0043201

fix: error back in

8cca5fc

coderabbitai bot reviewed Aug 21, 2025

View reviewed changes

0xSwego approved these changes Aug 25, 2025

View reviewed changes

Faolain merged commit de33efe into main Aug 27, 2025
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: timeouts with delays #80

fix: timeouts with delays #80

Uh oh!

TheGreatAlgo commented Jul 29, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Jul 29, 2025 •

edited

Loading

Chat

Support

CodeRabbit Commands (Invoked using PR/Issue comments)

Other keywords and placeholders

CodeRabbit Configuration File (`.coderabbit.yaml`)

Status, Documentation and Community

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Aug 21, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

0xSwego left a comment

Uh oh!

Faolain commented Aug 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

fix: timeouts with delays #80

fix: timeouts with delays #80

Uh oh!

Conversation

TheGreatAlgo commented Jul 29, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Jul 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Chat

Support

CodeRabbit Commands (Invoked using PR/Issue comments)

Other keywords and placeholders

CodeRabbit Configuration File (.coderabbit.yaml)

Status, Documentation and Community

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Aug 21, 2025

Codecov Report

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

0xSwego left a comment

Choose a reason for hiding this comment

Uh oh!

Faolain commented Aug 27, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

TheGreatAlgo commented Jul 29, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Jul 29, 2025 •

edited

Loading

CodeRabbit Configuration File (`.coderabbit.yaml`)