
Conversation


@haljet-chain haljet-chain commented Oct 21, 2025

Overview: This PR introduces a new service for asynchronously processing reports, laying the groundwork for future AI orchestration.

Changes

  • Created app/services/report_processor.py to handle report generation.
  • Implemented an async process_report function that simulates background work using asyncio.sleep().
  • Added logic to update a shared state, marking reports as 'completed' after processing.
  • This service is designed for integration with the AI orchestration module in upcoming changes.
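A minimal sketch of what such a service could look like, based only on the behavior described in this PR (the names, the `ValueError` on duplicate starts, and the 5-second delay constant are assumptions, not the actual committed code):

```python
import asyncio
import logging

logger = logging.getLogger(__name__)

# In-memory shared state; valid for a single process only.
report_status: dict = {}
report_status_lock = asyncio.Lock()

PROCESSING_DELAY = 5  # seconds of simulated work (assumed from the PR text)

async def process_report(report_id: str, token_id: str) -> bool:
    async with report_status_lock:
        current = report_status.get(report_id)
        if current is not None and current.get("status") == "processing":
            raise ValueError(f"Report {report_id} is already being processed")
        report_status[report_id] = {"status": "processing", "token_id": token_id}
        logger.info("Starting report %s", report_id)
    try:
        await asyncio.sleep(PROCESSING_DELAY)  # simulated background work
    except asyncio.CancelledError:
        async with report_status_lock:
            report_status[report_id]["status"] = "cancelled"
        raise
    except Exception:
        async with report_status_lock:
            report_status[report_id]["status"] = "failed"
        logger.exception("Report %s failed", report_id)
        raise
    async with report_status_lock:
        report_status[report_id]["status"] = "completed"
    logger.info("Completed report %s", report_id)
    return True

async def get_report_status(report_id: str):
    async with report_status_lock:
        return report_status.get(report_id)
```

Every read and write of the shared dict happens under the lock, so concurrent callers see consistent state transitions.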

Summary by CodeRabbit

  • New Features
    • Report processing with real-time status tracking and asynchronous background execution.
  • Bug Fixes
    • Improved concurrency control to prevent duplicate processing and added robust cancellation and error state handling.
  • Tests
    • Comprehensive tests added covering success, concurrent runs, cancellation, failures, status queries, and multi-report scenarios.


coderabbitai bot commented Oct 21, 2025

Walkthrough

Adds a new in-memory report processing service with concurrency control, async processing and status queries, plus a comprehensive test suite covering success, concurrent starts, cancellation, exception handling, and status retrieval.

Changes

Cohort / File(s) Summary

New Report Processing Service: backend/app/services/report_processor.py
  Adds a module-level report_status dict and a report_status_lock (asyncio.Lock); implements async def process_report(report_id: str, token_id: str) -> bool with lock-guarded state transitions ("processing" → "completed"/"cancelled"/"failed"), logging, and simulated 5-second work, plus async def get_report_status(report_id: str) to read status under the lock.

Tests for Report Processing: backend/tests/test_report_processor.py
  Adds tests that reset shared state and validate successful processing, prevention of duplicate concurrent starts, cancellation behavior, exception handling (sleep patched to raise), status retrieval, and concurrent processing of different report IDs.
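The cancellation scenario covered by the tests can be sketched in a self-contained form; the worker below is a stand-in for process_report, not the module's actual code:

```python
import asyncio

status: dict = {}

async def worker(report_id: str) -> bool:
    # stand-in for process_report: mark, simulate work, record cancellation
    status[report_id] = "processing"
    try:
        await asyncio.sleep(5)  # simulated work
    except asyncio.CancelledError:
        status[report_id] = "cancelled"
        raise
    status[report_id] = "completed"
    return True

async def main() -> str:
    task = asyncio.create_task(worker("r1"))
    await asyncio.sleep(0)  # let the worker start and reach its sleep
    task.cancel()
    try:
        await task
    except asyncio.CancelledError:
        pass  # expected: the worker re-raises after recording "cancelled"
    return status["r1"]
```

Re-raising CancelledError after updating state is what lets the caller observe both the "cancelled" status and the cancellation itself.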

Sequence Diagram(s)

sequenceDiagram
    participant Caller
    participant process_report as process_report()
    participant report_status as report_status dict
    participant report_status_lock as report_status_lock

    Caller->>process_report: call process_report(report_id, token_id)
    process_report->>report_status_lock: acquire lock
    report_status_lock-->>process_report: locked
    process_report->>report_status: set status="processing", token_id
    process_report->>process_report: log "starting"
    process_report->>report_status_lock: release lock

    alt normal completion
        process_report->>process_report: await 5s (simulated work)
        process_report->>report_status_lock: acquire lock
        report_status_lock-->>process_report: locked
        process_report->>report_status: set status="completed"
        process_report->>process_report: log "completed"
        process_report->>report_status_lock: release lock
        process_report-->>Caller: return True
    else cancellation (CancelledError)
        process_report->>process_report: CancelledError raised
        process_report->>report_status_lock: acquire lock
        report_status_lock-->>process_report: locked
        process_report->>report_status: set status="cancelled"
        process_report->>report_status_lock: release lock
        process_report-->>Caller: raise CancelledError
    else exception
        process_report->>process_report: exception raised
        process_report->>report_status_lock: acquire lock
        report_status_lock-->>process_report: locked
        process_report->>report_status: set status="failed"
        process_report->>process_report: log "failed"
        process_report->>report_status_lock: release lock
        process_report-->>Caller: re-raise exception
    end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Poem

🐰 I hop and nudge the status light,
Processing starts, then day to night.
Five seconds' dance, or canceled spin,
Failed or done — I note it in.
A tiny trail of states I keep,
then off I go to burrow deep.

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)
  • Docstring Coverage ⚠️ Warning: Docstring coverage is 20.00%, below the required threshold of 80.00%. Run @coderabbitai generate docstrings to improve coverage.
✅ Passed checks (2 passed)
  • Description Check ✅ Passed: Check skipped because CodeRabbit's high-level summary is enabled.
  • Title Check ✅ Passed: The title "feat: Implement asynchronous report processing service" accurately describes the main change: a new asynchronous report processing module with two async functions (process_report and get_report_status) plus comprehensive test coverage. It matches the PR's stated objective of supporting future AI orchestration, and it is concise and specific.
✨ Finishing touches
  • 📝 Generate docstrings
🧪 Generate unit tests (beta)
  • Create PR with unit tests
  • Post copyable unit tests in a comment
  • Commit unit tests in branch feat/report-processor-service

Comment @coderabbitai help to get the list of available commands and usage tips.


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 4

🧹 Nitpick comments (1)
backend/app/services/report_processor.py (1)

8-17: Consider task lifecycle and observability for production.

For production deployment, consider:

  • Task management: How will tasks be tracked? Should asyncio.create_task() be used by callers?
  • Cancellation: What happens if the service shuts down mid-processing?
  • Monitoring: Add metrics for processing time, success/failure rates, and queue depth
  • Resource limits: Should there be a maximum number of concurrent reports?
  • Persistence: On restart, how are in-flight reports recovered?

Example adding observability:

import time
from contextlib import asynccontextmanager

@asynccontextmanager
async def track_processing_time(report_id: str):
    start_time = time.time()
    try:
        yield
    finally:
        duration = time.time() - start_time
        logger.info("Processing duration", extra={
            "report_id": report_id, 
            "duration_seconds": duration
        })

Then use it in process_report:

async with track_processing_time(report_id):
    await asyncio.sleep(5)
📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between c5191c0 and 78a4538.

📒 Files selected for processing (1)
  • backend/app/services/report_processor.py (1 hunks)


@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 0

🧹 Nitpick comments (1)
backend/tests/test_report_processor.py (1)

56-73: Refactor monkey-patching to use proper test fixtures.

The global monkey-patching of asyncio.sleep is risky because:

  • If the test fails before line 73, the original sleep is never restored, potentially breaking subsequent tests
  • The mock function's parameters are unused (line 63)

Use pytest's monkeypatch fixture or unittest.mock.patch with a context manager for safer cleanup:

+from unittest.mock import patch
+
 @pytest.mark.asyncio
 async def test_process_report_exception_handling():
     report_id = "test_report_4"
     token_id = "test_token_4"
 
-    # Temporarily modify process_report to raise an exception
-    original_sleep = asyncio.sleep
-    async def mock_sleep_raise(*args, **kwargs):
+    async def mock_sleep_raise(_duration):
         raise Exception("Simulated processing error")
-    asyncio.sleep = mock_sleep_raise
 
-    with pytest.raises(Exception, match="Simulated processing error"):
-        await process_report(report_id, token_id)
+    with patch('asyncio.sleep', side_effect=mock_sleep_raise):
+        with pytest.raises(Exception, match="Simulated processing error"):
+            await process_report(report_id, token_id)
 
     async with report_status_lock:
         assert report_status[report_id]["status"] == "failed"
-    
-    asyncio.sleep = original_sleep # Restore original sleep

This ensures automatic cleanup even if the test fails.
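The cleanup guarantee can be seen in a runnable form: unittest.mock.patch restores asyncio.sleep even when the patched call raises. The tiny worker here is a stand-in for process_report, not the test suite's actual code:

```python
import asyncio
from unittest.mock import patch

async def worker() -> str:
    # stand-in for process_report's simulated work
    await asyncio.sleep(5)
    return "completed"

async def mock_sleep_raise(_duration):
    raise RuntimeError("Simulated processing error")

async def main() -> str:
    with patch("asyncio.sleep", mock_sleep_raise):
        try:
            result = await worker()
        except RuntimeError as exc:
            result = str(exc)
    # outside the context manager, the real asyncio.sleep is back
    assert asyncio.sleep is not mock_sleep_raise
    return result
```

With manual assignment, the final restore line would be skipped as soon as an assertion fails; the context manager makes that impossible.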

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 78a4538 and 5398368.

📒 Files selected for processing (2)
  • backend/app/services/report_processor.py (1 hunks)
  • backend/tests/test_report_processor.py (1 hunks)
🧰 Additional context used
🧬 Code graph analysis (1)
backend/tests/test_report_processor.py (1)
backend/app/services/report_processor.py (2)
  • process_report (14-50)
  • get_report_status (52-57)
🪛 Ruff (0.14.1)
backend/tests/test_report_processor.py

16-16: Possible hardcoded password assigned to: "token_id"

(S105)


28-28: Possible hardcoded password assigned to: "token_id"

(S105)


44-44: Possible hardcoded password assigned to: "token_id"

(S105)


59-59: Possible hardcoded password assigned to: "token_id"

(S105)


63-63: Unused function argument: args

(ARG001)


63-63: Unused function argument: kwargs

(ARG001)


64-64: Create your own exception

(TRY002)


64-64: Avoid specifying long messages outside the exception class

(TRY003)


78-78: Possible hardcoded password assigned to: "token_id"

(S105)


92-92: Possible hardcoded password assigned to: "token_id_1"

(S105)


94-94: Possible hardcoded password assigned to: "token_id_2"

(S105)

backend/app/services/report_processor.py

28-28: Avoid specifying long messages outside the exception class

(TRY003)


39-39: Consider moving this statement to an else block

(TRY300)

🔇 Additional comments (9)
backend/tests/test_report_processor.py (6)

5-11: Excellent test isolation pattern.

The autouse fixture ensures clean state before and after each test, which is essential for testing concurrent state management. The proper use of the lock guarantees thread-safe cleanup.


13-23: Well-structured success case test.

The test properly verifies both the return value and the state changes under lock, ensuring complete validation of the happy path.


25-39: Thorough concurrent processing test.

The test correctly simulates concurrent processing attempts and validates both the ValueError for duplicate processing and proper cleanup of the cancelled task.


41-54: Good cancellation path coverage.

The test properly validates that task cancellation updates the status to "cancelled" and propagates the CancelledError as expected.


75-87: Complete coverage of status retrieval.

The test validates both successful retrieval of existing report status and proper handling of non-existent reports with None return.


89-105: Excellent concurrent processing validation.

This test confirms that the locking mechanism correctly allows concurrent processing of different reports while preventing duplicate processing of the same report. The use of asyncio.gather properly validates true concurrency.

backend/app/services/report_processor.py (3)

1-12: Excellent resolution of previous critical issues.

The addition of report_status_lock (line 12) addresses the race condition flagged in previous reviews. The comprehensive comments (lines 6-10) clearly document the single-process limitation, and the proper logging setup (lines 2-4) replaces the previous print() statements.


14-50: Well-implemented async processing with proper concurrency control.

The function correctly addresses all previous review concerns:

  • Comprehensive locking prevents race conditions (lines 26, 35, 41, 46)
  • Proper error handling for cancellation (lines 40-44) and failures (lines 45-50)
  • Clear return value and raised exceptions enable proper caller handling
  • Defensive isinstance check (line 36) prevents KeyError in edge cases

The implementation is production-ready for single-process deployments.


52-57: Clean and correct status retrieval.

The function properly protects read access with the lock and returns None for non-existent reports, providing a simple and safe interface.

@felixjordandev felixjordandev merged commit 318af84 into main Oct 21, 2025
1 check passed
@felixjordandev felixjordandev deleted the feat/report-processor-service branch October 21, 2025 20:05