Feat: Implement concurrent agent execution with asyncio.gather #8

lanceherman · 2025-10-21T11:49:02Z

Overview: This PR introduces concurrent execution for agents to significantly improve report generation performance.

Changes

Added the execute_agents_concurrently method to app/core/orchestrator.py.
Leverages asyncio.gather to run multiple registered agents in parallel.
Aggregates individual agent results into a dictionary, then saves them to the report store via save_report_data().
Includes robust error handling to gracefully manage exceptions from individual agents without halting the entire process.

Summary by CodeRabbit

New Features
- Report generation now kicks off background processing so responses are returned faster.
- Added a status endpoint to check report progress and fetch stored report data.
- Concurrent agent processing added to enhance and parallelize report workflows; demo agents included for demonstration.
Tests
- New tests validate concurrent agent execution, success and failure handling, and report status transitions.

coderabbitai · 2025-10-21T11:49:10Z

Walkthrough

Adds an Orchestrator to register and run agents concurrently, starts background agent execution from report generation endpoint, exposes a report status endpoint, and provides persistence for aggregated agent results; includes tests for success and failure agent runs.

Changes

Cohort / File(s)	Summary
Orchestrator Framework `backend/app/core/orchestrator.py`	New `Orchestrator` class and `orchestrator` instance. Methods: `register_agent`, `_run_agent_safely`, and `execute_agents_concurrently`. Runs registered agents with `asyncio.gather(..., return_exceptions=True)`, aggregates per-agent results, logs errors, and calls `save_report_data` to persist outcomes.
Report Generation Endpoints `backend/app/api/v1/routes.py`	Added `dummy_agent_one`, `dummy_agent_two`. `generate_report_endpoint` now obtains `report_id`, schedules `orchestrator.execute_agents_concurrently` as a background task with a done-callback that logs exceptions, and returns the original report response immediately. Added `get_report_status(report_id)` endpoint to return stored report data or 404 if missing. Agents are registered with the orchestrator.
Report Service `backend/app/services/report_service.py`	Updated import paths to `backend.app.*`. Added module-level logger and new async `save_report_data(report_id: str, data: Dict)` to merge/update `in_memory_reports` entries and log warnings if `report_id` is absent.
Orchestrator Tests `backend/tests/test_orchestrator.py`	New async tests and autouse fixture clearing `in_memory_reports`. `test_execute_agents_concurrently_success` checks both agents run and final status `completed` with per-agent `{"status":"completed","data":...}`. `test_execute_agents_concurrently_with_failure` checks failure path, marking failing agent `{"status":"failed","error":...}` and overall `partial_success`.
Legacy Cleanup `main.py`	Removed FastAPI `app` instance and route handlers (`read_root`, `read_item`) and related imports (e.g., `Union`, `FastAPI`), eliminating the previous HTTP API surface.

Sequence Diagram(s)

sequenceDiagram
    participant Client
    participant API as generate_report_endpoint
    participant Orch as Orchestrator
    participant Agents as Agents
    participant RS as ReportService
    participant Status as get_report_status

    Client->>API: POST /report
    API->>API: generate_report() -> report_id
    API-->>Client: 200 OK (report_id)
    
    Note over API,Orch: Background task: execute_agents_concurrently
    API->>Orch: execute_agents_concurrently(report_id, token_id)
    activate Orch
    par Parallel agent runs
        Orch->>Agents: _run_agent_safely(AgentOne)
        Orch->>Agents: _run_agent_safely(AgentTwo)
    end
    Agents-->>Orch: results / exceptions
    Orch->>RS: save_report_data(report_id, aggregated_results)
    deactivate Orch

    loop Polling
        Client->>Status: GET /report_status/{report_id}
        Status-->>Client: current report data or 404
    end

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~20 minutes

Poem

🐰 I hopped through code to start the show,

Agents scurry, fast and slow,
Background trails no longer bind,
Report status keeps us kind,
🥕 Logs and tests—now off they go!

Pre-merge checks and finishing touches

❌ Failed checks (1 warning)

Check name	Status	Explanation	Resolution
Docstring Coverage	⚠️ Warning	Docstring coverage is 0.00% which is insufficient. The required threshold is 80.00%.	You can run `@coderabbitai generate docstrings` to improve docstring coverage.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title Check	✅ Passed	The pull request title "Feat: Implement concurrent agent execution with asyncio.gather" directly reflects the primary objective and main change in the changeset. The summary of changes confirms that the core modification is adding an `execute_agents_concurrently` method to the Orchestrator class that leverages `asyncio.gather` to run multiple agents in parallel, along with supporting infrastructure to integrate this into the routes and report service. The title is specific, avoids vague terminology, and clearly communicates the key feature being introduced—a teammate reviewing the git history would immediately understand that this PR implements concurrent agent execution capabilities.

✨ Finishing touches

📝 Generate docstrings

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch feat/concurrent-agent-execution

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Actionable comments posted: 6

🧹 Nitpick comments (2)

backend/tests/test_orchestrator.py (1)

13-63: Comprehensive test coverage for success and failure scenarios.

Both tests properly validate:

Agent function calls with correct arguments
Result aggregation structure
Status updates in in_memory_reports
Error handling and propagation

Consider adding these additional test cases for more complete coverage:

@pytest.mark.asyncio
async def test_execute_agents_concurrently_no_agents():
    """Test behavior when no agents are registered."""
    orchestrator = Orchestrator()
    report_id = "test_report_id_empty"
    token_id = "test_token_id"
    
    in_memory_reports[report_id] = {"token_id": token_id, "status": "processing"}
    
    await orchestrator.execute_agents_concurrently(report_id, token_id)
    
    assert in_memory_reports[report_id]["status"] == "completed"
    assert in_memory_reports[report_id]["agent_results"] == {}

@pytest.mark.asyncio
async def test_execute_agents_concurrently_all_fail():
    """Test behavior when all agents fail."""
    orchestrator = Orchestrator()
    mock_agent_one = AsyncMock(side_effect=Exception("Agent 1 failed"))
    mock_agent_two = AsyncMock(side_effect=Exception("Agent 2 failed"))
    
    orchestrator.register_agent("AgentOne", mock_agent_one)
    orchestrator.register_agent("AgentTwo", mock_agent_two)
    
    report_id = "test_report_id_all_fail"
    token_id = "test_token_id"
    
    in_memory_reports[report_id] = {"token_id": token_id, "status": "processing"}
    
    await orchestrator.execute_agents_concurrently(report_id, token_id)
    
    assert in_memory_reports[report_id]["status"] == "completed"
    assert in_memory_reports[report_id]["agent_results"]["AgentOne"]["status"] == "failed"
    assert in_memory_reports[report_id]["agent_results"]["AgentTwo"]["status"] == "failed"

backend/app/core/orchestrator.py (1)

13-18: Consider using dict comprehension to simplify agent task creation.

The parallel agent_names and agent_tasks lists can be streamlined by iterating over items directly during result processing.

     async def execute_agents_concurrently(self, report_id: str, token_id: str):
-        agent_tasks = []
-        agent_names = []
-
-        for name, agent_func in self.registered_agents.items():
-            agent_names.append(name)
-            agent_tasks.append(self._run_agent_safely(name, agent_func, report_id, token_id))
+        agent_tasks = {
+            name: self._run_agent_safely(name, agent_func, report_id, token_id)
+            for name, agent_func in self.registered_agents.items()
+        }
 
-        results = await asyncio.gather(*agent_tasks, return_exceptions=True)
+        results = await asyncio.gather(*agent_tasks.values(), return_exceptions=True)
 
         aggregated_results = {}
-        for i, result in enumerate(results):
-            agent_name = agent_names[i]
+        for agent_name, result in zip(agent_tasks.keys(), results):
             if isinstance(result, Exception):

Alternatively, use asyncio.TaskGroup (Python 3.11+) for more structured concurrency:

async def execute_agents_concurrently(self, report_id: str, token_id: str):
    async with asyncio.TaskGroup() as tg:
        tasks = {
            name: tg.create_task(self._run_agent_safely(name, agent_func, report_id, token_id))
            for name, agent_func in self.registered_agents.items()
        }
    
    aggregated_results = {}
    for name, task in tasks.items():
        try:
            result = task.result()
            aggregated_results[name] = {"status": "completed", "data": result}
        except Exception as e:
            logger.error(f"Agent '{name}' failed with error: {e}")
            aggregated_results[name] = {"status": "failed", "error": str(e)}

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between f8038aa and bedfda0.

⛔ Files ignored due to path filters (7)

backend/__pycache__/__init__.cpython-313.pyc is excluded by !**/*.pyc
backend/app/__pycache__/__init__.cpython-313.pyc is excluded by !**/*.pyc
backend/app/core/__pycache__/orchestrator.cpython-313.pyc is excluded by !**/*.pyc
backend/app/models/__pycache__/report_models.cpython-313.pyc is excluded by !**/*.pyc
backend/app/services/__pycache__/report_service.cpython-313.pyc is excluded by !**/*.pyc
backend/app/utils/__pycache__/id_generator.cpython-313.pyc is excluded by !**/*.pyc
backend/tests/__pycache__/test_orchestrator.cpython-313-pytest-8.4.2.pyc is excluded by !**/*.pyc

📒 Files selected for processing (6)

backend/app/api/v1/routes.py (1 hunks)
backend/app/core/orchestrator.py (1 hunks)
backend/app/services/report_service.py (2 hunks)
backend/main.py (1 hunks)
backend/tests/test_orchestrator.py (1 hunks)
main.py (0 hunks)

💤 Files with no reviewable changes (1)

main.py

🧰 Additional context used

🧬 Code graph analysis (4)

backend/app/core/orchestrator.py (1)

backend/app/services/report_service.py (1)

save_report_data (19-24)

backend/tests/test_orchestrator.py (1)

backend/app/core/orchestrator.py (3)

Orchestrator (5-38)

register_agent (9-10)

execute_agents_concurrently (12-31)

backend/app/services/report_service.py (2)

backend/app/models/report_models.py (2)

ReportRequest (4-6)

ReportResponse (8-10)

backend/app/utils/id_generator.py (1)

generate_report_id (3-8)

backend/app/api/v1/routes.py (3)

backend/app/models/report_models.py (2)

ReportRequest (4-6)

ReportResponse (8-10)

backend/app/services/report_service.py (1)

generate_report (8-17)

backend/app/core/orchestrator.py (2)

register_agent (9-10)

execute_agents_concurrently (12-31)

🪛 Ruff (0.14.1)

backend/tests/test_orchestrator.py

23-23: Possible hardcoded password assigned to: "token_id"

(S105)

48-48: Possible hardcoded password assigned to: "token_id"

(S105)

🔇 Additional comments (4)

backend/app/services/report_service.py (1)

1-2: LGTM! Import paths updated correctly.

The import paths have been updated to use the backend.app.* namespace, which aligns with the module reorganization.

backend/app/api/v1/routes.py (1)

9-22: Dummy agents look good for demonstration purposes.

The two dummy agents are properly structured as async functions with appropriate signatures. The simulated async work with asyncio.sleep() effectively demonstrates concurrent execution.

backend/tests/test_orchestrator.py (1)

7-11: LGTM! Fixture properly manages test state.

The autouse fixture ensures in_memory_reports is cleared before and after each test, preventing test pollution.

backend/app/core/orchestrator.py (1)

5-10: LGTM! Clean orchestrator initialization and registration.

The class structure is clear, and the register_agent method properly types the agent function signature.

backend/app/api/v1/routes.py

backend/app/core/orchestrator.py

backend/app/services/report_service.py

backend/main.py

coderabbitai

Actionable comments posted: 1

🧹 Nitpick comments (3)

backend/app/api/v1/routes.py (2)
13-21: Replace print() with proper logging in dummy agents.

For consistency with the rest of the codebase and better observability, use the logger instead of print() statements.

Apply this diff:
 async def dummy_agent_one(report_id: str, token_id: str) -> dict:
-    print(f"Dummy Agent One running for report {report_id} and token {token_id}")
+    logger.info("Dummy Agent One running for report %s and token %s", report_id, token_id)
     await asyncio.sleep(1) # Simulate async work
     return {"agent_one_data": "data_from_agent_one"}
 
 async def dummy_agent_two(report_id: str, token_id: str) -> dict:
-    print(f"Dummy Agent Two running for report {report_id} and token {token_id}")
+    logger.info("Dummy Agent Two running for report %s and token %s", report_id, token_id)
     await asyncio.sleep(0.5) # Simulate async work
     return {"agent_two_data": "data_from_agent_two"}
37-43: Remove redundant exception parameter from logger.exception.

The logger.exception() method automatically logs the current exception, so passing the exception object as a parameter is redundant.

Apply this diff:
     def _on_done(t: asyncio.Task):
         try:
             t.result()
         except Exception as e:
-            logger.exception('Background orchestration failed for %s: %s', report_id, e)
+            logger.exception('Background orchestration failed for %s', report_id)
             # Optionally update report status to failed here as well
Based on static analysis hints.
backend/app/core/orchestrator.py (1)
26-32: Consider simplifying the exc_info parameter.

The current usage of exc_info=isinstance(result, BaseException) is technically correct but could be simplified. Since the exception is already logged with full traceback in _run_agent_safely (line 53), this orchestration-level log serves as a summary and doesn't need the exc_info parameter.

Apply this diff:
         for i, result in enumerate(results):
             agent_name = agent_names[i]
             if isinstance(result, Exception):
-                logger.error("Agent '%s' failed with error: %s", agent_name, result, exc_info=isinstance(result, BaseException))
+                logger.error("Agent '%s' failed with error: %s", agent_name, result)
                 aggregated_results[agent_name] = {'status': 'failed', 'error': str(result)}
             else:
                 aggregated_results[agent_name] = {'status': 'completed', 'data': result}

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between bedfda0 and 2b7099f.

📒 Files selected for processing (4)

backend/app/api/v1/routes.py (1 hunks)
backend/app/core/orchestrator.py (1 hunks)
backend/app/services/report_service.py (2 hunks)
backend/tests/test_orchestrator.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (4)

backend/app/services/report_service.py (2)

backend/app/models/report_models.py (2)

ReportRequest (4-6)

ReportResponse (8-10)

backend/app/utils/id_generator.py (1)

generate_report_id (3-8)

backend/app/core/orchestrator.py (1)

backend/app/services/report_service.py (1)

save_report_data (22-27)

backend/app/api/v1/routes.py (3)

backend/app/models/report_models.py (2)

ReportRequest (4-6)

ReportResponse (8-10)

backend/app/services/report_service.py (1)

generate_report (11-20)

backend/app/core/orchestrator.py (2)

register_agent (12-13)

execute_agents_concurrently (15-47)

backend/tests/test_orchestrator.py (1)

backend/app/core/orchestrator.py (3)

Orchestrator (8-54)

register_agent (12-13)

execute_agents_concurrently (15-47)

🪛 Ruff (0.14.1)

backend/app/api/v1/routes.py

41-41: Redundant exception object included in logging.exception call

(TRY401)

backend/tests/test_orchestrator.py

22-22: Possible hardcoded password assigned to: "token_id"

(S105)

47-47: Possible hardcoded password assigned to: "token_id"

(S105)

🔇 Additional comments (11)

backend/app/services/report_service.py (2)

1-6: LGTM!

The logging setup follows best practices, and the import path updates are correct.

22-27: LGTM!

The save_report_data function correctly handles both success and missing report ID cases with proper logging.

backend/tests/test_orchestrator.py (2)

6-10: LGTM!

The autouse fixture ensures proper test isolation by cleaning up in_memory_reports before and after each test.

12-35: LGTM!

The test correctly validates successful concurrent agent execution, including proper agent invocation and result aggregation.

backend/app/api/v1/routes.py (2)

1-8: LGTM!

The imports and logger initialization are properly configured.

46-50: LGTM!

The status endpoint correctly validates the report ID and returns appropriate responses.

backend/app/core/orchestrator.py (5)

1-13: LGTM!

The class initialization and agent registration logic are clean and well-typed.

15-23: LGTM!

The concurrent task execution using asyncio.gather with return_exceptions=True correctly enables parallel agent execution with graceful error handling.

34-47: LGTM!

The overall status determination logic correctly handles all scenarios (complete success, partial failure, complete failure), and the summary provides useful metrics.

49-54: LGTM!

The safe agent execution wrapper properly logs exceptions with full traceback before re-raising them for asyncio.gather to handle.

56-56: LGTM!

The module-level orchestrator instance provides a clean singleton pattern for agent registration and execution.

backend/tests/test_orchestrator.py

coderabbitai

Actionable comments posted: 0

🧹 Nitpick comments (2)

backend/tests/test_orchestrator.py (2)
12-35: Test logic is correct and validates the success path.

The test properly validates that both agents execute concurrently and their results are aggregated correctly.

Optional enhancement: Consider asserting the summary field for more complete coverage:
     assert in_memory_reports[report_id]["agent_results"]["AgentTwo"] == {"status": "completed", "data": {"agent_two_result": "data2"}}
+    assert in_memory_reports[report_id]["summary"] == {"total": 2, "success": 2, "failed": 0}
37-62: Test correctly validates the partial success scenario.

The test properly handles the case where one agent succeeds and another fails. Line 57 correctly asserts "partial_success" status, which aligns with the orchestrator logic. The past review concern has been addressed.

Optional enhancement: Consider asserting the summary field for more complete coverage:
     assert "Agent failed" in in_memory_reports[report_id]["agent_results"]["AgentFailing"]["error"]
+    assert in_memory_reports[report_id]["summary"] == {"total": 2, "success": 1, "failed": 1}

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 2b7099f and 0f72888.

⛔ Files ignored due to path filters (4)

backend/__pycache__/__init__.cpython-313.pyc is excluded by !**/*.pyc
backend/app/core/__pycache__/orchestrator.cpython-313.pyc is excluded by !**/*.pyc
backend/app/services/__pycache__/report_service.cpython-313.pyc is excluded by !**/*.pyc
backend/tests/__pycache__/test_orchestrator.cpython-313-pytest-8.4.2.pyc is excluded by !**/*.pyc

📒 Files selected for processing (1)

backend/tests/test_orchestrator.py (1 hunks)

🧰 Additional context used

🧬 Code graph analysis (1)

backend/tests/test_orchestrator.py (1)

backend/app/core/orchestrator.py (3)

Orchestrator (8-54)

register_agent (12-13)

execute_agents_concurrently (15-47)

🪛 Ruff (0.14.1)

backend/tests/test_orchestrator.py

22-22: Possible hardcoded password assigned to: "token_id"

(S105)

47-47: Possible hardcoded password assigned to: "token_id"

(S105)

🔇 Additional comments (1)

backend/tests/test_orchestrator.py (1)

6-10: LGTM! Good test isolation.

The autouse fixture correctly clears the shared in_memory_reports state before and after each test, ensuring proper test isolation.

felixjordandev · 2025-10-21T13:20:04Z

Nice, the concurrent execution should speed things up a lot. Approved!

lanceherman added 3 commits October 21, 2025 05:49

Implement concurrent agent execution

bedfda0

Fix: API orchestration race condition, logging, and dotenv loading

2b7099f

Fix: Adjust orchestrator test to expect partial success status

0f72888

coderabbitai bot reviewed Oct 21, 2025

View reviewed changes

backend/tests/test_orchestrator.py Outdated Show resolved Hide resolved

coderabbitai bot reviewed Oct 21, 2025

View reviewed changes

felixjordandev approved these changes Oct 21, 2025

View reviewed changes

felixjordandev merged commit 1f67c9d into main Oct 21, 2025
1 check passed

felixjordandev deleted the feat/concurrent-agent-execution branch October 21, 2025 13:20

This was referenced Oct 21, 2025

feat: Implement base AIOrchestrator for agent coordination #10

Merged

Feat: Add endpoint for report processing status #12

Merged

Feature: Asynchronous Report Generation for /report/generate Endpoint #15

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Feat: Implement concurrent agent execution with asyncio.gather #8

Feat: Implement concurrent agent execution with asyncio.gather #8

Uh oh!

lanceherman commented Oct 21, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Oct 21, 2025 •

edited

Loading

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Uh oh!

felixjordandev commented Oct 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Feat: Implement concurrent agent execution with asyncio.gather #8

Feat: Implement concurrent agent execution with asyncio.gather #8

Uh oh!

Conversation

lanceherman commented Oct 21, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Oct 21, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Poem

Pre-merge checks and finishing touches

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

felixjordandev commented Oct 21, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

lanceherman commented Oct 21, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Oct 21, 2025 •

edited

Loading