LCORE-1166: Rebased conversation history changes #1129
Conversation
Walkthrough

Endpoints now capture a UTC completed_at timestamp …

Changes
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Client as Client
    participant Endpoint as QueryEndpoint
    participant Storage as UtilsQuery
    participant DB as Database
    Client->>Endpoint: Send query request
    Endpoint->>Endpoint: Process request / produce response
    Endpoint->>Endpoint: Capture completed_at (UTC)
    Endpoint->>Storage: store_query_results(model, completed_at, ...)
    Storage->>Storage: extract_provider_and_model_from_model_id(model)
    Storage->>DB: Acquire FOR UPDATE lock on UserTurn rows
    DB-->>Storage: Lock acquired
    Storage->>DB: Query max(turn_number) for conversation
    DB-->>Storage: Return max turn_number
    Storage->>DB: Insert new UserTurn (turn_number, started_at, completed_at, provider_id, model_id)
    DB-->>Storage: UserTurn inserted
    Storage->>DB: Persist conversation metadata (last_used_model, started_at, completed_at)
    DB-->>Storage: Persisted
    Storage-->>Endpoint: Done
    Endpoint-->>Client: Return final response
```
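The turn-numbering step in the diagram can be sketched with plain SQL. This is a simplified, hypothetical model: the table and column names are assumed from the diagram, and sqlite3 stands in for the real SQLAlchemy layer (SQLite has no `FOR UPDATE`, so the lock step is omitted here):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE user_turns ("
    "  conversation_id TEXT,"
    "  turn_number INTEGER,"
    "  started_at TEXT,"
    "  completed_at TEXT,"
    "  provider_id TEXT,"
    "  model_id TEXT,"
    "  PRIMARY KEY (conversation_id, turn_number))"
)

def insert_turn(conv_id, started_at, completed_at, provider_id, model_id):
    # Query max(turn_number) for the conversation, then insert the next turn
    (max_turn,) = conn.execute(
        "SELECT MAX(turn_number) FROM user_turns WHERE conversation_id = ?",
        (conv_id,),
    ).fetchone()
    next_turn = (max_turn or 0) + 1
    conn.execute(
        "INSERT INTO user_turns VALUES (?, ?, ?, ?, ?, ?)",
        (conv_id, next_turn, started_at, completed_at, provider_id, model_id),
    )
    conn.commit()
    return next_turn

first = insert_turn("conv-1", "t0", "t1", "openai", "gpt-4")
second = insert_turn("conv-1", "t2", "t3", "openai", "gpt-4")
```

The gap between reading `MAX(turn_number)` and inserting is exactly the race window the later review comments discuss.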
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes
Actionable comments posted: 1
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
src/utils/query.py (1)
413-420: ⚠️ Potential issue | 🟠 Major

Bug: the `completed_at` parameter is overwritten before building the cache entry, causing a DB/cache timestamp inconsistency.

Line 413 reassigns `completed_at` with a freshly computed timestamp, shadowing the function parameter received at line 336. The database record (via `persist_user_conversation_details` at line 401) uses the original caller-provided `completed_at`, but the `CacheEntry` at line 420 uses this new value. This creates an inconsistency between the persisted database record and the cache.

🐛 Proposed fix — remove the redundant reassignment
```diff
     # Store conversation in cache
     try:
-        completed_at = datetime.now(UTC).strftime("%Y-%m-%dT%H:%M:%SZ")
         cache_entry = CacheEntry(
             query=query_request.query,
             response=summary.llm_response,
```
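A minimal, self-contained illustration of the shadowing pitfall described above (hypothetical names, not the project's actual code):

```python
from datetime import datetime, timezone

def store(completed_at: str) -> tuple[str, str]:
    # The DB write uses the caller-provided timestamp
    db_value = completed_at
    # Bug pattern: reassigning the parameter shadows the caller's value,
    # so everything after this line sees a different timestamp
    completed_at = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
    cache_value = completed_at
    return db_value, cache_value

db_value, cache_value = store("2024-01-01T00:00:00Z")
# db_value keeps the caller's timestamp; cache_value is freshly computed
```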
🤖 Fix all issues with AI agents
In `@tests/unit/app/endpoints/test_conversations.py`:
- Around line 799-802: The tests in TestGetConversationEndpoint are patching the
wrong function: replace any mocker.patch targeting "retrieve_conversation" with
"validate_and_retrieve_conversation" so the actual call from
get_conversation_endpoint_handler is intercepted; update mocks in
test_llama_stack_not_found_error, test_get_conversation_forbidden, and
test_sqlalchemy_error_in_get_conversation to patch
validate_and_retrieve_conversation (and keep returning mock_conversation or
raising the intended exceptions) so the tests exercise the correct code path
invoked by get_conversation_endpoint_handler.
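The rule behind this fix prompt is the standard mocking guidance: patch the name where the module under test looks it up, not where it is defined. A stdlib-only sketch — `fake_utils` and `fake_endpoints` are made-up module names, not the project's modules:

```python
import sys
import types
from unittest.mock import patch

# A fake "utils" module defining the real function
fake_utils = types.ModuleType("fake_utils")

def validate_and_retrieve_conversation():
    return "real"

fake_utils.validate_and_retrieve_conversation = validate_and_retrieve_conversation
sys.modules["fake_utils"] = fake_utils

# A fake "endpoints" module that imports the name at import time
fake_endpoints = types.ModuleType("fake_endpoints")
exec(
    "from fake_utils import validate_and_retrieve_conversation\n"
    "def handler():\n"
    "    return validate_and_retrieve_conversation()\n",
    fake_endpoints.__dict__,
)
sys.modules["fake_endpoints"] = fake_endpoints

# Patching the definition site does NOT intercept the endpoint's call
with patch("fake_utils.validate_and_retrieve_conversation", return_value="mocked"):
    wrong = fake_endpoints.handler()

# Patching the name inside the endpoint module does intercept it
with patch("fake_endpoints.validate_and_retrieve_conversation", return_value="mocked"):
    right = fake_endpoints.handler()
```

This is why the tests must patch the symbol actually referenced by `get_conversation_endpoint_handler`'s module.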
🧹 Nitpick comments (5)
tests/unit/app/endpoints/test_conversations.py (1)
827-870: New APIStatusError test looks good overall.

The test correctly verifies that an `APIStatusError` during item retrieval results in an HTTP 404 with the expected detail payload.

One minor note: line 841 patches `can_access_conversation`, but since `validate_and_retrieve_conversation` is fully mocked (lines 842-844), the access check may never be reached. If so, the patch is dead weight — consider removing it for clarity, or confirm it's needed by the endpoint's call order.

tests/unit/utils/test_query.py (2)
527-528: `@pytest.mark.asyncio` on a synchronous function is unnecessary.

`persist_user_conversation_details` is a synchronous function (`def`, not `async def`), so `@pytest.mark.asyncio` and `async def` on these tests are not needed. While it won't break anything, it adds unnecessary overhead and is misleading.

♻️ Suggested fix

```diff
-    @pytest.mark.asyncio
-    async def test_create_new_conversation(self, mocker: MockerFixture) -> None:
+    def test_create_new_conversation(self, mocker: MockerFixture) -> None:
```

Same applies to `test_update_existing_conversation` at lines 576-577.
547-556: Dead `if not args` branch in `query_side_effect`.

The `if not args:` branch (line 549) is unreachable — `session.query(func.max(UserTurn.turn_number))` always passes one argument. The `func.max(...)` expression won't match `UserConversation` or `UserTurn`, so it correctly falls through to the default `return mock_max_query` at line 556. The `not args` check is harmless but dead code.

♻️ Suggested simplification

```diff
 def query_side_effect(*args: Any) -> Any:
     """Route queries based on the argument type."""
-    if not args:
-        return mock_max_query
-    arg = args[0]
-    if arg is UserConversation:
+    if args and args[0] is UserConversation:
         return mock_conv_query
-    if arg is UserTurn:
+    if args and args[0] is UserTurn:
         return mock_turn_lock_query
     return mock_max_query
```

Same applies to the duplicate at lines 603-612.
src/utils/query.py (2)
331-341: Docstring parameter name inconsistency: the `model` parameter description says "model identifier" but it actually expects the composite `provider/model` format.

The callers pass `responses_params.model`, which is in `"provider/model"` format (as evidenced by `extract_provider_and_model_from_model_id(model)` at line 366). The docstring at line 354 should clarify this expected format to avoid confusion for future callers.

📝 Suggested docstring improvement

```diff
-        model: The model identifier
+        model: The full model identifier in "provider/model" format
```
548-553: Combine the lock and max() query to reduce unnecessary memory consumption.

The current implementation fetches all `UserTurn` rows into memory just to acquire locks, which is wasteful for conversations with many turns. Since the rows are never read — only the lock matters — combine the locking and aggregation into a single query:

♻️ Suggested refactor

```diff
-        # Lock UserTurn rows for this conversation to prevent race conditions
-        # when computing max(turn_number) and inserting a new turn
-        session.query(UserTurn).filter_by(
-            conversation_id=normalized_id
-        ).with_for_update().all()
-        # Recompute max(turn_number) after acquiring the lock
-        max_turn_number = (
-            session.query(func.max(UserTurn.turn_number))
-            .filter_by(conversation_id=normalized_id)
-            .scalar()
-        )
+        # Lock and compute max turn number in a single query
+        max_turn_number = (
+            session.query(func.max(UserTurn.turn_number))
+            .filter_by(conversation_id=normalized_id)
+            .with_for_update()
+            .scalar()
+        )
```

Note: PostgreSQL properly locks all rows scanned by the aggregate. SQLite silently ignores `FOR UPDATE` (as with the current code), so behavior is unchanged. Verify the lock semantics against your supported databases in testing.
Force-pushed a0ef91f to 5c354fe (Compare)
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@src/utils/query.py`:
- Around line 547-554: The .with_for_update() call when computing max turn
number on UserTurn (the
session.query(...).filter_by(conversation_id=normalized_id).with_for_update().scalar()
block) is invalid on SQLite; guard it by checking the current DB dialect (e.g.,
session.get_bind().dialect.name or session.bind.dialect.name) and only append
.with_for_update() when the dialect is not "sqlite" (for SQLite run the query
without .with_for_update(), or apply a SQLite-compatible lock if desired).
Ensure the logic still computes max_turn_number the same way and references
UserTurn.turn_number and conversation_id/normalized_id the same as before.
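A rough sketch of the guard this prompt describes, reduced to SQL-string form (the real code would conditionally call SQLAlchemy's `.with_for_update()` based on `session.get_bind().dialect.name`; the function and table names here are illustrative):

```python
def max_turn_sql(dialect_name: str) -> str:
    """Build the max(turn_number) query, adding FOR UPDATE only where supported."""
    sql = (
        "SELECT MAX(turn_number) FROM user_turns "
        "WHERE conversation_id = :conversation_id"
    )
    # SQLite does not support row-level FOR UPDATE locking; skip the clause there
    if dialect_name != "sqlite":
        sql += " FOR UPDATE"
    return sql
```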
🧹 Nitpick comments (2)
tests/unit/app/endpoints/test_conversations.py (1)
869-900: `test_sqlalchemy_error_in_get_conversation` — test only verifies exception passthrough, not actual DB error handling.

This test patches `validate_and_retrieve_conversation` to directly raise `HTTPException` and then asserts the same exception comes out. It's effectively testing that FastAPI propagates an `HTTPException` — not that the endpoint or `validate_and_retrieve_conversation` correctly handles a `SQLAlchemyError`. The existing `test_sqlalchemy_error_in_retrieve_conversation` (lines 970-1011) already covers the real DB error path. Consider whether this test adds meaningful coverage or is redundant.

tests/unit/utils/test_query.py (1)
527-565: Test mock routing is well-structured, but consider verifying the `UserTurn` was added.

The `query_side_effect` dispatch pattern is clean. However, `mock_session.add.assert_called()` only verifies `add` was called at least once — it doesn't confirm a `UserTurn` object was persisted. Since turn tracking is the core new behavior, a targeted assertion would strengthen coverage.

Optional: verify UserTurn was added

```diff
     mock_session.add.assert_called()
     mock_session.commit.assert_called_once()
+
+    # Verify a UserTurn was added
+    from models.database.conversations import UserTurn
+    add_calls = mock_session.add.call_args_list
+    turn_adds = [c for c in add_calls if isinstance(c[0][0], UserTurn)]
+    assert len(turn_adds) == 1
+    added_turn = turn_adds[0][0][0]
+    assert added_turn.turn_number == 1
+    assert added_turn.provider == "provider1"
+    assert added_turn.model == "model1"
```
Force-pushed 5c354fe to 43d06ab (Compare)
Actionable comments posted: 1
🤖 Fix all issues with AI agents
In `@src/utils/conversations.py`:
- Around line 400-415: The calculation for legacy_turns_count can go negative
when len(turns_metadata) > total_turns; change the logic around total_turns,
legacy_turns_count and the selection of turn_metadata (used in the loop over
turn_items_list and functions like _create_dummy_turn_metadata) to defend
against that: clamp legacy_turns_count to at least 0 (e.g., legacy_turns_count =
max(0, total_turns - len(turns_metadata))) or explicitly check if
len(turns_metadata) > total_turns and adjust metadata indexing, and add a
warning log when metadata outnumbers turns so orphaned metadata is visible
instead of silently skipped. Ensure subsequent indexing of turns_metadata uses
the adjusted metadata_index calculation to avoid negative offsets.
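The defensive clamp the prompt asks for can be sketched as follows (the function name and log wording are assumptions, not the actual implementation):

```python
import logging

logger = logging.getLogger(__name__)

def compute_legacy_turns_count(total_turns: int, metadata_count: int) -> int:
    """Number of older turns with no stored metadata; never negative."""
    if metadata_count > total_turns:
        # Make orphaned metadata visible instead of silently skipping it
        logger.warning(
            "Turn metadata entries (%d) outnumber turns (%d); extra metadata ignored",
            metadata_count,
            total_turns,
        )
    return max(0, total_turns - metadata_count)
```

With the clamp in place, downstream indexing into `turns_metadata` can no longer start from a negative offset.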
🧹 Nitpick comments (6)
src/utils/query.py (3)
547-566: Good: `with_for_update()` removed, resolving SQLite compatibility.

The previous review flagged that `with_for_update()` would fail on SQLite. This has been addressed by removing it entirely.

However, without any locking, two concurrent requests for the same `conversation_id` can read the same `max_turn_number` and attempt to insert duplicate `(conversation_id, turn_number)` composite PKs. This will raise an `IntegrityError` (a subclass of `SQLAlchemyError`), which is caught by the caller in `store_query_results` (line 406) and converted to an HTTP 500. For low-concurrency workloads this is acceptable, but for high-concurrency scenarios on the same conversation, consider a retry-on-conflict or a DB-level sequence/auto-increment for `turn_number`.
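The retry-on-conflict alternative could look roughly like this, with sqlite3 and its `IntegrityError` standing in for SQLAlchemy's (a sketch under assumed table names, not the project's code):

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE user_turns ("
    "  conversation_id TEXT,"
    "  turn_number INTEGER,"
    "  PRIMARY KEY (conversation_id, turn_number))"
)

def insert_turn_with_retry(conv_id: str, attempts: int = 3) -> int:
    """Allocate the next turn_number, retrying if a concurrent writer wins."""
    for _ in range(attempts):
        (max_turn,) = conn.execute(
            "SELECT MAX(turn_number) FROM user_turns WHERE conversation_id = ?",
            (conv_id,),
        ).fetchone()
        next_turn = (max_turn or 0) + 1
        try:
            conn.execute(
                "INSERT INTO user_turns VALUES (?, ?)", (conv_id, next_turn)
            )
            conn.commit()
            return next_turn
        except sqlite3.IntegrityError:
            # Duplicate (conversation_id, turn_number): recompute and retry
            conn.rollback()
    raise RuntimeError("could not allocate a turn number")

turn_one = insert_turn_with_retry("conv-1")
turn_two = insert_turn_with_retry("conv-1")
```

The composite primary key turns the race into a detectable conflict instead of silent corruption, which is what makes the retry loop safe.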
334-342: Docstring could clarify the expected `model` format.

The parameter `model: str` on line 334 does not shadow anything built-in, so naming is fine. However, note the asymmetry: the public API uses `model` (a combined "provider/model" string) while internally it's split into `model_id` and `provider_id`. The docstring on line 354 could be clearer that `model` is expected in `"provider/model"` format.
488-507: Docstring: document the expected ISO format for `started_at`/`completed_at`.

The docstring says "timestamp when the conversation started/completed" but doesn't specify the expected format. Since the function calls `datetime.fromisoformat()` internally (lines 556-557), invalid formats will raise a `ValueError` that isn't caught here (it would propagate as an unhandled exception, not as an `SQLAlchemyError`). Consider either documenting the expected ISO format or catching `ValueError` and wrapping it.

tests/unit/utils/test_query.py (2)
529-565: Consider strengthening assertions in `test_create_new_conversation`.

Line 564 uses `mock_session.add.assert_called()`, which only verifies `add` was invoked at least once. The new `test_create_new_conversation_with_existing_turns` test (lines 654-669) demonstrates a stronger pattern — inspecting `call_args_list` to verify both `UserConversation` and `UserTurn` objects were added. Consider applying the same pattern here for consistency and to catch regressions where one of the two `session.add()` calls might be accidentally removed.
543-549: Repeated mock routing logic could be extracted to a helper or fixture.

The `query_side_effect` function is duplicated across three tests with only the mock objects varying. A shared helper or parametrized fixture would reduce boilerplate.

Also applies to: 589-594, 632-637
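The suggested helper could be a small factory that closes over the per-test mocks. In this sketch, the `UserConversation`/`UserTurn` classes are stand-ins for the real ORM models:

```python
from typing import Any

class UserConversation:  # stand-in for the real ORM model
    pass

class UserTurn:  # stand-in for the real ORM model
    pass

def make_query_side_effect(conv_query: Any, turn_query: Any, max_query: Any):
    """Build a session.query side_effect that routes by the first argument."""
    def query_side_effect(*args: Any) -> Any:
        if args and args[0] is UserConversation:
            return conv_query
        if args and args[0] is UserTurn:
            return turn_query
        return max_query
    return query_side_effect

# Each test supplies its own mocks instead of redefining the dispatch logic
side_effect = make_query_side_effect("CONV", "TURN", "MAX")
```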
src/utils/conversations.py (1)
321-323: Messages are parsed twice: once during grouping, once during per-turn processing.

`_parse_message_item` is called here solely to check `message.type == "user"`, but the resulting `Message` object is discarded. The same item is parsed again inside `_process_turn_items` (line 367). You can avoid the redundant construction by checking the role directly on the cast item:

♻️ Suggested simplification

```diff
 if item_type == "message":
     message_item = cast(MessageOutput, item)
-    message = _parse_message_item(message_item)
-
-    if message.type == "user":
+    if message_item.role == "user":
```
Force-pushed 43d06ab to 23a433d (Compare)
tisnik
left a comment
looks sane :)
jrobertboos
left a comment
Overall LGTM. This PR should be tied to a JIRA issue though :)
are-ces
left a comment
LGTM
Done ;)
Description
This PR restores conversation history enrichment changes that were lost in an incorrect rebase.

It also fixes a hidden bug for legacy conversations that have no metadata stored for older turns. The code now correctly adds dummy metadata for the oldest turns and real metadata for the newest turns.

Finally, row locking during turn persistence was removed because (1) no race conditions are expected here in practice, and (2) the previous syntax did not work correctly with SQLite.
Type of change
Tools used to create PR
Identify any AI code assistants used in this PR (for transparency and review context)
Related Tickets & Documents
Related Issue: LCORE-1078
Closes: LCORE-1166
Checklist before requesting a review
Testing
Summary by CodeRabbit
New Features
Bug Fixes
Refactor
Tests