
Conversation

afterrburn
Contributor

@afterrburn afterrburn commented Jul 6, 2025

Since the SDK for vector search is currently broken (only returning a maximum of 10 results), this is a good opportunity to revise our approach. Previously, we retrieved the full document associated with a chunk, which provided rich context but often led to saturated and slower responses.

The new strategy retrieves only the content surrounding each relevant chunk, giving the LLM more targeted context. This avoids unnecessary overhead: some markdown files have up to 75 chunks, yet only a few are needed to answer most questions.

I also reduced the number of relevant vector results from 20 to 10 to match the current SDK behavior. We can increase this when it makes sense.
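The neighbor-expansion idea described above can be sketched as a small pure function. This is a hypothetical illustration, not the actual retriever code: given the indices of relevant chunks and the document's total chunk count, it computes the set of chunk indices to fetch, including a configurable number of neighbors on each side, clamped to valid bounds.

```typescript
// Sketch: expand relevant chunk indices with their neighbors.
// `contextWindow` of 1 means one chunk before and after each hit.
function expandChunkIndices(
  relevantIndices: number[],
  totalChunks: number,
  contextWindow = 1
): number[] {
  const expanded = new Set<number>();
  for (const index of relevantIndices) {
    for (let offset = -contextWindow; offset <= contextWindow; offset++) {
      const neighbor = index + offset;
      // Clamp to the document's valid chunk range.
      if (neighbor >= 0 && neighbor < totalChunks) {
        expanded.add(neighbor);
      }
    }
  }
  // Sort so expanded chunks can be concatenated in document order.
  return [...expanded].sort((a, b) => a - b);
}
```

For a 75-chunk document where chunks 0 and 5 were relevant, this yields indices 0, 1, 4, 5, and 6, so only five chunks are fetched instead of the whole file.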

Summary by CodeRabbit

  • New Features

    • Added new metadata to document chunks, including the total number of chunks and additional fields for relevance score, chunk range, and chunk index.
    • Enhanced document retrieval to provide expanded context by including neighboring chunks for more comprehensive answers.
  • Improvements

    • Updated system prompts for more concise and relevant assistant responses, with clearer instructions and improved formatting.
    • Reduced the default number of vector search results from 20 to 10 for more focused results.
    • Improved logging for document synchronization and vector deletion processes, offering better visibility into operations.

@afterrburn afterrburn requested a review from rblalock July 6, 2025 14:55
Contributor

coderabbitai bot commented Jul 6, 2025

Walkthrough

This update introduces chunk context awareness and expansion in document processing and retrieval. It adds a totalChunks property to chunk types and metadata, refines vector deletion batching, and enhances retrieval by grouping and expanding relevant document chunks with their neighbors. The system prompt and document schemas are also updated for clarity and richer metadata.

Changes

File(s) Change Summary
agent-docs/config.ts Reduced vectorSearchNumber constant from 20 to 10.
agent-docs/src/agents/doc-processing/chunk-mdx.ts, types.ts, docs-processor.ts Added totalChunks to chunk types and metadata; ensured chunk objects include this property.
agent-docs/src/agents/doc-processing/docs-orchestrator.ts Refactored vector deletion to batch in groups of 100; improved logging; added log after upserting all chunks.
agent-docs/src/agents/doc-qa/rag.ts Simplified and clarified system prompt; changed XML tags and answer instructions.
agent-docs/src/agents/doc-qa/retriever.ts Refactored to expand relevant chunks with neighbors; added expandPathGroup helper; improved logging and error handling.
agent-docs/src/agents/doc-qa/types.ts Added optional relevanceScore, chunkRange, and chunkIndex to RelevantDocSchema.
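One way the batched vector deletion mentioned above could look is sketched below. This is an assumption-laden illustration, not the orchestrator's actual code: `deleteKey` is a hypothetical stand-in for a call like `ctx.vector.delete`, and it also folds in the parallel, error-tolerant deletion the review discusses later.

```typescript
// Sketch: delete keys in batches (e.g. 100 at a time), running each batch
// concurrently and logging failures without aborting the rest.
async function deleteInBatches(
  keys: string[],
  deleteKey: (key: string) => Promise<void>, // stand-in for ctx.vector.delete
  batchSize = 100
): Promise<void> {
  for (let start = 0; start < keys.length; start += batchSize) {
    const batch = keys.slice(start, start + batchSize);
    const results = await Promise.allSettled(batch.map((key) => deleteKey(key)));
    for (const [i, res] of results.entries()) {
      if (res.status === 'rejected') {
        console.error(`Failed to delete vector ${batch[i]}:`, res.reason);
      }
    }
  }
}
```

Chunking the key list keeps the number of in-flight requests bounded, while `Promise.allSettled` ensures one failed deletion does not abort the whole batch.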

Sequence Diagram(s)

sequenceDiagram
    participant User
    participant Retriever
    participant VectorStore

    User->>Retriever: Ask question
    Retriever->>VectorStore: Vector search with prompt
    VectorStore-->>Retriever: Return top N chunks
    Retriever->>Retriever: Group chunks by path
    loop For each path group
        Retriever->>VectorStore: Fetch neighboring chunks (expand context)
        VectorStore-->>Retriever: Return expanded chunks
        Retriever->>Retriever: Concatenate and sort chunks, preserve relevance
    end
    Retriever-->>User: Return expanded relevant documents
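The "group chunks by path" step in the diagram can be sketched as follows. Field names mirror the PR description but are assumptions, not the actual SDK types:

```typescript
// Illustrative shape of a vector search hit; not the real @agentuity/sdk type.
interface SearchHit {
  path: string;
  content: string;
  relevanceScore?: number;
  chunkIndex?: number;
}

// Group hits by document path so each document is expanded only once,
// avoiding redundant neighbor-fetch queries per hit.
function groupByPath(hits: SearchHit[]): Map<string, SearchHit[]> {
  const groups = new Map<string, SearchHit[]>();
  for (const hit of hits) {
    const group = groups.get(hit.path) ?? [];
    group.push(hit);
    groups.set(hit.path, group);
  }
  return groups;
}
```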

Suggested reviewers

  • jhaynie
  • rblalock
  • mcongrove

Poem

In fields of docs, the chunks now know
Their total kin, in lines they grow.
Neighbors gathered, context wide—
More helpful answers now reside.
With logs and prompts both crisp and clear,
The code hops forward—rabbits cheer! 🐇✨


📜 Recent review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between e5d713e and 5f8c376.

📒 Files selected for processing (1)
  • agent-docs/src/agents/doc-qa/rag.ts (3 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • agent-docs/src/agents/doc-qa/rag.ts
⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (1)
  • GitHub Check: Workers Builds: docs


@afterrburn afterrburn requested a review from mcongrove July 6, 2025 14:55

cloudflare-workers-and-pages bot commented Jul 6, 2025

Deploying with Cloudflare Workers

The latest updates on your project. Learn more about integrating Git with Workers.

Status: ✅ Deployment successful! (View logs)
Name: docs
Latest Commit: 5f8c376
Updated (UTC): Jul 10 2025, 12:10 AM

@coderabbitai coderabbitai bot requested a review from jhaynie July 6, 2025 14:55
Contributor

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 3

🔭 Outside diff range comments (1)
agent-docs/src/agents/doc-processing/docs-orchestrator.ts (1)

92-94: Use the correct logger specifier for the upsert count

We’ve confirmed via the JavaScript SDK docs that ctx.vector.upsert returns a Promise<string[]>—i.e., an array of string IDs—so accessing result.length is safe. To improve clarity and formatting, switch the logger’s placeholder from %o (object) to %d (decimal number).

• File: agent-docs/src/agents/doc-processing/docs-orchestrator.ts
Line 93

Suggested change:

- ctx.logger.info('Upserted chunk: %o', result.length);
+ ctx.logger.info('Upserted chunk count: %d', result.length);
🧹 Nitpick comments (3)
agent-docs/src/agents/doc-processing/docs-orchestrator.ts (1)

96-96: Use correct format specifier for numeric value.

-      ctx.logger.info('Upserted total %o chunks for file: %s', chunks.length, logicalPath);
+      ctx.logger.info('Upserted total %d chunks for file: %s', chunks.length, logicalPath);
agent-docs/src/agents/doc-qa/retriever.ts (2)

22-22: Consider making contextWindow configurable.

The context window is hardcoded to 1. This should be configurable to allow flexibility in how much surrounding context is included.

Consider moving this to a configuration constant or accepting it as a parameter:

 async function expandPathGroup(
   ctx: AgentContext,
   path: string,
   pathChunks: Array<{
     path: string;
     content: string;
     relevanceScore?: number;
     chunkIndex?: number;
-  }>
+  }>,
+  contextWindow: number = 1
 ): Promise<RelevantDoc | null> {
-  const contextWindow = 1; // Get 1 chunk before and after each chunk

92-92: Use appropriate format specifier for relevance score.

-    ctx.logger.debug('Expanded path %s with %d chunks (range: %s) score %d', path, foundChunks.length, chunkRange, bestScore);
+    ctx.logger.debug('Expanded path %s with %d chunks (range: %s) score %f', path, foundChunks.length, chunkRange, bestScore);
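As a brief illustration of why the specifier matters, Node's `util.format` (whose printf-style placeholders many loggers mirror; whether `ctx.logger` behaves identically is an assumption) treats `%d`, `%f`, and `%o` differently:

```typescript
import { format } from 'node:util';

// %d coerces its argument to a number (intended for counts),
// %f formats floating-point values, and %o produces an object
// inspection dump — the wrong choice for a plain number.
const count = format('Upserted chunk count: %d', 42);
const score = format('score %f', 0.87);
const dump = format('result: %o', { ids: ['a', 'b'] });
```

Using `%o` on a number still prints it, but signals object inspection to readers of the code; `%d`/`%f` state the intent precisely.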
📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 7f7620e and e5d713e.

📒 Files selected for processing (8)
  • agent-docs/config.ts (1 hunks)
  • agent-docs/src/agents/doc-processing/chunk-mdx.ts (2 hunks)
  • agent-docs/src/agents/doc-processing/docs-orchestrator.ts (2 hunks)
  • agent-docs/src/agents/doc-processing/docs-processor.ts (1 hunks)
  • agent-docs/src/agents/doc-processing/types.ts (1 hunks)
  • agent-docs/src/agents/doc-qa/rag.ts (3 hunks)
  • agent-docs/src/agents/doc-qa/retriever.ts (1 hunks)
  • agent-docs/src/agents/doc-qa/types.ts (1 hunks)
🧰 Additional context used
📓 Path-based instructions (1)

agent-docs/src/agents/**/*.ts: Use TypeScript for better type safety and IDE support
Import types from @agentuity/sdk
Use structured error handling with try/catch blocks
Leverage the provided logger for consistent logging
Use the storage APIs for persisting data

📄 Source: CodeRabbit Inference Engine (agent-docs/.cursor/rules/sdk.mdc)

List of files the instruction was applied to:

  • agent-docs/src/agents/doc-processing/docs-processor.ts
  • agent-docs/src/agents/doc-processing/types.ts
  • agent-docs/src/agents/doc-processing/chunk-mdx.ts
  • agent-docs/src/agents/doc-qa/types.ts
  • agent-docs/src/agents/doc-processing/docs-orchestrator.ts
  • agent-docs/src/agents/doc-qa/rag.ts
  • agent-docs/src/agents/doc-qa/retriever.ts
🧠 Learnings
📓 Common learnings
Learnt from: afterrburn
PR: agentuity/docs#195
File: agent-docs/src/agents/doc-processing/docs-processor.ts:7-15
Timestamp: 2025-06-18T04:07:57.583Z
Learning: In the agent-docs project, the ChunkMetadata type requires title and description fields to be mandatory strings rather than optional, indicating that the system ensures these fields are always populated during document processing.
Learnt from: afterrburn
PR: agentuity/docs#195
File: agent-docs/src/agents/doc-processing/docs-processor.ts:27-33
Timestamp: 2025-06-18T04:09:27.663Z
Learning: The `embedChunks` function in the agent-docs project maintains strict order and count consistency with input chunks, so positional correspondence between chunks.map() and embeddings[index] is reliable without additional safeguarding.
agent-docs/src/agents/doc-qa/rag.ts
Learnt from: afterrburn
PR: agentuity/docs#208
File: agent-docs/src/agents/doc-qa/index.ts:27-28
Timestamp: 2025-06-28T18:06:02.340Z
Learning: The answerQuestion function in agent-docs/src/agents/doc-qa/rag.ts handles errors internally by logging them and returning a fallback JSON answer, so it doesn't require external try-catch blocks at the call site.
⏰ Context from checks skipped due to timeout of 90000ms (1)
  • GitHub Check: Workers Builds: docs
🔇 Additional comments (10)
agent-docs/config.ts (1)

2-2: LGTM: Vector search reduction aligns with improved RAG approach.

Reducing the vector search number from 20 to 10 makes sense given the new chunk grouping and expansion strategy. This should improve performance while maintaining relevance quality.

agent-docs/src/agents/doc-processing/chunk-mdx.ts (2)

14-14: LGTM: Added totalChunks property for chunk context awareness.

The addition of totalChunks to the Chunk type provides valuable context about the document's chunk distribution, supporting the improved RAG approach.


148-148: LGTM: Correct implementation of totalChunks assignment.

The assignment of totalChunks: chunks.length is accurate and provides each chunk with awareness of the total number of chunks in the document.

agent-docs/src/agents/doc-processing/docs-processor.ts (1)

27-27: LGTM: Proper propagation of totalChunks to vector metadata.

The addition of totalChunks: chunk.totalChunks to the metadata object ensures consistent chunk context information throughout the embedding pipeline.

agent-docs/src/agents/doc-processing/types.ts (1)

22-22: LGTM: Type consistency maintained with totalChunks addition.

The addition of totalChunks: number to the ChunkMetadata type maintains proper type safety and consistency with the extended Chunk type.

agent-docs/src/agents/doc-qa/types.ts (1)

6-8: LGTM: Schema extensions support enhanced RAG context injection.

The addition of relevanceScore, chunkRange, and chunkIndex as optional fields properly extends the schema to support the new chunk grouping and expansion approach. The optional nature and correct typing make these additions flexible and type-safe.

agent-docs/src/agents/doc-processing/docs-orchestrator.ts (1)

15-19: Clarify the use of whitespace query for vector search.

Using a single space ' ' as the query appears to be a workaround to retrieve all vectors. Is this the intended API pattern? Consider documenting this behavior or checking if there's a more explicit way to query all vectors with the given metadata filter.

agent-docs/src/agents/doc-qa/rag.ts (1)

17-101: System prompt refactoring looks good!

The simplified prompt structure with a single <USER_QUESTION> tag and the addition of user context (software engineer) should help generate more targeted responses. The separation of system prompt and user-specific instructions is clean.

agent-docs/src/agents/doc-qa/retriever.ts (2)

173-184: Well-structured chunk expansion implementation!

The approach of grouping chunks by path before expansion is efficient and avoids redundant queries. The error handling and logging throughout the function provide good observability.


142-145: Verify and strengthen the vector search return type for similarity

We’re currently doing:

const relevanceScore = (vector as any).similarity;

which sidesteps TypeScript’s checks. Since we couldn’t locate a local Vector…Result/Response type in the repo, please:

  • Inspect the ctx.vector.search return type in @agentuity/sdk (via its .d.ts).
  • Import or extend the correct interface (e.g. VectorSearchResult) so it includes similarity: number.
  • Replace the as any cast with that concrete type.

Example diff in agent-docs/src/agents/doc-qa/retriever.ts:

-import type { AgentContext } from '@agentuity/sdk';
+import type { AgentContext, VectorSearchResult } from '@agentuity/sdk';

-for (const vector of vectors) {
+for (const vector of vectors as VectorSearchResult[]) {
   // …
-  const relevanceScore = (vector as any).similarity;
+  const relevanceScore = vector.similarity;
   // …
 }

If the SDK doesn’t yet export such a type, declare a local interface that mirrors its shape and use that instead.

Comment on lines 25 to 28
// Delete vectors one by one to avoid issues with large batches
for (const vector of vectors) {
await ctx.vector.delete(vectorStoreName, vector.key);
}
Contributor


💡 Verification agent

🧩 Analysis chain

Consider batch deletion for better performance.

Deleting vectors one by one could be inefficient, especially with 100 vectors per batch. This results in 100 individual API calls per iteration.

Check if the vector API supports batch deletion:

If batch deletion is available, consider refactoring:

-    // Delete vectors one by one to avoid issues with large batches
-    for (const vector of vectors) {
-      await ctx.vector.delete(vectorStoreName, vector.key);
-    }
+    // Delete vectors in batch
+    const keys = vectors.map(v => v.key);
+    await ctx.vector.deleteMany(vectorStoreName, keys);

🏁 Script executed:

#!/bin/bash
# Search for batch delete methods in the SDK or vector-related files
rg -A 5 "batch.*delete|delete.*batch|deleteMany|bulkDelete" --type ts

Length of output: 335


Unify and parallelize vector deletions for improved performance and reliability

No native batch‐delete method was found in the SDK, so we should at least run deletions in parallel and add error handling. Replace the sequential for…await loop with something like:

-    // Delete vectors one by one to avoid issues with large batches
-    for (const vector of vectors) {
-      await ctx.vector.delete(vectorStoreName, vector.key);
-    }
+    // Delete vectors in parallel with individual error handling
+    const results = await Promise.allSettled(
+      vectors.map(v => ctx.vector.delete(vectorStoreName, v.key))
+    );
+    results.forEach((res, i) => {
+      if (res.status === 'rejected') {
+        logger.error(
+          `Failed to delete vector ${vectors[i].key}:`,
+          res.reason
+        );
+      }
+    });

• This runs all deletions concurrently to reduce overall latency.
• Using Promise.allSettled lets us log failures without aborting the entire batch.
• For very large batches, consider chunking the vectors array to limit in‐flight requests.

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change
// Delete vectors one by one to avoid issues with large batches
for (const vector of vectors) {
await ctx.vector.delete(vectorStoreName, vector.key);
}
// Delete vectors in parallel with individual error handling
const results = await Promise.allSettled(
vectors.map(v => ctx.vector.delete(vectorStoreName, v.key))
);
results.forEach((res, i) => {
if (res.status === 'rejected') {
logger.error(
`Failed to delete vector ${vectors[i].key}:`,
res.reason
);
}
});
🤖 Prompt for AI Agents
In agent-docs/src/agents/doc-processing/docs-orchestrator.ts around lines 25 to
28, the current code deletes vectors sequentially which is inefficient. Replace
the for-await loop with a parallel deletion approach using Promise.allSettled to
run all deletions concurrently and handle errors individually by logging
failures without stopping the entire batch. Optionally, implement chunking of
the vectors array if the batch size is very large to limit the number of
simultaneous requests.

const expandedContent = sortedChunks.map(chunk => chunk.text).join('\n\n');

// Find the best relevance score from the original chunks
const bestScore = Math.max(...pathChunks.map(chunk => chunk.relevanceScore || 0));
Contributor


⚠️ Potential issue

Handle empty array case for Math.max.

Math.max() returns -Infinity when called with an empty array or when all relevanceScores are undefined.

-    const bestScore = Math.max(...pathChunks.map(chunk => chunk.relevanceScore || 0));
+    const scores = pathChunks.map(chunk => chunk.relevanceScore || 0);
+    const bestScore = scores.length > 0 ? Math.max(...scores) : 0;
📝 Committable suggestion


Suggested change
const bestScore = Math.max(...pathChunks.map(chunk => chunk.relevanceScore || 0));
const scores = pathChunks.map(chunk => chunk.relevanceScore || 0);
const bestScore = scores.length > 0 ? Math.max(...scores) : 0;
🤖 Prompt for AI Agents
In agent-docs/src/agents/doc-qa/retriever.ts at line 85, the use of Math.max on
an array derived from pathChunks can result in -Infinity if the array is empty
or all relevanceScores are undefined. To fix this, check if pathChunks is empty
or if all relevanceScores are undefined before calling Math.max, and provide a
default value (e.g., 0) to use in that case to avoid -Infinity.

Comment on lines +26 to +35
for (const chunk of pathChunks) {
  if (chunk.chunkIndex !== undefined) {
    const targetIndex = chunk.chunkIndex;

    // Add the chunk itself and its neighbors
    expandedChunkIndices.add(targetIndex - contextWindow);
    expandedChunkIndices.add(targetIndex);
    expandedChunkIndices.add(targetIndex + contextWindow);
  }
  try {
    const vectors = await ctx.vector.search(VECTOR_STORE_NAME, dbQuery);
}
Contributor

💡 Verification agent

🧩 Analysis chain

Add upper bound validation for chunk indices.

The code adds neighboring chunk indices without checking against totalChunks. While negative indices are filtered out, indices beyond the document's total chunks are not validated.

First, let's verify if totalChunks metadata is available:

If available, add upper bound checking:

   // Add neighbors for each chunk to the set
   for (const chunk of pathChunks) {
     if (chunk.chunkIndex !== undefined) {
       const targetIndex = chunk.chunkIndex;
+      // Assuming totalChunks is available in metadata
+      const totalChunks = chunk.totalChunks || Number.MAX_SAFE_INTEGER;
       
       // Add the chunk itself and its neighbors
-      expandedChunkIndices.add(targetIndex - contextWindow);
+      if (targetIndex - contextWindow >= 0) {
+        expandedChunkIndices.add(targetIndex - contextWindow);
+      }
       expandedChunkIndices.add(targetIndex);
-      expandedChunkIndices.add(targetIndex + contextWindow);
+      if (targetIndex + contextWindow < totalChunks) {
+        expandedChunkIndices.add(targetIndex + contextWindow);
+      }
     }
   }

🏁 Script executed:

#!/bin/bash
# Check if totalChunks is part of the chunk metadata
rg -A 5 "totalChunks" --type ts

Length of output: 1901


Ensure upper‐bound validation when expanding chunk indices

Add checks against chunk.totalChunks so neighbor indices never exceed the last chunk.

  • File: agent-docs/src/agents/doc-qa/retriever.ts
  • Loop over pathChunks at lines ~26–35

Suggested diff:

   for (const chunk of pathChunks) {
     if (chunk.chunkIndex !== undefined) {
       const targetIndex = chunk.chunkIndex;
+      const totalChunks = chunk.totalChunks;

-      expandedChunkIndices.add(targetIndex - contextWindow);
+      if (targetIndex - contextWindow >= 0) {
+        expandedChunkIndices.add(targetIndex - contextWindow);
+      }
       expandedChunkIndices.add(targetIndex);
-      expandedChunkIndices.add(targetIndex + contextWindow);
+      if (targetIndex + contextWindow < totalChunks) {
+        expandedChunkIndices.add(targetIndex + contextWindow);
+      }
     }
   }

This guarantees all added indices stay within [0, totalChunks – 1].

🤖 Prompt for AI Agents
In agent-docs/src/agents/doc-qa/retriever.ts around lines 26 to 35, the code
expands chunk indices by adding neighbors without checking if these indices
exceed the upper bound defined by chunk.totalChunks. To fix this, add validation
to ensure that when adding targetIndex - contextWindow and targetIndex +
contextWindow, the indices stay within the range 0 to totalChunks - 1,
preventing out-of-bounds errors.
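The bounded expansion suggested above can be sketched as a standalone helper; the `ChunkRef` shape is a hypothetical reduction to just the fields this logic reads:

```typescript
// Hypothetical reduced chunk shape: only the fields the helper needs.
interface ChunkRef {
  chunkIndex?: number;
  totalChunks?: number;
}

// Expand each hit to include its neighbors, clamped to [0, totalChunks - 1].
function expandChunkIndices(pathChunks: ChunkRef[], contextWindow = 1): number[] {
  const expanded = new Set<number>();
  for (const chunk of pathChunks) {
    if (chunk.chunkIndex === undefined) continue;
    // Fall back to an effectively unbounded upper limit if metadata is absent.
    const total = chunk.totalChunks ?? Number.MAX_SAFE_INTEGER;
    const candidates = [
      chunk.chunkIndex - contextWindow,
      chunk.chunkIndex,
      chunk.chunkIndex + contextWindow,
    ];
    for (const idx of candidates) {
      if (idx >= 0 && idx < total) expanded.add(idx);
    }
  }
  return [...expanded].sort((a, b) => a - b);
}
```

Using a `Set` also deduplicates overlapping neighbor windows when hits are adjacent.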

=== CONTEXT ===
You will receive both the user's ORIGINAL question and a REPHRASED version that was optimized for document search. The rephrased version helped find the relevant documents, but you should answer the user's original intent.
Your role is to be as helpful as possible. You must first
Member

This seems cut off?

Contributor Author

100% 😰

@jhaynie
Member

jhaynie commented Jul 8, 2025

@afterrburn the SDK default if you don't pass a limit in your query is 10. you can pass a limit property in your query to change the number of results you want to return.
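Passing the limit explicitly might look like the sketch below; the query shape and search signature are assumptions based on this comment, not the documented SDK API, and the search function is injected so the sketch stays self-contained:

```typescript
// Hypothetical signatures: the real SDK query object may differ; only the
// `limit` field and its default of 10 are taken from the comment above.
type VectorSearch = (
  store: string,
  query: { query: string; limit?: number },
) => Promise<unknown[]>;

async function searchWithLimit(
  search: VectorSearch,
  store: string,
  text: string,
  limit = 10, // matches the SDK's stated default
): Promise<unknown[]> {
  return search(store, { query: text, limit });
}
```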

Signed-off-by: Seng Rith <50646727+afterrburn@users.noreply.github.com>
@coderabbitai coderabbitai bot requested review from mcongrove and rblalock July 10, 2025 00:08
@afterrburn afterrburn merged commit 0d9063e into main Jul 31, 2025
2 checks passed
@afterrburn afterrburn deleted the seng/rag-improvements branch July 31, 2025 13:00