Improve haystack_deep_research_agent example#1170
Merged
rapids-bot[bot] merged 2 commits intoNVIDIA:developfrom Nov 13, 2025
Merged
Improve haystack_deep_research_agent example#1170rapids-bot[bot] merged 2 commits intoNVIDIA:developfrom
rapids-bot[bot] merged 2 commits intoNVIDIA:developfrom
Conversation
WalkthroughIntegrates NVIDIA embedders into the Haystack deep research agent: configuration adds Changes
Sequence Diagram(s)sequenceDiagram
participant User
participant Indexer as Indexing Pipeline
participant RAG as RAG Pipeline
participant Store as Document Store
participant LLM
rect rgb(245, 250, 255)
Note over Indexer: Indexing flow (new)
User->>Indexer: submit PDFs + embedder_model
Indexer->>Indexer: pdf_converter → cleaner → splitter
Indexer->>Indexer: NvidiaDocumentEmbedder (embed PDFs)
Indexer->>Store: write documents (with embeddings)
end
rect rgb(245, 255, 245)
Note over RAG: RAG query flow (new)
User->>RAG: query + embedder_model
RAG->>RAG: NvidiaTextEmbedder (query → embedding)
RAG->>RAG: OpenSearchEmbeddingRetriever (use query embedding)
RAG->>Store: retrieve relevant docs
RAG->>RAG: prompt_builder → LLM call
RAG->>User: response
end
Estimated code review effort🎯 4 (Complex) | ⏱️ ~45 minutes
Suggested labels
Pre-merge checks and finishing touchesImportant Pre-merge checks failedPlease resolve all errors before merging. Addressing warnings is optional. ❌ Failed checks (1 warning, 1 inconclusive)
✅ Passed checks (1 passed)
✨ Finishing touches
🧪 Generate unit tests (beta)
📜 Recent review detailsConfiguration used: Path: .coderabbit.yaml Review profile: CHILL Plan: Pro 📒 Files selected for processing (1)
🚧 Files skipped from review as they are similar to previous changes (1)
Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out. Comment |
… NvidiaDocumentEmbedder ; Update tests ; Refactoring Signed-off-by: Michele Pangrazzi <xmikex83@gmail.com>
b360922 to
9683d71
Compare
willkill07
approved these changes
Nov 10, 2025
Member
|
/ok to test 9683d71 |
Member
|
/ok to test 7403c75 |
Member
|
/merge |
saglave
pushed a commit
to snps-scm13/SNPS-NeMo-Agent-Toolkit
that referenced
this pull request
Dec 11, 2025
This improves the Haystack Deep Research Agent example: - Updates the RAG / Agent LLM model to `nvidia/llama-3.3-nemotron-super-49b-v1.5` - Use embedding-based retrieval and document indexing replacing `OpenSearchBM25Retriever` with `OpenSearchEmbeddingRetriever` and `NvidiaDocumentEmbedder` Haystack components, improving search relevance and enabling actual semantic search. The main changes include switching to embedding models for both document indexing and retrieval Haystack pipelines, updating configuration to support embedding parameters, and refactoring pipelines to integrate embedding components. (retry of NVIDIA#1161) ## Summary by CodeRabbit * **New Features** * Integrated NVIDIA embedders for document indexing and query embedding; retrieval now uses embedding-based matching. * **Updates** * Language model identifiers updated to v1.5. * Configuration now requires an embedder model name and embedding_dim (1024 by default). * Startup indexing and retrieval flows updated to require and propagate the embedder selection. * **Tests** * Config validation updated to assert presence of embedding_dim. Authors: - Michele Pangrazzi (https://github.com/mpangrazzi) Approvers: - Will Killian (https://github.com/willkill07) URL: NVIDIA#1170 Signed-off-by: Sangharsh Aglave <aglave@synopsys.com>
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This improves the Haystack Deep Research Agent example:
nvidia/llama-3.3-nemotron-super-49b-v1.5OpenSearchBM25RetrieverwithOpenSearchEmbeddingRetrieverandNvidiaDocumentEmbedderHaystack components, improving search relevance and enabling actual semantic search. The main changes include switching to embedding models for both document indexing and retrieval Haystack pipelines, updating configuration to support embedding parameters, and refactoring pipelines to integrate embedding components.(retry of #1161)
Summary by CodeRabbit
New Features
Updates
Tests