feat: Add JSON output support to query_rag.py script #35
Conversation
Walkthrough: Adds a `--json` flag to `scripts/query_rag.py` that emits structured JSON output for both the llama-index and llama-stack query paths, while keeping the existing plain-text output.
Sequence Diagram(s)

```mermaid
sequenceDiagram
    autonumber
    participant U as User (CLI)
    participant Q as query_rag.py
    participant I as llama_index
    participant S as llama_stack
    U->>Q: run CLI (--source [index|stack], --json?, --node?, --top_k, --threshold)
    alt source == llama_index
        Q->>I: fetch node(s) or specific node
        I-->>Q: nodes[], scores, metadata
        alt --json
            Q-->>U: JSON { query, type?, node_id?, top_k?, threshold?, nodes: [...] }
        else non-JSON
            Q-->>U: textual node blocks or error text
        end
    else source == llama_stack
        Q->>S: query with cfg (vector, chunk_size=380, overlap=0, max_chunks=top_k)
        S-->>Q: chunks[], scores
        alt --json
            Q-->>U: JSON { query, top_k, threshold, nodes: [...] }
        else non-JSON
            Q-->>U: textual chunk blocks or error text
        end
    end
    opt Threshold or empty-results
        note right of Q: emits JSON error/info payload or logs/writes text
    end
```
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~45 minutes
Actionable comments posted: 3
🧹 Nitpick comments (8)
scripts/query_rag.py (8)
55-65: Include query parameters in "no nodes" error payload for parity

Consider including `top_k` and `threshold` in the error payload to keep the JSON error shape aligned with success payloads and aid debugging.

```diff
- result = {
-     "query": args.query,
-     "error": f"No nodes retrieved for query: {args.query}",
-     "nodes": []
- }
+ result = {
+     "query": args.query,
+     "top_k": args.top_k,
+     "threshold": args.threshold,
+     "error": f"No nodes retrieved for query: {args.query}",
+     "nodes": []
+ }
```
101-106: Optional: Improve non-JSON text output readability

Printing the raw `NodeWithScore` object may not be as readable as printing ID/score/text explicitly. Consider formatting similarly to the JSON fields, as sketched below.

Example tweak (outside current range for brevity): print node_id, score, and the first N chars of text for each item.
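A minimal sketch of such a formatter (the helper name `print_node_block` and the `max_chars` default are illustrative, not existing code):

```python
def print_node_block(node_id, score, text, max_chars=200):
    """Print one retrieved node as a readable text block."""
    preview = text if len(text) <= max_chars else text[:max_chars] + "..."
    print("=" * 80)
    print(f"Node ID: {node_id}\nScore: {score}\nText:\n{preview}")
```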
154-163: Include query parameters in "no chunks" error for consistency

Similar to the llama-index path, including `top_k` and `threshold` helps debugging and keeps the structure consistent.

```diff
- result = {
-     "query": args.query,
-     "error": f"No chunks retrieved for query: {args.query}",
-     "chunks": []
- }
+ result = {
+     "query": args.query,
+     "top_k": args.top_k,
+     "threshold": args.threshold,
+     "error": f"No chunks retrieved for query: {args.query}",
+     "chunks": []
+ }
```
185-191: Align JSON schema across vector stores; also fix text-mode score display

The PR description promises a consistent JSON structure. The llama-index path emits `nodes: [{id, score, text, metadata}]`, while the llama-stack path emits `chunks` with `document_id` and no `metadata`. Consider unifying to `nodes` with `id` and `metadata` (even if empty), and fix text mode to print a numeric score.

If consistency is required, apply this diff:

```diff
- result = {
-     "query": args.query,
-     "top_k": args.top_k,
-     "threshold": args.threshold,
-     "chunks": []
- }
+ result = {
+     "query": args.query,
+     "top_k": args.top_k,
+     "threshold": args.threshold,
+     "nodes": []
+ }
@@
- for _id, chunk, score in zip(md["document_ids"], md["chunks"], md["scores"]):
-     chunk_data = {
-         "document_id": _id,
-         "score": score.score if hasattr(score, 'score') else score,
-         "text": chunk
-     }
-     result["chunks"].append(chunk_data)
+ for _id, chunk, score in zip(md["document_ids"], md["chunks"], md["scores"]):
+     score_val = score.score if hasattr(score, 'score') else score
+     node_data = {
+         "id": _id,
+         "score": score_val,
+         "text": chunk,
+         "metadata": {}
+     }
+     result["nodes"].append(node_data)
@@
- if args.json:
-     print(json.dumps(result, indent=2))
- else:
-     for _id, chunk, score in zip(md["document_ids"], md["chunks"], md["scores"]):
-         print("=" * 80)
-         print(f"Node ID: {_id}\nScore: {score}\nText:\n{chunk}")
+ if args.json:
+     print(json.dumps(result, indent=2))
+ else:
+     for _id, chunk, score in zip(md["document_ids"], md["chunks"], md["scores"]):
+         score_val = score.score if hasattr(score, 'score') else score
+         print("=" * 80)
+         print(f"Document ID: {_id}\nScore: {score_val}\nText:\n{chunk}")
```

If backward compatibility with `chunks` consumers is a concern, you could keep both keys temporarily (duplicate arrays) and deprecate `chunks` later.

Also applies to: 193-199, 200-205
258-263: Return richer JSON for unknown DB auto-detection

Include the provided `db_path` and note that auto-detection was attempted; that makes failures easier to debug from logs.

```diff
- if args.json:
-     print(json.dumps({"error": error_msg}, indent=2))
+ if args.json:
+     print(json.dumps(
+         {"error": error_msg, "db_path": args.db_path, "vector_store_type": "auto"},
+         indent=2
+     ))
```
142-150: Minor: consider with-context managers for file I/O

`open(...); yaml.safe_load(...)` and `yaml.safe_dump(..., open(...))` leave file handles open until GC. Using `with open(...)` blocks is cleaner and safer, as sketched below.
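For illustration, a hedged sketch of the with-block version (the helper names are hypothetical):

```python
import yaml

def load_yaml(path):
    # The with-block closes the handle deterministically instead of at GC time.
    with open(path) as f:
        return yaml.safe_load(f)

def dump_yaml(cfg, path):
    with open(path, "w") as f:
        yaml.safe_dump(cfg, f)
```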
200-205: Nit: label should say "Document ID" (llama-stack), and print numeric score

Text mode currently uses "Node ID" and prints the score object, not the numeric value, in some llama-stack configurations.

This is included in the unification diff above; if you don't unify schemas, at least change:

- "Node ID" -> "Document ID"
- `score.score if hasattr(score, "score") else score`
239-241: Optional: centralize JSON printing to reduce duplication

A tiny helper like `def _emit(result): print(json.dumps(result, indent=2)) if args.json else ...` would DRY up many branches and make future schema changes safer.

I can factor this into `print_json_or_text(args, payload, text_fn)` to consolidate all emit paths, as sketched below. Want me to push a patch?
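A minimal sketch of that helper, assuming `text_fn` is a zero-argument callback carrying the legacy plain-text printing (the signature is the reviewer's proposal, not existing code):

```python
import json

def print_json_or_text(args, payload, text_fn):
    """Emit a JSON document when --json is set; otherwise run the legacy text printer."""
    if args.json:
        print(json.dumps(payload, indent=2))
    else:
        text_fn()
```

Callers would pass a closure, e.g. `print_json_or_text(args, result, lambda: print(node))`.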
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
💡 Knowledge Base configuration:
- MCP integration is disabled by default for public repositories
- Jira integration is disabled by default for public repositories
- Linear integration is disabled by default for public repositories
You can enable these sources in your CodeRabbit configuration.
📒 Files selected for processing (1)
scripts/query_rag.py (5 hunks)
🔇 Additional comments (6)
scripts/query_rag.py (6)
5-5: Importing json is appropriate and scoped correctly

No concerns. The import is used consistently for JSON output.
67-83: Threshold error path looks fine

The message is clear and the JSON payload includes threshold and top_score. No functional issues here.
143-147: Query config changes LGTM

`chunk_size_in_tokens`, `mode`, and `chunk_overlap_in_tokens` look reasonable defaults and keep behavior explicit (see the sketch below).
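For reference, a sketch of the query config these names imply, using the values shown in the sequence diagram above (the exact llama-stack parameter shape is an assumption):

```python
def build_query_config(top_k: int) -> dict:
    # Illustrative only: field names from the review, values from the sequence diagram.
    return {
        "mode": "vector",
        "chunk_size_in_tokens": 380,
        "chunk_overlap_in_tokens": 0,
        "max_chunks": top_k,
    }
```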
238-242: CLI: --json flag addition looks good

Flag semantics are clear and match output behavior.
246-247: Suppressing command echo in JSON mode is correct

Prevents polluting the JSON stream. Good call.
85-91: Result scaffolding for llama-index path looks good

Fields `query`, `top_k`, `threshold`, and `nodes` provide a clean envelope for results.
lpiwowar
left a comment
Good first iteration! 👍 Two suggestions that can make the code better, IMO :)
Actionable comments posted: 1
♻️ Duplicate comments (3)
scripts/query_rag.py (3)
37-51: Guard get_node() returning None and make single-node payload attribute-safe

If the node is missing, this will raise AttributeError, and even when present, id/text access can differ across llama-index versions.

Apply this diff:

```diff
- node = storage_context.docstore.get_node(args.node)
- result = {
-     "query": args.query,
-     "type": "single_node",
-     "node_id": args.node,
-     "node": {
-         "id": node.node_id,
-         "text": node.text,
-         "metadata": node.metadata if hasattr(node, 'metadata') else {}
-     }
- }
- if args.json:
-     print(json.dumps(result, indent=2))
- else:
-     print(node)
+ node = storage_context.docstore.get_node(args.node)
+ if node is None:
+     error_msg = f"Node '{args.node}' not found."
+     payload = {
+         "query": args.query,
+         "type": "single_node",
+         "node_id": args.node,
+         "node": None,
+         "error": error_msg,
+     }
+     if args.json:
+         print(json.dumps(payload, indent=2))
+     else:
+         print(error_msg)
+     exit(1)
+ result = {
+     "query": args.query,
+     "type": "single_node",
+     "node_id": args.node,
+     "node": {
+         "id": getattr(node, "node_id", getattr(node, "id_", None)),
+         "text": getattr(node, "text", None) if hasattr(node, "text") else (node.get_content() if hasattr(node, "get_content") else None),
+         "metadata": getattr(node, "metadata", {}) or {},
+     },
+ }
+ if args.json:
+     print(json.dumps(result, indent=2))
+ else:
+     print(node)
```
91-98: NodeWithScore vs BaseNode: access underlying node to avoid AttributeError

retriever.retrieve() yields NodeWithScore; id/text/metadata live on the underlying BaseNode.

```diff
- for node in nodes:
-     node_data = {
-         "id": node.node_id,
-         "score": node.score,
-         "text": node.text,
-         "metadata": node.metadata if hasattr(node, 'metadata') else {}
-     }
-     result["nodes"].append(node_data)
+ for nws in nodes:
+     base = getattr(nws, "node", nws)
+     node_data = {
+         "id": getattr(base, "node_id", getattr(base, "id_", None)),
+         "score": getattr(nws, "score", None),
+         "text": getattr(base, "text", None) if hasattr(base, "text") else (base.get_content() if hasattr(base, "get_content") else None),
+         "metadata": getattr(base, "metadata", {}) or {},
+     }
+     result["nodes"].append(node_data)
```
179-194: Threshold check can crash if md['scores'][0] is a float; normalize top_score first

You later handle both shapes when formatting; apply the same robustness here.

```diff
- threshold = args.threshold
- if threshold > 0.0 and md.get("scores") and md["scores"][0].score < threshold:
-     logging.error(
-         f"Score {md['scores'][0].score} of the top retrieved node for query '{args.query}' "
-         f"didn't cross the minimal threshold {threshold}."
-     )
-     if args.json:
-         result = {
-             "query": args.query,
-             "top_k": args.top_k,
-             "threshold": args.threshold,
-             "top_score": md["scores"][0].score,
-             "nodes": []
-         }
-         print(json.dumps(result, indent=2))
-     exit(1)
+ threshold = args.threshold
+ if threshold > 0.0 and md.get("scores"):
+     _top = md["scores"][0]
+     top_score = _top.score if hasattr(_top, "score") else _top
+     if top_score < threshold:
+         logging.error(
+             f"Score {top_score} of the top retrieved node for query '{args.query}' "
+             f"didn't cross the minimal threshold {threshold}."
+         )
+         if args.json:
+             result = {
+                 "query": args.query,
+                 "top_k": args.top_k,
+                 "threshold": args.threshold,
+                 "top_score": top_score,
+                 "nodes": [],
+             }
+             print(json.dumps(result, indent=2))
+         exit(1)
```
🧹 Nitpick comments (5)
scripts/query_rag.py (5)
5-6: Imports look good; consider initializing logging to avoid polluting JSON output

Since you're relying on logging for errors, initialize logging in main so logs go to stderr and don't interleave with JSON on stdout. Keep it terse in JSON mode.

Add this near argument parsing (no diff, outside changed lines):

```python
# After args = parser.parse_args()
logging.basicConfig(
    level=logging.ERROR if args.json else logging.INFO,
    format="%(levelname)s: %(message)s",
)
```
56-66: Return a structured JSON error for "no nodes retrieved"

The current JSON has no "error" field, which makes it harder to programmatically detect failures.

```diff
- if len(nodes) == 0:
-     logging.error(f"No nodes retrieved for query: {args.query}")
-     if args.json:
-         result = {
-             "query": args.query,
-             "top_k": args.top_k,
-             "threshold": args.threshold,
-             "nodes": []
-         }
-         print(json.dumps(result, indent=2))
+ if len(nodes) == 0:
+     error_msg = f"No nodes retrieved for query: {args.query}"
+     logging.error(error_msg)
+     if args.json:
+         result = {
+             "query": args.query,
+             "top_k": args.top_k,
+             "threshold": args.threshold,
+             "nodes": [],
+             "error": error_msg,
+         }
+         print(json.dumps(result, indent=2))
      exit(1)
```
167-176: Return a structured JSON error for “no chunks retrieved”Same as the llama-index path: include an "error" field so clients can detect failure.
- if len(md["chunks"]) == 0: - logging.error(f"No chunks retrieved for query: {args.query}") - if args.json: - result = { - "query": args.query, - "top_k": args.top_k, - "threshold": args.threshold, - "nodes": [] - } - print(json.dumps(result, indent=2)) + if len(md["chunks"]) == 0: + error_msg = f"No chunks retrieved for query: {args.query}" + logging.error(error_msg) + if args.json: + result = { + "query": args.query, + "top_k": args.top_k, + "threshold": args.threshold, + "nodes": [], + "error": error_msg, + } + print(json.dumps(result, indent=2)) exit(1)
203-211: Optional: include metadata in llama-stack results when available

If llama-stack returns metadata (e.g., md["metadatas"]), consider including it to match the schema parity with the llama-index path and the PR's examples.

If md contains a "metadatas" list aligned with document_ids, adjust the loop accordingly (illustrative snippet outside the changed range):

```python
metas = md.get("metadatas")
for idx, (_id, chunk, score) in enumerate(zip(md["document_ids"], md["chunks"], md["scores"])):
    metadata = (metas[idx] if metas and idx < len(metas) else {}) or {}
    node_data = {
        "id": _id,
        "score": score.score if hasattr(score, "score") else score,
        "text": chunk,
        "metadata": metadata,
    }
    result["nodes"].append(node_data)
```
276-281: Enrich unknown-DB JSON error with context

Include db_path to aid callers and keep error payloads informative.

```diff
- if args.json:
-     print(json.dumps({"error": error_msg}, indent=2))
+ if args.json:
+     print(json.dumps({"error": error_msg, "db_path": args.db_path}, indent=2))
  else:
      print(error_msg)
```
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
💡 Knowledge Base configuration:
- MCP integration is disabled by default for public repositories
- Jira integration is disabled by default for public repositories
- Linear integration is disabled by default for public repositories
You can enable these sources in your CodeRabbit configuration.
📒 Files selected for processing (1)
scripts/query_rag.py (5 hunks)
🔇 Additional comments (3)
scripts/query_rag.py (3)
212-217: LGTM: preserves plain-text output for non-JSON mode

Text output remains backward-compatible.
256-260: LGTM: CLI --json flag addition

Clear help text and consistent behavior toggle.
264-265: LGTM: suppress banner when JSON is requested

Prevents non-JSON noise on stdout.
Force-pushed: bb2d979 to 23dde0e
Actionable comments posted: 0
♻️ Duplicate comments (3)
scripts/query_rag.py (3)
36-51: Guard against missing node and make attribute access robust for single-node path

Dereferencing `node.node_id`/`node.text` will crash when `get_node()` returns None; also attribute names vary across versions (`id_`, `get_content()`). Return a structured error in JSON mode and use resilient getters.

Apply this diff:

```diff
- node = storage_context.docstore.get_node(args.node)
- result = {
-     "query": args.query,
-     "type": "single_node",
-     "node_id": args.node,
-     "node": {
-         "id": node.node_id,
-         "text": node.text,
-         "metadata": node.metadata if hasattr(node, 'metadata') else {}
-     }
- }
- if args.json:
-     print(json.dumps(result, indent=2))
- else:
-     print(node)
+ node = storage_context.docstore.get_node(args.node)
+ if node is None:
+     error_msg = f"Node '{args.node}' not found."
+     result = {
+         "query": args.query,
+         "type": "single_node",
+         "node_id": args.node,
+         "node": None,
+         "error": error_msg,
+     }
+     if args.json:
+         print(json.dumps(result, indent=2))
+     else:
+         print(error_msg)
+     exit(1)
+ # Node exists; normalize attributes across versions
+ base = node
+ result = {
+     "query": args.query,
+     "type": "single_node",
+     "node_id": args.node,
+     "node": {
+         "id": getattr(base, "node_id", getattr(base, "id_", None)),
+         "text": getattr(base, "text", None) if hasattr(base, "text") else (base.get_content() if hasattr(base, "get_content") else None),
+         "metadata": getattr(base, "metadata", {}) or {},
+     },
+ }
+ if args.json:
+     print(json.dumps(result, indent=2))
+ else:
+     print(node)
```
84-100: NodeWithScore vs BaseNode: extract BaseNode before reading id/text/metadata

`retriever.retrieve()` returns NodeWithScore items; reading `node_id`, `text`, `metadata` directly will break. Normalize via `nws.node` and use safe fallbacks.

```diff
- for node in nodes:
-     node_data = {
-         "id": node.node_id,
-         "score": node.score,
-         "text": node.text,
-         "metadata": node.metadata if hasattr(node, 'metadata') else {}
-     }
-     result["nodes"].append(node_data)
+ for nws in nodes:
+     base = getattr(nws, "node", nws)
+     node_data = {
+         "id": getattr(base, "node_id", getattr(base, "id_", None)),
+         "score": getattr(nws, "score", None),
+         "text": getattr(base, "text", None) if hasattr(base, "text") else (base.get_content() if hasattr(base, "get_content") else None),
+         "metadata": getattr(base, "metadata", {}) or {},
+     }
+     result["nodes"].append(node_data)
```
178-194: Threshold check may crash: md['scores'][0] can be a float; normalize top_score and include "error"

Accessing `.score` blindly can raise AttributeError. Normalize first, then compare and emit structured output. Also avoid logging when in JSON mode.

```diff
- threshold = args.threshold
- if threshold > 0.0 and md.get("scores") and md["scores"][0].score < threshold:
-     logging.error(
-         f"Score {md['scores'][0].score} of the top retrieved node for query '{args.query}' "
-         f"didn't cross the minimal threshold {threshold}."
-     )
-     if args.json:
-         result = {
-             "query": args.query,
-             "top_k": args.top_k,
-             "threshold": args.threshold,
-             "top_score": md["scores"][0].score,
-             "nodes": []
-         }
-         print(json.dumps(result, indent=2))
-     exit(1)
+ threshold = args.threshold
+ if threshold > 0.0 and md.get("scores"):
+     _top = md["scores"][0]
+     top_score = _top.score if hasattr(_top, "score") else _top
+     if top_score < threshold:
+         error_msg = (
+             f"Score {top_score} of the top retrieved node for query '{args.query}' "
+             f"didn't cross the minimal threshold {threshold}."
+         )
+         if args.json:
+             result = {
+                 "query": args.query,
+                 "top_k": args.top_k,
+                 "threshold": args.threshold,
+                 "top_score": top_score,
+                 "nodes": [],
+                 "error": error_msg,
+             }
+             print(json.dumps(result, indent=2))
+         else:
+             logging.error(error_msg)
+         exit(1)
```
🧹 Nitpick comments (7)
scripts/query_rag.py (7)
56-66: In JSON mode, avoid logging noise and include an error field for machine consumers

The code currently logs the error unconditionally and returns JSON without an "error" message. For clean JSON output and actionable payloads, only log in the non-JSON path and add an "error" string.

```diff
- logging.error(f"No nodes retrieved for query: {args.query}")
- if args.json:
+ if not args.json:
+     logging.error(f"No nodes retrieved for query: {args.query}")
+ if args.json:
      result = {
          "query": args.query,
          "top_k": args.top_k,
          "threshold": args.threshold,
-         "nodes": []
+         "nodes": [],
+         "error": f"No nodes retrieved for query: {args.query}",
      }
      print(json.dumps(result, indent=2))
  exit(1)
```
68-83: Threshold failure: add non-JSON logging and include "error" in JSON payload

The non-JSON path exits silently; also the JSON result lacks an "error" message. Emit a human-readable log when not using --json and include "error" in JSON for consistency.

```diff
  if args.threshold > 0.0 and nodes[0].score < args.threshold:
      error_msg = (
          f"Score {nodes[0].score} of the top retrieved node for query '{args.query}' "
          f"didn't cross the minimal threshold {args.threshold}."
      )
      if args.json:
          result = {
              "query": args.query,
              "top_k": args.top_k,
              "threshold": args.threshold,
              "top_score": nodes[0].score,
-             "nodes": []
+             "nodes": [],
+             "error": error_msg,
          }
          print(json.dumps(result, indent=2))
+     else:
+         logging.error(error_msg)
      exit(1)
```
165-177: In JSON mode, avoid logging and add "error" key for no-chunks case

Mirror the faiss/llama-index handling: don't log when --json is set and include an error message in the JSON body.

```diff
- logging.error(f"No chunks retrieved for query: {args.query}")
- if args.json:
+ if not args.json:
+     logging.error(f"No chunks retrieved for query: {args.query}")
+ if args.json:
      result = {
          "query": args.query,
          "top_k": args.top_k,
          "threshold": args.threshold,
-         "nodes": []
+         "nodes": [],
+         "error": f"No chunks retrieved for query: {args.query}",
      }
      print(json.dumps(result, indent=2))
  exit(1)
```
212-218: Normalize score for human-readable non-JSON output

For textual output, print the normalized float value instead of a potential wrapper object.

```diff
- else:
-     for _id, chunk, score in zip(md["document_ids"], md["chunks"], md["scores"]):
-         print("=" * 80)
-         print(f"Node ID: {_id}\nScore: {score}\nText:\n{chunk}")
+ else:
+     for _id, chunk, score in zip(md["document_ids"], md["chunks"], md["scores"]):
+         _score = score.score if hasattr(score, "score") else score
+         print("=" * 80)
+         print(f"Node ID: {_id}\nScore: {_score}\nText:\n{chunk}")
```
264-266: Configure logging based on --json to keep stdout strictly JSON

To avoid mixing logs with JSON output, disable logging in JSON mode and set a default config otherwise.

```diff
- if not args.json:
-     print("Command line used: " + " ".join(sys.argv))
+ # Configure logging early: suppress logs in JSON mode to keep stdout machine-parsable
+ if args.json:
+     logging.disable(logging.CRITICAL)
+ else:
+     logging.basicConfig(level=logging.INFO, stream=sys.stderr)
+     print("Command line used: " + " ".join(sys.argv))
```
276-286: Unknown DB path: include error in JSON and avoid logging when --json

Return a machine-parseable error message and log only in non-JSON mode.

```diff
- logging.error(f"Cannot recognize the DB in {args.db_path}")
- if args.json:
+ if not args.json:
+     logging.error(f"Cannot recognize the DB in {args.db_path}")
+ if args.json:
      # In JSON mode, output minimal error info and exit
      result = {
          "query": args.query,
          "top_k": args.top_k,
          "threshold": args.threshold,
-         "nodes": []
+         "nodes": [],
+         "error": f"Cannot recognize the DB in {args.db_path}",
      }
      print(json.dumps(result, indent=2))
  exit(1)
```
267-267: Unused variable

`vector_store_type = args.vector_store_type` is assigned but not used. You can drop this line.
📜 Review details
Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro
💡 Knowledge Base configuration:
- MCP integration is disabled by default for public repositories
- Jira integration is disabled by default for public repositories
- Linear integration is disabled by default for public repositories
You can enable these sources in your CodeRabbit configuration.
📒 Files selected for processing (1)
scripts/query_rag.py (5 hunks)
🔇 Additional comments (5)
scripts/query_rag.py (5)
100-106: LGTM: JSON vs text output preserved

Good separation between JSON output and legacy textual prints. With the threshold/no-result logging addressed above, this path will be clean.
203-211: LGTM: Robust score extraction for llama-stack path

Nice defensive handling of score as either a float or an object with `.score`.
256-260: CLI UX: --json flag addition is clear and discoverable

Help text is concise and the flag name is intuitive.
195-202: Consistency note: JSON schema looks uniform across backends

The success payload shape (query/top_k/threshold/nodes) appears consistent between llama-index and llama-stack. Once the error payloads include an "error" field everywhere (as suggested), the contract will match the PR objective.

Would you like me to generate a JSON schema (and a quick jq validation script) to pin this contract in CI? A possible starting point is sketched below.
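For illustration, a minimal contract check could be as small as the following standalone script (the key sets are assumptions based on the payloads shown in this PR, not an existing artifact):

```python
import json
import sys

REQUIRED_TOP = {"query", "top_k", "threshold", "nodes"}
REQUIRED_NODE = {"id", "score", "text", "metadata"}

def check_contract(doc: dict) -> list:
    """Return human-readable contract violations; an empty list means the payload conforms."""
    problems = [f"missing top-level key: {key}" for key in REQUIRED_TOP - doc.keys()]
    for i, node in enumerate(doc.get("nodes", [])):
        problems += [f"nodes[{i}] missing key: {key}" for key in REQUIRED_NODE - node.keys()]
    return problems

if __name__ == "__main__":
    violations = check_contract(json.load(sys.stdin))
    for v in violations:
        print(v, file=sys.stderr)
    sys.exit(1 if violations else 0)
```

In CI this could be chained as, e.g., `python scripts/query_rag.py ... --json | python check_contract.py` (the script name is hypothetical).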
123-130: Metadata parity across backends

In the llama-stack path, result nodes have an empty "metadata". The PR objective mentions metadata fields like docs_url/title/url_reachable. If llama-stack provides these in res.metadata or via a follow-up call, consider adding them so both backends return comparable metadata.

Do you have a reliable source for chunk metadata in llama-stack responses we can surface here? If yes, I can propose a patch to plumb those through.

Also applies to: 147-163
lpiwowar
left a comment
Thanks for the updates! 🙏 Just a couple of small comments :)
- Add --json flag to enable structured JSON output format
- Maintain backward compatibility with existing plain text output
- Implement consistent JSON structure for all response types:
  * Successful queries return nodes array with metadata
  * Error cases return structured error messages
  * Include query parameters (top_k, threshold) in response
- Support JSON output for both Faiss and Llama-Stack vector stores
- Add proper error handling with JSON format
tisnik
left a comment
LGTM, thank you
lpiwowar
left a comment
LGTM! 👍 Thank you 🙏
There are some prints() from llama_index that pollute the stdout. Adding an option to save only the JSON to a file might help (see the sketch after the example below), but this can be done in a follow-up PR:
```
python scripts/query_rag.py --db-path ./vector_db/os_product_docs --product-index os-docs-2024.2 -m ./embeddings_model/ -q "What is Nova in OpenStack?" --json --threshold 0.2 2> result.error
LLM is explicitly disabled. Using MockLLM.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./vector_db/os_product_docs/docstore.json.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./vector_db/os_product_docs/index_store.json.
{
  "query": "What is Nova in OpenStack?",
  "top_k": 1,
  "threshold": 0.2,
  "nodes": [
    {
      "id": "00ab1db5-0c2b-438e-8001-7497428a1eec",
      "score": 0.8653656840324402,
      "text": "Technical Reference Deep Dives\n******************************\n\nThe nova project is large, and there are lots of complicated parts in\nit where it helps to have an overview to understand how the internals\nof a particular part work.",
      "metadata": {
        "docs_url": "https://docs.openstack.org/nova/2024.2_docs/reference/index.html",
        "title": "Technical Reference Deep Dives",
        "url_reachable": false
      }
    }
  ]
}
```
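As a possible follow-up, a hedged sketch of such an option (the `--output` flag and `emit_json` helper are hypothetical, not part of this PR):

```python
import argparse
import json

def emit_json(result, output_path=None):
    """Write the JSON payload to a file when requested; otherwise print to stdout."""
    text = json.dumps(result, indent=2)
    if output_path:
        with open(output_path, "w") as f:
            f.write(text + "\n")
    else:
        print(text)

# Wiring sketch: add alongside the script's existing CLI options.
parser = argparse.ArgumentParser()
parser.add_argument("-o", "--output", default=None,
                    help="Write the JSON result to this file instead of stdout")
```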
Related Tickets & Documents
https://issues.redhat.com/browse/OSPRH-18801
Testing
```
uv run python scripts/query_rag.py \
    -p vector_db/custom_docs/0.1 \
    -x custom_docs-0_1 \
    -m embeddings_model \
    -k 2 \
    -q "vector database" \
    --json
```
The updated usage/help output now includes:

```
[--vector-store-type {auto,faiss,llamastack-faiss,llamastack-sqlite-vec}]
[--json]
...
--vector-store-type   vector store type to be used.
--json                Output results in JSON format
```
Plain text format:

```
Command line used: scripts/query_rag.py -p ./test_container_output -x custom_docs-0_1 -m embeddings_model -k 2 -q vector database
LLM is explicitly disabled. Using MockLLM.
Load pretrained SentenceTransformer: embeddings_model
Loading faiss.
Successfully loaded faiss.
Failed to load GPU Faiss: name 'GpuIndexIVFFlat' is not defined. Will not load constructor refs for GPU indexes. This is only an error if you're trying to use GPU Faiss.
Loading llama_index.vector_stores.faiss.base from ./test_container_output/default__vector_store.json.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./test_container_output/docstore.json.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./test_container_output/index_store.json.
Loading indices with ids: ['custom_docs-0_1']
Batches: 100%|████████████████████████████████████████| 1/1 [00:00<00:00, 7.64it/s]
================================================================================
Node ID: 2c3a6a58-48eb-4229-896c-92c31f35580b
Text: Vector Database is an efficient way how to provide information
to LLM
Score: 0.677
================================================================================
Node ID: 9c226a5d-1885-4b46-8b36-8416cc6b0dae
Text: Lightspeed projects leverage vector databases for improved AI
responses
Score: 0.560
```
JSON format:
```
LLM is explicitly disabled. Using MockLLM.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./test_container_output/docstore.json.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./test_container_output/index_store.json.
{
  "query": "vector database",
  "top_k": 2,
  "threshold": 0.0,
  "nodes": [
    {
      "id": "2c3a6a58-48eb-4229-896c-92c31f35580b",
      "score": 0.6774420738220215,
      "text": "Vector Database is an efficient way how to provide information to LLM",
      "metadata": {
        "docs_url": "https://www.redhat.com",
        "title": "Vector Database is an efficient way how to provide information to LLM",
        "url_reachable": true
      }
    },
    {
      "id": "9c226a5d-1885-4b46-8b36-8416cc6b0dae",
      "score": 0.559581458568573,
      "text": "Lightspeed projects leverage vector databases for improved AI responses",
      "metadata": {
        "docs_url": "https://www.redhat.com",
        "title": "Lightspeed projects leverage vector databases for improved AI responses",
        "url_reachable": true
      }
    }
  ]
}
```
Different query (RAG systems):
```
LLM is explicitly disabled. Using MockLLM.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./test_container_output/docstore.json.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./test_container_output/index_store.json.
{
  "query": "RAG systems",
  "top_k": 1,
  "threshold": 0.0,
  "nodes": [
    {
      "id": "2d7ef837-88b6-45a4-acf7-aa6dfab91f80",
      "score": 0.5027890801429749,
      "text": "RAG systems use embeddings to find relevant context for queries",
      "metadata": {
        "docs_url": "https://www.redhat.com",
        "title": "RAG systems use embeddings to find relevant context for queries",
        "url_reachable": true
      }
    }
  ]
}
```
Error handling - invalid path:
```
ERROR: Cannot recognize the DB in ./nonexistent
```
Test with threshold (might trigger threshold error):
```
LLM is explicitly disabled. Using MockLLM.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./test_container_output/docstore.json.
Loading llama_index.core.storage.kvstore.simple_kvstore from ./test_container_output/index_store.json.
{
  "query": "vector database",
  "top_k": 1,
  "threshold": 0.95,
  "nodes": []
}
```