Guardrails: add rephrase support for input validators #783
Conversation

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID:
📒 Files selected for processing (2)
🚧 Files skipped from review as they are similar to previous changes (1)
📝 Walkthrough
Estimated code review effort: 🎯 3 (Moderate) | ⏱️ ~20 minutes

Possibly related PRs
🚥 Pre-merge checks: ✅ 4 passed | ❌ 1 failed (1 warning)
Actionable comments posted: 4
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (2)
backend/app/services/collections/helpers.py (1)
5-16: ⚠️ Potential issue | 🔴 Critical

Add postponed annotations to fix a runtime NameError for the CloudStorage type annotation.

The CloudStorage type is imported only under TYPE_CHECKING but used as a runtime annotation on line 70. Without postponed annotations, Python evaluates this annotation at function definition time, causing a NameError when the module is imported.

Proposed fix:

```diff
+from __future__ import annotations
+
 from typing import TYPE_CHECKING
 from uuid import UUID
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@backend/app/services/collections/helpers.py` around lines 5-16, add postponed evaluation of annotations so the runtime NameError for CloudStorage (which is only imported under TYPE_CHECKING) is avoided: insert "from __future__ import annotations" at the very top of the module (before other imports) so any function or parameter annotated with CloudStorage (the symbol imported only under TYPE_CHECKING and referenced later in the file) is treated as a string and not evaluated at import time; this will fix the NameError without changing the CloudStorage import or other type hints.

backend/app/services/llm/jobs.py (1)
427-458: ⚠️ Potential issue | 🟠 Major

Rephrased safe_text will include prompt-template scaffolding when returned directly.

At lines 427-430 the prompt template is interpolated into query.input.content.value before guardrails run at line 432. apply_input_guardrails then calls run_guardrails_validation(query.input.content.value, …) on that already-templated string, so when rephrase_needed=True the safe_text returned to the caller is a rephrase of template + user_input, not of the user's input. Returning that directly to the end user as the final "LLM" response (lines 447-458) will expose template boilerplate/instructions in the user-facing answer.

Consider one of:
- Run input guardrails against the raw user input (before template interpolation), and only interpolate the (possibly rephrased) value into the template for the LLM path.
- In the rephrase short-circuit, return the rephrased raw input rather than the templated-then-rephrased string.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@backend/app/services/llm/jobs.py` around lines 427-458, the guardrails are being run on the already-interpolated string (template applied to query.input.content.value) which causes rephrased "safe_text" to include template scaffolding; preserve the raw user input (e.g., save original = query.input.content.value before the template replace), call apply_input_guardrails/run_guardrails_validation on that raw value, and if guardrail_direct_response or rephrase_needed triggers, return the rephrased raw input (not the templated string) as the short-circuit LLM response (update the block that builds llm_response/guardrail_direct_response to use the saved original/rephrased value); only perform template interpolation (template.replace...) when proceeding to the actual LLM call path.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@backend/app/services/collections/create_collection.py`:
- Line 27: The job total calculation currently treats missing file_size_kb as 0
and under-reports total_size_mb; update the logic to call the imported helper
calculate_total_size_kb for documents that lack file_size_kb (or when
file_size_kb is falsy) and sum that returned value into total_size_kb before
converting to total_size_mb; apply this change wherever file_size_kb is
defaulted to 0 (references: calculate_total_size_kb, total_size_mb,
file_size_kb) so both the initial total computation and the later aggregation
use the helper instead of 0.
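The suggested aggregation could be sketched roughly like this (estimate_size_kb is a hypothetical stand-in for the calculate_total_size_kb helper, and plain dicts stand in for the document objects):

```python
def estimate_size_kb(doc: dict) -> float:
    # Hypothetical stand-in for calculate_total_size_kb; the real helper
    # presumably derives a size from the stored file.
    return 512.0

def total_size_mb(documents: list[dict]) -> float:
    total_kb = 0.0
    for doc in documents:
        size_kb = doc.get("file_size_kb")
        # Fall back to the helper instead of 0 when file_size_kb is
        # missing or falsy, so the job total is not under-reported.
        total_kb += size_kb if size_kb else estimate_size_kb(doc)
    return total_kb / 1024

print(total_size_mb([{"file_size_kb": 1024.0}, {"file_size_kb": None}, {}]))
```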
In `@backend/app/services/collections/helpers.py`:
- Line 106: Replace the current fallback expression "doc_size_kb =
doc.file_size_kb or 15 * 1024" with an explicit None check and a smaller safe
fallback to avoid treating 0.0 as missing and to prevent oversized batches; for
example set a constant SAFE_FALLBACK_KB = 5 * 1024 and compute "doc_size_kb =
doc.file_size_kb if doc.file_size_kb is not None else SAFE_FALLBACK_KB"
(referencing doc.file_size_kb and doc_size_kb in helpers.py).
- Around line 79-81: The log in calculate_total_size_kb prints the raw filename
(doc.fname); replace that with a masked value using the existing mask_string
utility so sensitive filenames are not logged. Update the logger.info call in
calculate_total_size_kb to keep the same prefix "[calculate_total_size_kb]" and
interpolate mask_string(doc.fname) instead of doc.fname (refer to logger.info,
calculate_total_size_kb, doc.fname, and mask_string to locate and change the
code).
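The pitfall behind the line-106 suggestion, Python's `or` treating a legitimate 0.0 as missing, can be illustrated in isolation (SAFE_FALLBACK_KB follows the constant the review proposes; both functions are illustrative sketches):

```python
SAFE_FALLBACK_KB = 5 * 1024  # smaller fallback constant suggested in the review

def size_with_or(file_size_kb):
    # Buggy variant: `or` falls back whenever the value is falsy,
    # so a genuine 0.0 KB size is silently replaced.
    return file_size_kb or SAFE_FALLBACK_KB

def size_with_none_check(file_size_kb):
    # Suggested variant: only a missing (None) value triggers the fallback.
    return file_size_kb if file_size_kb is not None else SAFE_FALLBACK_KB

print(size_with_or(0.0))           # 5120 — wrong for a genuinely empty file
print(size_with_none_check(0.0))   # 0.0
print(size_with_none_check(None))  # 5120
```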
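The masked-logging change for lines 79-81 could look like this sketch (this mask_string is a hypothetical implementation; the project's real utility is not shown in the review):

```python
import logging

logging.basicConfig(level=logging.INFO)
logger = logging.getLogger(__name__)

def mask_string(value: str, visible: int = 3) -> str:
    # Hypothetical masking utility: keep a short prefix, hide the rest.
    if len(value) <= visible:
        return "*" * len(value)
    return value[:visible] + "*" * (len(value) - visible)

# Same "[calculate_total_size_kb]" prefix, but the filename is masked.
logger.info("[calculate_total_size_kb] processing %s",
            mask_string("quarterly-report.pdf"))
```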
In `@backend/app/services/llm/jobs.py`:
- Around line 441-458: The early-return branch that handles
guardrail_direct_response builds a synthetic LLMResponse and returns a
BlockResult but skips creating any call/audit record and skips telemetry,
execution observation, output validation, and metadata propagation; update the
guardrail_direct_response path to (1) call the existing create_llm_call (or
create a minimal audit record) and set llm_call_id so calls are auditable, (2)
invoke record_llm_call_started and record_llm_call_finished (and
observe_llm_execution) around the synthetic response to preserve telemetry
traces, (3) run apply_output_guardrails on the guardrail_direct_response output
before packaging it into LLMResponse/BlockResult, (4) include
metadata=request_metadata on the returned BlockResult, and (5) replace the
synthetic provider="guardrail" and model="guardrail" with meaningful identifiers
or a documented constant used elsewhere so downstream analytics see consistent
provider/model values (refer to symbols: guardrail_direct_response, Usage,
LLMCallResponse, LLMResponse, TextOutput, TextContent, BlockResult,
create_llm_call, record_llm_call_started, record_llm_call_finished,
observe_llm_execution, apply_output_guardrails, request_metadata).
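The shape of the suggested short-circuit path can be sketched with toy stand-ins. Every function below and the GUARDRAIL_PROVIDER value are hypothetical substitutes for the real symbols the prompt lists; an in-memory list stands in for the audit store:

```python
GUARDRAIL_PROVIDER = "guardrails-service"  # documented constant, not "guardrail"

calls: list[dict] = []  # in-memory stand-in for the audit store

def create_llm_call(provider: str, model: str) -> int:
    calls.append({"provider": provider, "model": model, "events": []})
    return len(calls) - 1  # llm_call_id

def record_llm_call_started(call_id: int) -> None:
    calls[call_id]["events"].append("started")

def record_llm_call_finished(call_id: int) -> None:
    calls[call_id]["events"].append("finished")

def apply_output_guardrails(text: str) -> str:
    return text  # stand-in for real output validation

def guardrail_short_circuit(direct_response: str, request_metadata: dict) -> dict:
    # (1) create an audit record so the synthetic call is traceable
    call_id = create_llm_call(GUARDRAIL_PROVIDER, GUARDRAIL_PROVIDER)
    # (2) keep telemetry consistent with the normal LLM path
    record_llm_call_started(call_id)
    # (3) validate the guardrail output before packaging it
    safe_output = apply_output_guardrails(direct_response)
    record_llm_call_finished(call_id)
    # (4) propagate request metadata on the returned result
    return {"llm_call_id": call_id, "output": safe_output,
            "metadata": request_metadata}

print(guardrail_short_circuit("I can't help with that request.", {"trace_id": "t-1"}))
```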
---
Outside diff comments:
In `@backend/app/services/collections/helpers.py`:
- Around line 5-16: Add postponed evaluation of annotations so the runtime
NameError for CloudStorage (which is only imported under TYPE_CHECKING) is
avoided: insert "from __future__ import annotations" at the very top of the
module (before other imports) so any function or parameter annotated with
CloudStorage (the symbol imported only under TYPE_CHECKING and referenced later
in the file) is treated as a string and not evaluated at import time; this will
fix the NameError without changing the CloudStorage import or other type hints.
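The `from __future__ import annotations` pattern this prompt describes can be sketched as follows (the CloudStorage stub and upload_document function are hypothetical stand-ins, not the project's real code):

```python
from __future__ import annotations  # all annotations become lazily evaluated strings

from typing import TYPE_CHECKING

if TYPE_CHECKING:
    # Stand-in for the real CloudStorage import: a type checker sees this,
    # but at runtime TYPE_CHECKING is False and the name is never defined.
    class CloudStorage: ...

def upload_document(storage: CloudStorage, path: str) -> str:
    # Without the __future__ import, Python would evaluate the CloudStorage
    # annotation at function definition time and raise NameError on import.
    return f"uploaded {path}"

print(upload_document(None, "report.pdf"))
```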
In `@backend/app/services/llm/jobs.py`:
- Around line 427-458: The guardrails are being run on the already-interpolated
string (template applied to query.input.content.value) which causes rephrased
"safe_text" to include template scaffolding; preserve the raw user input (e.g.,
save original = query.input.content.value before the template replace), call
apply_input_guardrails/run_guardrails_validation on that raw value, and if
guardrail_direct_response or rephrase_needed triggers, return the rephrased raw
input (not the templated string) as the short-circuit LLM response (update the
block that builds llm_response/guardrail_direct_response to use the saved
original/rephrased value); only perform template interpolation
(template.replace...) when proceeding to the actual LLM call path.
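A minimal sketch of the ordering this prompt suggests, validating the raw input before any template interpolation (the template string and the rephrase behavior of run_guardrails_validation here are hypothetical stand-ins):

```python
TEMPLATE = "You are a helpful assistant. Answer this question:\n{user_input}"

def run_guardrails_validation(text: str) -> tuple[bool, str]:
    # Hypothetical validator: flags a banned word and returns a rephrase.
    if "secret" in text:
        return True, text.replace("secret", "[redacted]")
    return False, text

def build_llm_prompt(raw_user_input: str) -> tuple[bool, str]:
    # 1. Validate the RAW input, before template interpolation, so a
    #    rephrase can never contain prompt-template scaffolding.
    rephrase_needed, safe_text = run_guardrails_validation(raw_user_input)
    if rephrase_needed:
        # 2. Short-circuit: return the rephrased raw input directly.
        return True, safe_text
    # 3. Only on the real LLM path interpolate into the template.
    return False, TEMPLATE.format(user_input=safe_text)

print(build_llm_prompt("tell me the secret"))
print(build_llm_prompt("what is 2 + 2"))
```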
🪄 Autofix (Beta)
Fix all unresolved CodeRabbit comments on this PR:
- Push a commit to this branch (recommended)
- Create a new PR with the fixes
ℹ️ Review info
⚙️ Run configuration
Configuration used: defaults
Review profile: CHILL
Plan: Pro
Run ID: c0d8a039-a979-4c6f-8ad3-5f0954490395
📒 Files selected for processing (4)
backend/app/api/docs/documents/upload.md
backend/app/services/collections/create_collection.py
backend/app/services/collections/helpers.py
backend/app/services/llm/jobs.py
ea9434f to c2b39fc (Compare)
Codecov Report: ✅ All modified and coverable lines are covered by tests.
Summary
Target issue is #784
Notes
Summary by CodeRabbit
Release Notes
Improvements
Tests