RSPEED-2974: add privacy-safe RLSAPI observability logs by major · Pull Request #1646 · lightspeed-core/lightspeed-stack

major · 2026-04-30T15:04:11Z

Description

Adds INFO-level breadcrumbs around the RLSAPI v1 infer path so production can trace request progress without exposing user-provided data. The logs cover safe operational metadata only, including request IDs, quota state, shield outcome, model/provider selection, MCP tool counts, token counts, durations, and Splunk sourcetype.

Type of change

Tools used to create PR

Identify any AI code assistants used in this PR (for transparency and review context)

Assisted-by: N/A
Generated by: N/A

Related Tickets & Documents

Related Issue # RSPEED-2974
Closes # RSPEED-2974

Checklist before requesting a review

I have performed a self-review of my code.
PR has passed all pre-merge test jobs.
If it is a core feature, I have added thorough tests.

Testing

uv run black "src/app/endpoints/rlsapi_v1.py" "tests/unit/app/endpoints/test_rlsapi_v1.py"
uv run pytest tests/unit/app/endpoints/test_rlsapi_v1.py
uv run make verify

Note: uv run make format still fails on pre-existing unparsable demo docs files docs/demos/lcore/weak_points_for_ai/ex1.py and docs/demos/lcore/weak_points_for_ai/ex5.py; unrelated formatting changes from that attempt were reverted.

Summary by CodeRabbit

Release Notes

Bug Fixes
- Fixed issue where sensitive user information could appear in system error logs
- Improved error reporting with cleaner, more concise error messages
Improvements
- Enhanced logging throughout request processing for better operational visibility and diagnostics
- Better visibility into model selection, validation, and request completion stages

coderabbitai · 2026-04-30T15:04:25Z

Warning

Rate limit exceeded

@major has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 56 minutes and 20 seconds before requesting another review.

To keep reviews running without waiting, you can enable usage-based add-on for your organization. This allows additional reviews beyond the hourly cap. Account admins can enable it under billing.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: 1f18c678-b5a2-4f26-8226-c6c2e10fe888

📥 Commits

Reviewing files that changed from the base of the PR and between 8e7ab18 and db59c76.

📒 Files selected for processing (2)

src/app/endpoints/rlsapi_v1.py
tests/unit/app/endpoints/test_rlsapi_v1.py

Walkthrough

The changes add extensive info-level logging throughout the rlsapi v1 endpoint to track request handling stages, model selection, moderation decisions, inference completion, and quota enforcement. Error-to-HTTP mapping logging was adjusted to log only exception type names instead of full exception objects. Two new tests verify logging behavior and ensure sensitive data is not leaked in error logs.

Changes

Cohort / File(s)	Summary
Request Handling & Logging Enhancements `src/app/endpoints/rlsapi_v1.py`	Added extensive info-level logging for model selection/validation, Splunk event queuing, shield moderation decisions, inference failure recording, quota enforcement, LLM call stages with token counts, and final completion metrics. Adjusted error-to-HTTP mapping to log only exception type names. Expanded pylint disable annotation for `infer_endpoint`.
Test Suite for Logging & Error Handling `tests/unit/app/endpoints/test_rlsapi_v1.py`	Added `APIStatusError` import and fixture. Implemented two new async tests verifying `/infer` info logs capture processing/completion messages while filtering user-provided secrets from question, stdin, attachments, and terminal output. Verified exception logging includes class name without leaking backend prompt text.

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~22 minutes

🚥 Pre-merge checks | ✅ 5

✅ Passed checks (5 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Title check	✅ Passed	The PR title accurately summarizes the main change: adding privacy-safe logging to RLSAPI observability.
Docstring Coverage	✅ Passed	Docstring coverage is 100.00% which is sufficient. The required threshold is 80.00%.
Linked Issues check	✅ Passed	Check skipped because no linked issues were found for this pull request.
Out of Scope Changes check	✅ Passed	Check skipped because no linked issues were found for this pull request.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests

✨ Simplify code

Create PR with simplified code

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Review rate limit: 0/1 reviews remaining, refill in 56 minutes and 20 seconds.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

coderabbitai

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)

src/app/endpoints/rlsapi_v1.py (1)
475-488: ⚠️ Potential issue | 🟠 Major | ⚡ Quick win

Redact the exception payload before queueing the failure event.

str(error) can still contain backend-echoed prompt text. The new mock_api_status_error_with_private_text test fixture proves that an APIStatusError may embed PRIVATE prompt sk-backend-secret, and this path forwards that raw string into _queue_splunk_event(...). That leaves the logger sanitized, but not the observability event.
🔒 Suggested fix
     _queue_splunk_event(
         background_tasks,
         infer_request,
         request,
         request_id,
-        str(error),
+        type(error).__name__,
         inference_time,
         "infer_error",
     )
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/app/endpoints/rlsapi_v1.py` around lines 475 - 488, The exception string
passed into _queue_splunk_event may contain private prompt text (e.g.,
APIStatusError with "PRIVATE prompt sk-backend-secret"), so before calling
_queue_splunk_event replace or sanitize sensitive substrings in str(error) (for
example via a helper like redact_sensitive_info or a short regex that removes
"PRIVATE" blocks and sk- tokens) and pass the redacted string instead; update
the call site using _queue_splunk_event(..., redacted_error, ...) referencing
the variables infer_request and request_id to preserve context.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Outside diff comments:
In `@src/app/endpoints/rlsapi_v1.py`:
- Around line 475-488: The exception string passed into _queue_splunk_event may
contain private prompt text (e.g., APIStatusError with "PRIVATE prompt
sk-backend-secret"), so before calling _queue_splunk_event replace or sanitize
sensitive substrings in str(error) (for example via a helper like
redact_sensitive_info or a short regex that removes "PRIVATE" blocks and sk-
tokens) and pass the redacted string instead; update the call site using
_queue_splunk_event(..., redacted_error, ...) referencing the variables
infer_request and request_id to preserve context.

ℹ️ Review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: ae9af949-e0da-4b1f-8112-4d8399b0ab21

📥 Commits

Reviewing files that changed from the base of the PR and between aa41ccd and 8e7ab18.

📒 Files selected for processing (2)

src/app/endpoints/rlsapi_v1.py
tests/unit/app/endpoints/test_rlsapi_v1.py

📜 Review details

⏰ Context from checks skipped due to timeout of 90000ms. You can increase the timeout in your CodeRabbit configuration to a maximum of 15 minutes (900000ms). (14)

GitHub Check: E2E: library mode / ci / group 2
GitHub Check: E2E: server mode / ci / group 1
GitHub Check: Pylinter
GitHub Check: E2E: server mode / ci / group 3
GitHub Check: E2E: server mode / ci / group 2
GitHub Check: E2E: library mode / ci / group 3
GitHub Check: E2E Tests for Lightspeed Evaluation job
GitHub Check: E2E: library mode / ci / group 1
GitHub Check: build-pr
GitHub Check: integration_tests (3.12)
GitHub Check: unit_tests (3.12)
GitHub Check: list_outdated_dependencies
GitHub Check: unit_tests (3.13)
GitHub Check: spectral

🧰 Additional context used

📓 Path-based instructions (5)

tests/unit/**/*.py