feat(observability): add chat completion genai span attributes (#65)
Conversation
No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID:
📒 Files selected for processing (1)
🚧 Files skipped from review as they are similar to previous changes (1)
📝 Walkthrough

Expose primary-key reads for entity stores; add Model→Provider resolution and a ProvidersStore accessor; introduce Provider semantic metadata and provider-specific overrides; add extensive chat-completion span telemetry, a streaming output collector, and WithSpan propagation; change the provider instance API to accept provider entries; add tracing in middlewares.

Changes
Sequence Diagram(s)

```mermaid
sequenceDiagram
    participant Client
    participant ProxyHandler as Proxy
    participant Resources as ResourceRegistry
    participant ModelEntry as Model
    participant ProviderEntry as Provider
    participant Gateway
    participant Span as FastraceSpan
    Client->>ProxyHandler: HTTP chat-completion request
    ProxyHandler->>Resources: lookup model entry
    Resources-->>ProxyHandler: ResourceEntry<Model>
    ProxyHandler->>ModelEntry: model.provider(resources)
    ModelEntry-->>ProxyHandler: Option<ResourceEntry<Provider>>
    alt provider found
        ProxyHandler->>Span: create span (aisix.llm.chat_completion)
        ProxyHandler->>WithSpan: wrap gateway call with span
        ProxyHandler->>Gateway: invoke provider via provider entry (streaming)
        Gateway-->>ProxyHandler: stream chunks
        ProxyHandler->>ProxyHandler: record_chunk → StreamOutputCollector
        ProxyHandler-->>Span: apply request/response/chunk/usage properties
        ProxyHandler-->>Client: streamed SSE/response
        ProxyHandler->>Span: end span on EOS
    else provider missing
        ProxyHandler-->>Client: GatewayError::Internal (provider not found)
    end
```
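As a rough companion to the diagram, this is what creating and enriching the chat-completion span might look like with fastrace. The span name is taken from the diagram above; the specific GenAI attribute keys and the model value are illustrative assumptions, not the handler's actual property set.

```rust
// Illustrative only: mirrors the span lifecycle in the diagram, not the real
// handler code. Attribute keys follow the OpenTelemetry GenAI conventions;
// the model name is a made-up example value.
use fastrace::Span;
use fastrace::prelude::SpanContext;

fn record_chat_completion_span() {
    let mut span = Span::root("aisix.llm.chat_completion", SpanContext::random());
    span.add_properties(|| {
        [
            ("gen_ai.operation.name", "chat".to_string()),
            ("gen_ai.request.model", "example-model".to_string()),
        ]
    });
    // Per-chunk and usage properties would be appended while the stream is
    // consumed; dropping the span at end-of-stream is what ends it.
    drop(span);
}
```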
Estimated code review effort: 🎯 4 (Complex) | ⏱️ ~50 minutes

Possibly related PRs
Important: Pre-merge checks failed. Please resolve all errors before merging. Addressing warnings is optional.

❌ Failed checks (1 error, 1 warning)
✅ Passed checks (4 passed)
Actionable comments posted: 6
Caution
Some comments are outside the diff and can’t be posted inline due to platform limitations.
⚠️ Outside diff range comments (1)
src/proxy/handlers/chat_completions/span_attributes/tests.rs (1)
1-252: 🛠️ Refactor suggestion | 🟠 Major

Switch this new test module to `pretty_assertions::assert_eq`.

Please import `pretty_assertions::assert_eq` so all equality checks in this new file follow the repo's test-output standard.

Suggested patch

```diff
 use serde_json::{Value, json};
+use pretty_assertions::assert_eq;
```

As per coding guidelines: `{tests,src}/**/*.rs`: "Use pretty_assertions::assert_eq and assert_matches::assert_matches for better test output".

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@src/proxy/handlers/chat_completions/span_attributes/tests.rs` around lines 1 - 252, The tests use standard assert_eq which must be replaced by the repo standard pretty_assertions; import the macro by adding use pretty_assertions::assert_eq; at the top of this test module (near the existing use serde_json::{Value, json};) so all equality checks in request_span_properties_include_message_tool_and_user_attributes, response_span_properties_include_output_messages_and_reasoning_tokens, and stream_output_collector_accumulates_chunks_into_output_messages use pretty_assertions::assert_eq for improved test diffs.
🧹 Nitpick comments (5)
src/gateway/traits/chat_format.rs (1)
29-30: Use `#[allow(dead_code)]` instead of `#[allow(unused)]` on the trait method.

`#[allow(unused)]` is overly broad and can hide unrelated warnings. The `dead_code` lint is the appropriate suppression for an unused associated function declaration.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@src/gateway/traits/chat_format.rs` around lines 29 - 30, The trait method declaration for fn name() -> &'static str currently uses #[allow(unused)]; change this to #[allow(dead_code)] to narrowly suppress the unused-associated-function lint. Locate the associated function declaration named name in the trait in chat_format.rs and replace the attribute so the suppression targets dead_code rather than the broader unused lint.

src/proxy/middlewares/trace.rs (1)
53-66: Consider whether error handling triggers premature span closure.

When `Poll::Ready(Some(Err(_)))` occurs, `on_eos()` is called, which takes the span. However, this may not be the actual end of the stream from the framework's perspective; the error is returned to the caller. If there is any subsequent polling (unlikely, but possible depending on framework behavior), the span will already have been consumed.

This is likely fine in practice, since an error typically terminates the stream, but it is worth noting for robustness.
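For illustration, a non-consuming error hook could look roughly like the sketch below. `TracedStream` and `on_error` are hypothetical stand-ins for the middleware's actual body wrapper, shown here over a plain `futures::Stream` rather than the real HTTP body type.

```rust
// Sketch only: the real middleware wraps an HTTP response body, not this toy
// Stream, and `on_error` is a proposed new method rather than existing code.
use std::pin::Pin;
use std::task::{Context, Poll};

use fastrace::Span;
use futures::Stream;

struct TracedStream<S> {
    inner: Pin<Box<S>>,
    span: Option<Span>,
}

impl<S> TracedStream<S> {
    fn on_error(&mut self, err: &std::io::Error) {
        // Record the failure without consuming the span, so later polls (or the
        // final end-of-stream) can still close it exactly once.
        if let Some(span) = self.span.as_mut() {
            span.add_property(|| ("error.type", err.to_string()));
        }
    }

    fn on_eos(&mut self) {
        // Taking and dropping the span is what actually ends it.
        drop(self.span.take());
    }
}

impl<S, T> Stream for TracedStream<S>
where
    S: Stream<Item = Result<T, std::io::Error>>,
{
    type Item = Result<T, std::io::Error>;

    fn poll_next(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Option<Self::Item>> {
        let this = self.get_mut();
        match this.inner.as_mut().poll_next(cx) {
            Poll::Ready(Some(Err(err))) => {
                this.on_error(&err); // record, do not close
                Poll::Ready(Some(Err(err)))
            }
            Poll::Ready(None) => {
                this.on_eos(); // close exactly once at end of stream
                Poll::Ready(None)
            }
            other => other,
        }
    }
}
```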
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@src/proxy/middlewares/trace.rs` around lines 53 - 66, The code currently calls self.on_eos() for Poll::Ready(Some(Err(_))) which may prematurely consume the tracing span; change the error branch to avoid closing the span on a single Err result—either call a new non-consuming handler like self.on_error(err) (create on_error in the same impl) or simply log the error without calling on_eos, and only call self.on_eos() on Poll::Ready(None); update the match arm for Poll::Ready(Some(Err(e))) to forward the error to the caller or to on_error(e) instead of on_eos(), keeping on_body_chunk and on_eos behavior unchanged.

src/proxy/handlers/chat_completions/mod.rs (1)
159-179: Consider extracting stream state into a named struct for readability.

The 8-tuple state is functionally correct but harder to maintain. A named struct would improve clarity.
♻️ Optional: Extract to named struct
```rust
struct StreamState {
    stream: ChatResponseStream<OpenAIChatFormat>,
    span: Span,
    chunk_index: usize,
    request_ctx: RequestContext,
    done: bool,
    saw_chunk: bool,
    usage_rx: Option<oneshot::Receiver<Usage>>,
    output_collector: StreamOutputCollector,
}
```

This would make the unfold closure parameters and return values more self-documenting.
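As a generic illustration of the pattern (not the handler's real types), a named state struct keeps `futures::stream::unfold` readable compared to a positional tuple:

```rust
// Toy example: CountState stands in for the handler's StreamState; the point
// is only that named fields document the unfold state better than a tuple.
use futures::StreamExt;

struct CountState {
    remaining: usize,
    emitted: usize,
}

#[tokio::main]
async fn main() {
    let state = CountState { remaining: 3, emitted: 0 };
    let numbers = futures::stream::unfold(state, |mut st| async move {
        if st.remaining == 0 {
            return None;
        }
        st.remaining -= 1;
        st.emitted += 1;
        Some((st.emitted, st))
    });
    let collected: Vec<usize> = numbers.collect().await;
    assert_eq!(collected, vec![1, 2, 3]);
}
```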
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@src/proxy/handlers/chat_completions/mod.rs` around lines 159 - 179, The closure passed to futures::stream::unfold that builds sse_stream currently uses an 8-tuple state which is hard to read; create a named struct (e.g., StreamState) containing the fields stream (ChatResponseStream<OpenAIChatFormat>), span (Span), chunk_index (usize), request_ctx (RequestContext), done (bool), saw_chunk (bool), usage_rx (Option<oneshot::Receiver<Usage>>), and output_collector (StreamOutputCollector), replace the tuple in the unfold call and in the closure signature with this struct, and adjust all usages inside the closure and the initial state construction to use the struct fields for clarity.

src/proxy/handlers/chat_completions/span_attributes/telemetry.rs (1)
1-15: Import ordering checked against the coding guidelines.

```rust
use fastrace::prelude::Span;
use reqwest::Url;
use serde_json::{Map, Value};
```

The current order is already alphabetical within external crates (fastrace < reqwest < serde_json), so this is correct and no change is needed.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@src/proxy/handlers/chat_completions/span_attributes/telemetry.rs` around lines 1 - 15, The import ordering for the external crates is already correct (fastrace, reqwest, serde_json); leave the use statements as-is and ensure they remain alphabetically sorted (fastrace, reqwest, serde_json) in telemetry.rs so no change is required to the current imports.

src/proxy/handlers/chat_completions/span_attributes/message_attributes.rs (1)
225-235: Minor: Unnecessary intermediate vector allocation.

The intermediate `Vec` (lines 230-233) clones each message unnecessarily. Consider passing an iterator or borrowing directly.

♻️ Optional: Avoid intermediate allocation
```diff
 pub(super) fn append_openinference_output_message_properties(
     properties: &mut Vec<(String, String)>,
     prefix: &str,
     messages: &[OutputMessageView],
 ) {
-    let message_views: Vec<_> = messages
-        .iter()
-        .map(|message| message.message.clone())
-        .collect();
-    append_openinference_message_properties(properties, prefix, &message_views);
+    for (message_index, message) in messages.iter().enumerate() {
+        append_single_message_properties(properties, prefix, message_index, &message.message);
+    }
 }
```

Alternatively, refactor `append_openinference_message_properties` to accept `impl Iterator<Item = &MessageView>`.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@src/proxy/handlers/chat_completions/span_attributes/message_attributes.rs` around lines 225 - 235, The helper append_openinference_output_message_properties currently allocates and clones each OutputMessageView.message into a temporary Vec, which is unnecessary; instead avoid cloning by passing borrowed MessageView references to append_openinference_message_properties — either change append_openinference_message_properties to accept impl Iterator<Item=&MessageView> (preferred) and forward messages.iter().map(|m| &m.message) directly, or build a Vec<&MessageView> (messages.iter().map(|m| &m.message).collect::<Vec<_>>()) and pass that slice to the existing function; update references to OutputMessageView and MessageView accordingly.
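A self-contained sketch of the iterator-accepting variant preferred above; `Message` and `OutputMessage` here are simplified stand-ins for the crate's `MessageView` and `OutputMessageView`, and the property keys are illustrative.

```rust
// Simplified stand-ins for the crate's view types; only the borrowing pattern
// (no temporary Vec of cloned messages) is the point here.
struct Message {
    role: String,
    content: String,
}

struct OutputMessage {
    message: Message,
}

fn append_message_properties<'a>(
    properties: &mut Vec<(String, String)>,
    prefix: &str,
    messages: impl IntoIterator<Item = &'a Message>,
) {
    for (index, message) in messages.into_iter().enumerate() {
        properties.push((format!("{prefix}.{index}.message.role"), message.role.clone()));
        properties.push((format!("{prefix}.{index}.message.content"), message.content.clone()));
    }
}

fn append_output_message_properties(
    properties: &mut Vec<(String, String)>,
    prefix: &str,
    messages: &[OutputMessage],
) {
    // Borrow the inner messages instead of cloning them into a temporary Vec.
    append_message_properties(properties, prefix, messages.iter().map(|m| &m.message));
}

fn main() {
    let outputs = vec![OutputMessage {
        message: Message { role: "assistant".into(), content: "hi".into() },
    }];
    let mut props = Vec::new();
    append_output_message_properties(&mut props, "llm.output_messages", &outputs);
    assert_eq!(props.len(), 2);
}
```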
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@src/config/entities/models.rs`:
- Around line 38-42: The public helper Model::provider should have a doc comment
and tracing; add a one-line doc comment (/// Returns the provider resource for
this model, if present.) above the method and annotate the function with
#[fastrace::trace], keeping the signature fn provider(&self, resources:
&ResourceRegistry) -> Option<ResourceEntry<Provider>> and the existing body that
calls resources.providers.get_by_id(&self.provider_id).
In `@src/gateway/traits/provider.rs`:
- Around line 324-327: The new test added uses the standard assert_eq; replace
it with pretty_assertions::assert_eq by adding an import for
pretty_assertions::assert_eq at the top of the test module and update the
assertion calls in the new test path (the block that inspects ProviderMeta /
ProviderSemanticConventions / StreamReaderKind and uses
ChatTransform/EmbedTransform/CompatQuirks) to call pretty_assertions::assert_eq
instead of the std macro so test failures show improved diffs.
- Around line 48-53: Add doc comments (///) to the public struct
ProviderSemanticConventions and to the public method
ProviderMeta::semantic_conventions() describing their purpose and usage; include
short field-level descriptions for gen_ai_provider_name, llm_system, and
llm_provider on ProviderSemanticConventions, and document what
ProviderMeta::semantic_conventions() returns and when it may be None or Some, so
the public API is properly documented per Rust doc guidelines.
In `@src/proxy/handlers/chat_completions/mod.rs`:
- Around line 99-102: The code incorrectly sets an error attribute on a
successful stream branch: when matching Ok(Ok(ChatResponse::Stream { stream,
usage_rx })), remove the LocalSpan::add_property(|| ("error.type", "no_stream"))
call so successful handling via handle_stream_request(stream, usage_rx, &mut
request_ctx, span).await does not record an error; keep any span handling but
delete that erroneous error.type assignment in the ChatResponse::Stream branch.
In `@src/proxy/provider.rs`:
- Around line 14-17: Add the missing #[fastrace::trace] attribute to the public
function create_provider_instance and update its doc comment to reference
"provider" rather than "model"; locate the function declaration for
create_provider_instance and prepend the #[fastrace::trace] decorator, and edit
the preceding /// doc comment text to replace any occurrences of "model" with
"provider" so docs and tracing follow project guidelines.
In `@src/utils/future.rs`:
- Around line 28-39: The poll implementation can panic because it calls unwrap()
on this.span which is an Option; update the WithSpan design to guarantee a Span
is present: either (preferred) change the field type from Option<Span> to Span
(update the WithSpan struct, all constructors/usages, and remove Option handling
in impl Future for WithSpan so poll uses this.span.set_local_parent() and
returns the owned Span directly), or if Option must remain, add a constructor
new(inner, span: Span) that stores Some(span) and replace unwrap() in poll with
a safe handling (e.g., expect with a clear message or return Poll::Ready((val,
this.span.take().expect("WithSpan must be constructed with Some(span)")))) and
document the invariant so external code cannot create WithSpan with span ==
None; adjust any call sites referencing WithSpan::span accordingly.
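A minimal sketch of the second option in the item above (keeping `Option<Span>` but enforcing the invariant through a `new(inner, span)` constructor), assuming fastrace's `Span` and the pin-project-lite crate; the real `src/utils/future.rs` may differ.

```rust
// Sketch only: names mirror the agent prompt above, not the actual file.
use std::future::Future;
use std::pin::Pin;
use std::task::{Context, Poll};

use fastrace::Span;
use pin_project_lite::pin_project;

pin_project! {
    // Runs the inner future with `span` as the local parent and yields the
    // span back alongside the output so the caller can keep enriching it.
    pub struct WithSpan<F> {
        #[pin]
        inner: F,
        // Invariant: always `Some` until completion; `new` is the only
        // constructor, so callers can never observe `None`.
        span: Option<Span>,
    }
}

impl<F> WithSpan<F> {
    pub fn new(inner: F, span: Span) -> Self {
        Self { inner, span: Some(span) }
    }
}

impl<F: Future> Future for WithSpan<F> {
    type Output = (F::Output, Span);

    fn poll(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Self::Output> {
        let this = self.project();
        let _guard = this
            .span
            .as_ref()
            .expect("WithSpan must be constructed via WithSpan::new")
            .set_local_parent();
        match this.inner.poll(cx) {
            Poll::Ready(value) => Poll::Ready((
                value,
                this.span.take().expect("WithSpan polled after completion"),
            )),
            Poll::Pending => Poll::Pending,
        }
    }
}
```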
---
Outside diff comments:
In `@src/proxy/handlers/chat_completions/span_attributes/tests.rs`:
- Around line 1-252: The tests use standard assert_eq which must be replaced by
the repo standard pretty_assertions; import the macro by adding use
pretty_assertions::assert_eq; at the top of this test module (near the existing
use serde_json::{Value, json};) so all equality checks in
request_span_properties_include_message_tool_and_user_attributes,
response_span_properties_include_output_messages_and_reasoning_tokens, and
stream_output_collector_accumulates_chunks_into_output_messages use
pretty_assertions::assert_eq for improved test diffs.
---
Nitpick comments:
In `@src/gateway/traits/chat_format.rs`:
- Around line 29-30: The trait method declaration for fn name() -> &'static str
currently uses #[allow(unused)]; change this to #[allow(dead_code)] to narrowly
suppress the unused-associated-function lint. Locate the associated function
declaration named name in the trait in chat_format.rs and replace the attribute
so the suppression targets dead_code rather than the broader unused lint.
In `@src/proxy/handlers/chat_completions/mod.rs`:
- Around line 159-179: The closure passed to futures::stream::unfold that builds
sse_stream currently uses an 8-tuple state which is hard to read; create a named
struct (e.g., StreamState) containing the fields stream
(ChatResponseStream<OpenAIChatFormat>), span (Span), chunk_index (usize),
request_ctx (RequestContext), done (bool), saw_chunk (bool), usage_rx
(Option<oneshot::Receiver<Usage>>), and output_collector
(StreamOutputCollector), replace the tuple in the unfold call and in the closure
signature with this struct, and adjust all usages inside the closure and the
initial state construction to use the struct fields for clarity.
In `@src/proxy/handlers/chat_completions/span_attributes/message_attributes.rs`:
- Around line 225-235: The helper append_openinference_output_message_properties
currently allocates and clones each OutputMessageView.message into a temporary
Vec, which is unnecessary; instead avoid cloning by passing borrowed MessageView
references to append_openinference_message_properties — either change
append_openinference_message_properties to accept impl
Iterator<Item=&MessageView> (preferred) and forward messages.iter().map(|m|
&m.message) directly, or build a Vec<&MessageView> (messages.iter().map(|m|
&m.message).collect::<Vec<_>>()) and pass that slice to the existing function;
update references to OutputMessageView and MessageView accordingly.
In `@src/proxy/handlers/chat_completions/span_attributes/telemetry.rs`:
- Around line 1-15: The import ordering for the external crates is already
correct (fastrace, reqwest, serde_json); leave the use statements as-is and
ensure they remain alphabetically sorted (fastrace, reqwest, serde_json) in
telemetry.rs so no change is required to the current imports.
In `@src/proxy/middlewares/trace.rs`:
- Around line 53-66: The code currently calls self.on_eos() for
Poll::Ready(Some(Err(_))) which may prematurely consume the tracing span; change
the error branch to avoid closing the span on a single Err result—either call a
new non-consuming handler like self.on_error(err) (create on_error in the same
impl) or simply log the error without calling on_eos, and only call
self.on_eos() on Poll::Ready(None); update the match arm for
Poll::Ready(Some(Err(e))) to forward the error to the caller or to on_error(e)
instead of on_eos(), keeping on_body_chunk and on_eos behavior unchanged.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: 168b96de-aab2-4d8c-b21b-5f45bf9ebea8
📒 Files selected for processing (23)
src/config/entities/mod.rs
src/config/entities/models.rs
src/config/entities/providers.rs
src/gateway/providers/azure.rs
src/gateway/providers/bedrock.rs
src/gateway/providers/gemini.rs
src/gateway/traits/chat_format.rs
src/gateway/traits/mod.rs
src/gateway/traits/provider.rs
src/proxy/handlers/chat_completions/mod.rs
src/proxy/handlers/chat_completions/span_attributes/message_attributes.rs
src/proxy/handlers/chat_completions/span_attributes/mod.rs
src/proxy/handlers/chat_completions/span_attributes/stream_output.rs
src/proxy/handlers/chat_completions/span_attributes/telemetry.rs
src/proxy/handlers/chat_completions/span_attributes/tests.rs
src/proxy/handlers/embeddings/mod.rs
src/proxy/handlers/messages/mod.rs
src/proxy/hooks/authorization/mod.rs
src/proxy/middlewares/auth.rs
src/proxy/middlewares/trace.rs
src/proxy/provider.rs
src/utils/future.rs
src/utils/observability/mod.rs
🧹 Nitpick comments (1)
src/proxy/handlers/chat_completions/mod.rs (1)
158-178: Consider extracting the 8-tuple state into a named struct.

The unfold state has grown to an 8-element tuple, making it harder to reason about which element is which. A named struct would improve readability and reduce the risk of accidental reordering.
♻️ Example struct definition
```rust
struct StreamUnfoldState {
    stream: ChatResponseStream<OpenAIChatFormat>,
    span: Span,
    chunk_idx: usize,
    request_ctx: RequestContext,
    done: bool,
    saw_chunk: bool,
    usage_rx: Option<oneshot::Receiver<Usage>>,
    output_collector: StreamOutputCollector,
}
```

This would make the state transitions clearer and reduce the cognitive load when reading the destructuring patterns.
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@src/proxy/handlers/chat_completions/mod.rs` around lines 158 - 178, The 8-tuple state passed to futures::stream::unfold is hard to read; extract it into a named struct (e.g., StreamUnfoldState) that contains fields for stream (ChatResponseStream<OpenAIChatFormat>), span (Span), chunk_idx/idx (usize), request_ctx/stream_request_ctx (RequestContext), done (bool), saw_chunk (bool), usage_rx (Option<oneshot::Receiver<Usage>>), and output_collector (StreamOutputCollector), then update the unfold call to initialize that struct and adjust the async closure to destructure/consume StreamUnfoldState instead of the tuple so all state access is clear and ordering mistakes are avoided.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Nitpick comments:
In `@src/proxy/handlers/chat_completions/mod.rs`:
- Around line 158-178: The 8-tuple state passed to futures::stream::unfold is
hard to read; extract it into a named struct (e.g., StreamUnfoldState) that
contains fields for stream (ChatResponseStream<OpenAIChatFormat>), span (Span),
chunk_idx/idx (usize), request_ctx/stream_request_ctx (RequestContext), done
(bool), saw_chunk (bool), usage_rx (Option<oneshot::Receiver<Usage>>), and
output_collector (StreamOutputCollector), then update the unfold call to
initialize that struct and adjust the async closure to destructure/consume
StreamUnfoldState instead of the tuple so all state access is clear and ordering
mistakes are avoided.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: ae11bbc8-b350-4d63-a3a2-b4c929475e3a
📒 Files selected for processing (4)
src/config/entities/models.rs
src/gateway/traits/provider.rs
src/proxy/handlers/chat_completions/mod.rs
src/proxy/provider.rs
Actionable comments posted: 2
🧹 Nitpick comments (1)
src/proxy/handlers/chat_completions/mod.rs (1)
43-47: Public handler is missing required trace decorator and doc comment.

Please add `#[fastrace::trace]` and a `///` doc comment on `chat_completions`.

📝 Suggested minimal change
```diff
+/// Handle OpenAI-compatible chat completion requests and emit request-scoped telemetry.
+#[fastrace::trace]
 pub async fn chat_completions(
```

As per coding guidelines:
`src/**/*.rs`: Use `#[fastrace::trace]` decorator for distributed tracing on public functions; and `**/*.rs`: Use `///` for doc comments on public items in Rust.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@src/proxy/handlers/chat_completions/mod.rs` around lines 43 - 47, Add a doc comment and the distributed-tracing attribute to the public handler function chat_completions: prepend a triple-slash doc comment (///) describing the handler's purpose and then add the #[fastrace::trace] attribute above the fn signature for chat_completions (the function taking State(state): State<AppState>, mut request_ctx: RequestContext, Json(mut request_data): Json<ChatCompletionRequest>) and returning Result<Response, ChatCompletionError> so it conforms to the project's tracing and documentation guidelines.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Inline comments:
In `@src/proxy/handlers/chat_completions/mod.rs`:
- Around line 231-239: The code currently calls span.add_properties(||
output_collector.output_message_span_properties()) which exports full streamed
messages/tool-call arguments to trace attributes; change this so
span.add_properties is only called when a config flag (e.g.,
config.export_streamed_output or runtime_settings.export_streamed_output) is
true, and before attaching ensure you sanitize the data by calling a
redaction/truncation helper (e.g.,
output_collector.redact_and_truncate_message_properties() or a new method that
returns sanitized span properties) instead of output_message_span_properties;
apply the same gated/sanitized logic in both places where span.add_properties is
used so raw sensitive content is never exported by default.
- Around line 95-108: The match arms in chat_completions should attach terminal
attributes directly to the resolved `span` instead of calling LocalSpan::add_*
so change the Complete/Gateway-error/Timeout branches to call
span.add_properties(...) and span.add_property(...) (use the existing `span`
variable) and convert the Err(Err) timeout mapping to use span before returning;
also add #[fastrace::trace] and the required /// doc comment to the public
chat_completions handler and add #[fastrace::trace] to the internal
handle_regular_request and handle_stream_request functions so traces are
created; finally stop recording raw model outputs into spans by removing or
redacting calls to output_collector.output_message_span_properties() (lines that
add full text/tool calls/args) and replace them with non-sensitive
summary/redacted properties instead.
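To make the last two items concrete, a gating-and-sanitizing layer might look roughly like this; `TelemetrySettings`, `export_streamed_output`, and `redacted_output_properties` are hypothetical names, not existing configuration or APIs in the crate.

```rust
// Hypothetical sketch: names and the truncation policy are assumptions; only
// span.add_properties is the real fastrace call.
use fastrace::Span;

struct TelemetrySettings {
    // Off by default so raw streamed content is never exported unless opted in.
    export_streamed_output: bool,
    max_attribute_chars: usize,
}

fn redacted_output_properties(
    raw: Vec<(String, String)>,
    max_chars: usize,
) -> Vec<(String, String)> {
    raw.into_iter()
        .map(|(key, value)| {
            if value.chars().count() > max_chars {
                let truncated: String = value.chars().take(max_chars).collect();
                (key, format!("{truncated}…[truncated]"))
            } else {
                (key, value)
            }
        })
        .collect()
}

fn attach_output_properties(
    span: &mut Span,
    settings: &TelemetrySettings,
    raw: Vec<(String, String)>,
) {
    if !settings.export_streamed_output {
        return; // never export raw streamed content by default
    }
    let sanitized = redacted_output_properties(raw, settings.max_attribute_chars);
    span.add_properties(|| sanitized);
}
```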
---
Nitpick comments:
In `@src/proxy/handlers/chat_completions/mod.rs`:
- Around line 43-47: Add a doc comment and the distributed-tracing attribute to
the public handler function chat_completions: prepend a triple-slash doc comment
(///) describing the handler's purpose and then add the #[fastrace::trace]
attribute above the fn signature for chat_completions (the function taking
State(state): State<AppState>, mut request_ctx: RequestContext, Json(mut
request_data): Json<ChatCompletionRequest>) and returning Result<Response,
ChatCompletionError> so it conforms to the project's tracing and documentation
guidelines.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Organization UI
Review profile: CHILL
Plan: Pro
Run ID: 591b0a83-db58-419b-9046-a287640c580c
📒 Files selected for processing (1)
src/proxy/handlers/chat_completions/mod.rs
Summary by CodeRabbit
New Features
Bug Fixes
Chores
Tests