fix(llm): stream_options.include_usage so cost tracking actually fires by rdwj · Pull Request #119 · fips-agents/agent-template

rdwj · 2026-04-28T02:56:01Z

Summary

LLMClient.call_model_stream_raw was never asking vLLM for the terminal usage chunk, so OpenAIChatServer._persist_cost_data (added in #117) returned early on every real call and cost_data accumulators stayed empty in deployed agents. Sets stream_options={\"include_usage\": True} by default. setdefault semantics so callers can opt out.

Surfaced during the cluster smoke for #116. Closes #118.

Test plan

Two regression tests in test_llm.py — default-on and caller-override
Full unit suite: 761 passing
Re-run cluster smoke after 0.14.1 lands (this PR's purpose)

Bumps fipsagents to 0.14.1.

Closes #118.

Assisted-by: Claude Code (Opus 4.7)

call_model_stream_raw now defaults stream_options to {"include_usage": True}. Without it, vLLM (and any OpenAI-compatible server) never emits a terminal usage chunk on streaming responses, which means StreamMetrics.prompt_tokens / completion_tokens stay None and OpenAIChatServer._persist_cost_data returns early -- cost_data accumulators stayed empty in production despite full unit-test green. Surfaced during the cluster smoke for #116. setdefault semantics so callers can opt out by passing stream_options={include_usage: False}. Bumps fipsagents to 0.14.1. Closes #118. Assisted-by: Claude Code (Opus 4.7)

rdwj merged commit 275b1f9 into main Apr 28, 2026

rdwj deleted the fix/stream-include-usage-118 branch April 28, 2026 02:56

rdwj mentioned this pull request Apr 28, 2026

[arch] Design discussion: shared pluggable service for cross-agent state (feedback, sessions, traces) #112

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(llm): stream_options.include_usage so cost tracking actually fires#119

fix(llm): stream_options.include_usage so cost tracking actually fires#119
rdwj merged 1 commit into
mainfrom
fix/stream-include-usage-118

rdwj commented Apr 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

rdwj commented Apr 28, 2026

Summary

Test plan

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant