Handle token usage chunks in OpenAI streamed response #32823

Conversation

@timtimjnvr timtimjnvr commented Jun 16, 2025

This pull request resolves a case related to issue #28850.

During streamed chat completion requests to OpenAI models hosted on OpenWebUI, the API can include token usage chunks in the response data.

This update modifies the response handling to account for these token usage chunks, preventing errors when they are encountered.
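
For illustration, here is a minimal sketch of the approach. This is not Zed's actual implementation: the type and field shapes below are simplified assumptions modeled on the diff quoted later in this thread.

```rust
// Simplified stand-ins for the real types (assumptions, not Zed's definitions).
struct Usage {
    prompt_tokens: u64,
    completion_tokens: u64,
}

struct Choice {
    delta: Option<String>,
}

struct ResponseStreamEvent {
    choices: Vec<Choice>,
    usage: Option<Usage>,
}

enum CompletionEvent {
    Text(String),
    UsageUpdate { input_tokens: u64, output_tokens: u64 },
}

fn map_event(event: ResponseStreamEvent) -> Vec<CompletionEvent> {
    // With stream_options: {"include_usage": true}, the final chunk has an
    // empty `choices` array and a populated `usage` field. Treat it as a
    // usage report instead of an error.
    if let Some(usage) = event.usage {
        return vec![CompletionEvent::UsageUpdate {
            input_tokens: usage.prompt_tokens,
            output_tokens: usage.completion_tokens,
        }];
    }

    // Regular chunks: forward each choice's text delta.
    event
        .choices
        .into_iter()
        .filter_map(|choice| choice.delta.map(CompletionEvent::Text))
        .collect()
}
```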

Release Notes

  • Fixed OpenWebUI compatibility with Zed when OpenWebUI sends usage stats chunks in streamed responses.

cla-bot bot commented Jun 16, 2025

Thank you for your pull request and welcome to our community. We could not parse the GitHub identity of the following contributors: Timothée JANVIER.
This is most likely caused by a git client misconfiguration; please make sure to:

  1. Check whether your git client is configured with an email to sign commits: git config --list | grep email
  2. If not, set it with git config --global user.email email@example.com
  3. Make sure the git commit email is configured in your GitHub account settings; see https://github.com/settings/emails

@timtimjnvr timtimjnvr force-pushed the fix/handle-empty-choices-openwebui branch from a967fe2 to dbb2873 on June 16, 2025 at 22:31

cla-bot bot commented Jun 16, 2025

We require contributors to sign our Contributor License Agreement, and we don't have @timtimjnvr on file. You can sign our CLA at https://zed.dev/cla. Once you've signed, post a comment here that says '@cla-bot check'.

@timtimjnvr (Author)

@cla-bot check

@cla-bot added the cla-signed label (The user has signed the Contributor License Agreement) on Jun 16, 2025

cla-bot bot commented Jun 16, 2025

The cla-bot has been summoned, and re-checked this pull request!

@SomeoneToIgnore added the ai label (Improvement related to Assistant, Copilot, or other AI features) on Jun 16, 2025
@zed-industries-bot

Warnings
⚠️
fix: handle token usage chuncks in open ai streamed response
     ^

Write PR titles using sentence case.

⚠️

This PR is missing release notes.

Please add a "Release Notes" section that describes the change:

Release Notes:

- Added/Fixed/Improved ...

If your change is not user-facing, you can use "N/A" for the entry:

Release Notes:

- N/A

Have feedback on this plugin? Let's hear it!

Generated by 🚫 dangerJS against dbb2873

@maxdeviant maxdeviant changed the title fix: handle token usage chuncks in open ai streamed response Handle token usage chunks in OpenAI streamed response Jun 16, 2025
@@ -526,6 +526,15 @@ impl OpenAiEventMapper {
         &mut self,
         event: ResponseStreamEvent,
     ) -> Vec<Result<LanguageModelCompletionEvent, LanguageModelCompletionError>> {
+        if let Some(usage) = event.usage {
@imumesh18 (Contributor) commented Jun 17, 2025

This will create an issue in the case of the stop event, since we also get usage data then. The problem is that we return immediately after handling the usage data, which breaks the whole event chain and leaves the thread in a broken state. You might want to move this into the else condition of the choices check (sketched below).
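
A minimal sketch of this suggestion, reusing the simplified types from the example under the PR description (an illustration under assumed types, not the actual Zed code): the early return is gone, so a chunk that carries both choices and usage still flows through the normal choice handling.

```rust
// Hypothetical variant of map_event implementing the suggestion.
// (Reuses Usage, Choice, ResponseStreamEvent, CompletionEvent from the
// earlier sketch.)
fn map_event_suggested(event: ResponseStreamEvent) -> Vec<CompletionEvent> {
    let mut events = Vec::new();

    if !event.choices.is_empty() {
        // Normal chunk (including the stop chunk): map the choices, even
        // if usage data happens to be attached to the same event.
        for choice in event.choices {
            if let Some(text) = choice.delta {
                events.push(CompletionEvent::Text(text));
            }
        }
    } else if let Some(usage) = event.usage {
        // Usage-only chunk: empty `choices`, populated `usage`.
        events.push(CompletionEvent::UsageUpdate {
            input_tokens: usage.prompt_tokens,
            output_tokens: usage.completion_tokens,
        });
    }

    events
}
```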

@timtimjnvr (Author) commented Jun 17, 2025

From the documentation, I understand that the stop event ("finish_reason": "stop") is a field of a choice inside the choices array of the chunk object (https://platform.openai.com/docs/api-reference/chat-streaming/).

The usage field seems to be sent in a different chunk object with empty choices (see the parsing sketch after this comment):

"choices: A list of chat completion choices. Can contain more than one elements if n is greater than 1. Can also be empty for the last chunk if you set stream_options: {"include_usage": true}." (source)

However, your proposal is safer if some implementations return both fields (choices & usage) in the same event.

Thanks for the feedback 🙌
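
For reference, a small self-contained sketch of the chunk shape described in the quoted docs. It assumes serde and serde_json as dependencies, and the Chunk struct is a hypothetical subset of the schema, not a type from the codebase.

```rust
// Cargo.toml (assumed): serde = { version = "1", features = ["derive"] },
// serde_json = "1"
use serde::Deserialize;

// Hypothetical subset of the chat completion chunk schema, just enough to
// show the usage-only final chunk.
#[derive(Deserialize)]
struct Chunk {
    choices: Vec<serde_json::Value>,
    usage: Option<Usage>,
}

#[derive(Deserialize)]
struct Usage {
    prompt_tokens: u64,
    completion_tokens: u64,
    total_tokens: u64,
}

fn main() {
    // Final chunk when stream_options: {"include_usage": true} is set:
    // empty `choices`, populated `usage` (per the docs quoted above).
    let raw = r#"{
        "choices": [],
        "usage": {"prompt_tokens": 12, "completion_tokens": 34, "total_tokens": 46}
    }"#;

    let chunk: Chunk = serde_json::from_str::<Chunk>(raw).expect("valid chunk JSON");
    assert!(chunk.choices.is_empty());
    assert_eq!(chunk.usage.expect("usage present").total_tokens, 46);
    println!("usage-only chunk parsed successfully");
}
```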

@imumesh18 (Contributor) commented Jun 19, 2025

I believe there was another PR merged yesterday that fixes this: #32982

@timtimjnvr timtimjnvr closed this Jun 19, 2025