
BUG: fix llm stream response #3115

Merged

qinxuye merged 14 commits into xorbitsai:main from amumu96:bug/stream-resp on Mar 31, 2025

Conversation

amumu96 (Contributor) commented Mar 24, 2025:

  1. Modify xinference/client/tests/test_client.py: for every chunk whose finish_reason is not None, assert that delta == {"content": ""} (see the first sketch after this list).

  2. Modify xinference/model/llm/llama_cpp/core.py: filter out keys in the returned results that do not belong to ChatCompletionChunk (second sketch below).

  3. Modify xinference/model/llm/reasoning_parser.py: fix the issue where both reasoning_content == "" and content == "" were emitted in the same delta (third sketch below).

  4. Modify xinference/model/llm/utils.py: ensure that any chunk whose finish_reason is not None includes values for both content and reasoning_content (also covered by the third sketch).
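
A minimal sketch of the assertion pattern item 1 describes, assuming the client yields OpenAI-style chat-completion chunk dicts; the helper name and the example stream are illustrative, not the actual test code:

```python
from typing import Iterable


def assert_stream_terminates_cleanly(chunks: Iterable[dict]) -> None:
    """Walk an OpenAI-style chat stream and check the terminating chunk."""
    for chunk in chunks:
        choice = chunk["choices"][0]
        if choice["finish_reason"] is not None:
            # Item 1: the final chunk's delta must be exactly
            # {"content": ""}, with no stray or missing fields.
            assert choice["delta"] == {"content": ""}


# Example: a minimal two-chunk stream that satisfies the assertion.
stream = [
    {"choices": [{"delta": {"content": "Hello"}, "finish_reason": None}]},
    {"choices": [{"delta": {"content": ""}, "finish_reason": "stop"}]},
]
assert_stream_terminates_cleanly(stream)
```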
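For item 2, one way to express the key filtering is to check each raw key against the chunk type's declared fields. The ChatCompletionChunk fields shown here are a hypothetical stand-in; the real schema lives in the xinference codebase:

```python
from typing import List, TypedDict


class ChatCompletionChunk(TypedDict):
    # Hypothetical stand-in for xinference's ChatCompletionChunk schema.
    id: str
    object: str
    created: int
    model: str
    choices: List[dict]


def filter_to_chunk_schema(raw: dict) -> dict:
    # Keep only keys that belong to ChatCompletionChunk, dropping any
    # extras the llama.cpp backend tacks onto its results.
    allowed = ChatCompletionChunk.__annotations__
    return {k: v for k, v in raw.items() if k in allowed}


# Example: an unexpected "timings" key is stripped.
raw = {"id": "c1", "object": "chat.completion.chunk", "created": 0,
       "model": "m", "choices": [], "timings": {"ms": 12}}
assert "timings" not in filter_to_chunk_schema(raw)
```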
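Items 3 and 4 together amount to a rule for splitting a streamed delta between content and reasoning_content. The sketch below is a hypothetical reading of that rule, not the actual reasoning_parser.py code: mid-stream, only the active field carries text (so the two never degenerate into a pair of empty strings), while the terminating chunk carries explicit values for both:

```python
from typing import Optional


def build_delta(text: str, in_reasoning: bool,
                finish_reason: Optional[str]) -> dict:
    if finish_reason is not None:
        # Item 4: the terminating chunk includes values for both fields
        # rather than omitting one of them.
        return {"content": "", "reasoning_content": ""}
    if in_reasoning:
        # Item 3: inside the reasoning segment, only reasoning_content
        # carries text; content stays None instead of an empty string.
        return {"reasoning_content": text, "content": None}
    return {"content": text, "reasoning_content": None}


# Example deltas at each stage of a streamed reasoning response.
assert build_delta("thinking...", True, None)["content"] is None
assert build_delta("answer", False, None)["reasoning_content"] is None
assert build_delta("", False, "stop") == {"content": "", "reasoning_content": ""}
```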

XprobeBot added the bug (Something isn't working) label Mar 24, 2025
XprobeBot added this to the v1.x milestone Mar 24, 2025
amumu96 changed the title from "BUG: fix vllm stream response" to "BUG: fix llm stream response" Mar 24, 2025
qinxuye (Contributor) left a comment:

LGTM

qinxuye merged commit a6e99b4 into xorbitsai:main on Mar 31, 2025
12 of 13 checks passed
qinxuye deleted the bug/stream-resp branch on March 31, 2025 at 10:39
qinxuye pushed a commit to qinxuye/inference that referenced this pull request May 9, 2025

Labels

bug (Something isn't working)

3 participants