refactor: Update how we look for finish_reason #9046
Conversation
Pull Request Test Coverage Report for Build 13898923105 (Details)
💛 - Coveralls
Hey @Amnah199, this turned into a slightly larger refactor to help simplify how we process streaming chunks and to add tests that make sure we capture the usage correctly when it's asked for. All of these changes still work with the Mistral Chat Generator.
```python
def _convert_usage_chunk_to_streaming_chunk(self, chunk: ChatCompletionChunk) -> StreamingChunk:
    """
    Converts the usage chunk received from the OpenAI API when `include_usage`
    is set to `True` to a StreamingChunk.

    :param chunk: The usage chunk returned by the OpenAI API.
    """
```
I removed this function since we didn't actually use the processed usage data from it when constructing the final ChatMessage. See the updated `_convert_streaming_chunks_to_chat_message`, where we now use the native chunk from OpenAI to provide the usage data.
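For context, here is a minimal sketch of that idea; the helper name `usage_from_last_chunk` is illustrative and not the exact Haystack code. When `stream_options={"include_usage": True}` is set, OpenAI sends one final chunk whose `usage` field is populated, so the usage can be read straight off the last native chunk:

```python
from typing import Dict, List

from openai.types.chat import ChatCompletionChunk


def usage_from_last_chunk(openai_chunks: List[ChatCompletionChunk]) -> Dict:
    """Return usage from the final native chunk as a plain dict, or {} if absent."""
    if not openai_chunks:
        return {}
    last_chunk = openai_chunks[-1]
    # When include_usage is requested, only the last chunk carries usage data.
    return dict(last_chunk.usage) if last_chunk.usage else {}
```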
Right, I'll run it locally for the original use case behind this PR to verify that this works.
I've also added a unit test and an integration test that check the usage stats when streaming with `include_usage: True` (a sketch of the shape of that test follows), so I hope that covers it, but let me know if you run into any issues.
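Roughly the shape of such a unit test; this is a hedged sketch, and the `make_chunk` helper and assertions are illustrative rather than the PR's exact test code:

```python
from openai.types.chat import ChatCompletionChunk
from openai.types.chat.chat_completion_chunk import Choice, ChoiceDelta
from openai.types.completion_usage import CompletionUsage


def make_chunk(content=None, finish_reason=None, usage=None) -> ChatCompletionChunk:
    # Build a minimal ChatCompletionChunk; fields mirror the OpenAI schema.
    # The final usage-only chunk sent by OpenAI has an empty choices list.
    choices = []
    if content is not None or finish_reason is not None:
        choices = [Choice(index=0, delta=ChoiceDelta(content=content), finish_reason=finish_reason)]
    return ChatCompletionChunk(
        id="chatcmpl-test",
        object="chat.completion.chunk",
        created=1700000000,
        model="gpt-4o-mini",
        choices=choices,
        usage=usage,
    )


def test_usage_is_taken_from_the_last_chunk():
    chunks = [
        make_chunk(content="Hello"),
        make_chunk(finish_reason="stop"),
        make_chunk(usage=CompletionUsage(prompt_tokens=5, completion_tokens=2, total_tokens=7)),
    ]
    # Mirrors the PR's pattern: the final chunk carries the usage payload.
    usage = dict(chunks[-1].usage or {})
    assert usage["prompt_tokens"] == 5
    assert usage["total_tokens"] == 7
```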
"index": 0, | ||
"finish_reason": finish_reason, | ||
"completion_start_time": chunks[0].meta.get("received_at"), # first chunk received | ||
"usage": chunk.usage or {}, | ||
"usage": dict(last_chunk.usage or {}), # last chunk has the final usage data if available |
I updated this to follow what we do in our normal chat completion processing, where we store the dict version of the usage; otherwise it's returned as an OpenAI Python type.
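To illustrate the point with a standalone snippet (not code from the PR): `usage` on an OpenAI response is a `CompletionUsage` pydantic model, so without the conversion the meta dict would carry an OpenAI type instead of a plain dict:

```python
from openai.types.completion_usage import CompletionUsage

usage = CompletionUsage(prompt_tokens=5, completion_tokens=2, total_tokens=7)
print(type(usage).__name__)  # CompletionUsage (an OpenAI pydantic model)
print(dict(usage))           # plain dict: {'completion_tokens': 2, 'prompt_tokens': 5, 'total_tokens': 7, ...}
```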
LG!
* Update how we look for finish_Reason
* Additional change
* Add unit test and integration test
* Refactor
* Use correct mock
* PR comments
Related Issues
Proposed Changes:
Changes how we look for the `finish_reason` when processing streaming chunks. OpenAI and Mistral put the `finish_reason` in different chunks, so we now just look for the last one that is not None. A minimal sketch of that lookup is shown below.
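A minimal sketch of the new lookup, assuming `chunks` is the list of Haystack `StreamingChunk` objects collected during streaming; the helper name is illustrative, not the PR's exact function:

```python
from typing import List, Optional

from haystack.dataclasses import StreamingChunk


def last_non_none_finish_reason(chunks: List[StreamingChunk]) -> Optional[str]:
    # Collect every finish_reason a chunk actually set; OpenAI and Mistral
    # emit it on different chunks, so keep only the last non-None value.
    finish_reasons = [c.meta.get("finish_reason") for c in chunks if c.meta.get("finish_reason")]
    return finish_reasons[-1] if finish_reasons else None
```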
How did you test it?
Tested manually for Mistral and ensured the existing OpenAI tests still pass.
Notes for the reviewer
Checklist
- I used one of the conventional commit types for my PR title: `fix:`, `feat:`, `build:`, `chore:`, `ci:`, `docs:`, `style:`, `refactor:`, `perf:`, `test:`, and added `!` in case the PR includes breaking changes.