fix: Remove streaming LLM tracking; they are all streaming now #4944
Conversation
All streaming-related changes look good to me. I have one change request regarding the added newline "\n". It means we can also change the example(s), if I am not mistaken. That's all. 🙂
haystack/agents/base.py (outdated)
@@ -318,7 +317,7 @@ def on_agent_start(**kwargs: Any) -> None:
             self.callback_manager.on_new_token += lambda token, **kwargs: print_text(token, color=agent_color)
         else:
             self.callback_manager.on_agent_step += lambda agent_step: print_text(
-                agent_step.prompt_node_response, color=agent_color
+                agent_step.prompt_node_response + "\n", color=agent_color
This change is not related to streaming, but it means we can slightly simplify the print(result) call in the Agent examples, right? For example, here: https://github.com/deepset-ai/haystack/blob/main/examples/agent_multihop_qa.py#L112
Instead of

result = agent.run(query=question)
print(f"\n{result}")

we can just do

print(result)

at the end.
@vblagoje If I run the Agent example from this branch, the "\n" is still needed:

result = agent.run(query=question)
print(f"\n{result}")

Could you please briefly explain the effect of the end="\n" change in this PR?
Sure thing. The effect is best observed when running a non-streaming agent. Simply switch the Agent to non-streaming in its init method and run the multi-hop examples with and without the end="\n" fix. The fix has no effect on streaming agents, only on non-streaming ones!
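For anyone who wants to reproduce this, here is a minimal sketch. It assumes the non-streaming switch lands as a streaming keyword on Agent.__init__ (the exact name may differ in the final PR); the model name, API key, and question are placeholders, and tools are omitted for brevity.

from haystack.agents import Agent
from haystack.nodes import PromptNode

# Placeholder model and key; the real examples also register tools on the Agent.
prompt_node = PromptNode(model_name_or_path="gpt-3.5-turbo", api_key="YOUR_OPENAI_KEY")
agent = Agent(prompt_node=prompt_node, streaming=False)  # assumed non-streaming switch

result = agent.run(query="Which city is the capital of the country where the Matterhorn is located?")

# Before the fix, the non-streaming callback printed the step response without a
# trailing newline, so the examples had to prepend one: print(f"\n{result}").
# With agent_step.prompt_node_response + "\n", a plain print is enough:
print(result)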
Force-pushed from 75fe82f to cf0183f
Looks good to me! 👍 Just tried it in an example and a tutorial. Thanks for explaining the "\n".
Related Issues
After HF introduced token streaming in the transformers 4.28.0 release, all local HF LLMs are now streaming-capable. All other PromptNode-supported models, including OpenAI, Cohere, and Anthropic, are already streaming-capable. With the recent additions to the HF Inference API, all remote HF LLMs are streaming as well.
There is no point in tracking streaming-capable models any longer. By default, streaming is on. If, for some reason, the Agent needs to be non-streaming, we provide a boolean parameter switch.
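As a sketch of what that switch looks like from the caller's side (assuming it is exposed as a streaming argument on Agent.__init__; model name and key are placeholders):

from haystack.agents import Agent
from haystack.nodes import PromptNode

prompt_node = PromptNode(model_name_or_path="gpt-3.5-turbo", api_key="YOUR_OPENAI_KEY")

# Streaming is on by default: tokens are printed as they are generated.
agent = Agent(prompt_node=prompt_node)

# Opt out explicitly when the Agent should emit each step's full response at once.
non_streaming_agent = Agent(prompt_node=prompt_node, streaming=False)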
Proposed Changes:
How did you test it?
Manual tests
Notes for the reviewer
Let's manually test multiple models in both modes (streaming and non-streaming).
Checklist
PR title starts with one of the conventional commit prefixes: fix:, feat:, build:, chore:, ci:, docs:, style:, refactor:, perf:, test:.