
[Feature Request]: Streaming ONLY the final message? #1143

Closed
tyler-suard-parker opened this issue Jan 4, 2024 · 8 comments · Fixed by #1551
Labels
enhancement New feature or request

Comments

@tyler-suard-parker
Contributor

Is your feature request related to a problem? Please describe.

Hello. I am using AutoGen as a retrieval-augmented generation agent. It works fantastically, and it performs multiple searches for different topics when necessary. However, building and sending the final answer takes too long for my users. I was hoping there is a way to stream just that one final answer, as it takes the majority of the time (about 20 seconds out of 30 seconds total). I looked at all the open issues and pull requests, and I am still not sure of the status of streaming with AutoGen.

Describe the solution you'd like

In the user_proxy agent class, add a parameter such as stream_final_message=True.
This would let the agents converse back and forth and pull whatever information is needed, while the final message is streamed so users don't have to wait for that entire message to be generated, because it tends to be long.
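A rough sketch of the proposed usage (stream_final_message is hypothetical and does not exist in autogen today; the rest follows the standard two-agent setup):

```python
# Hypothetical usage of the proposed flag. stream_final_message is NOT a
# real autogen parameter; everything else is the standard two-agent setup.
from autogen import AssistantAgent, UserProxyAgent

assistant = AssistantAgent("assistant", llm_config={"model": "gpt-4"})
user_proxy = UserProxyAgent(
    "user_proxy",
    human_input_mode="NEVER",
    stream_final_message=True,  # proposed: stream only the final reply token by token
)
user_proxy.initiate_chat(assistant, message="Answer the user's question.")
```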

Additional context

No response

@tyler-suard-parker tyler-suard-parker added the enhancement New feature or request label Jan 4, 2024
@rickyloynd-microsoft
Contributor

@thinkall Do you think streaming would help here?

@victordibia
Collaborator

I am not sure streaming would help here.
Interaction between agents in AutoGen is currently sequential, i.e., each agent generates its full response, which is then sent to the next agent (written into its message history). This means all previous messages must be generated (with the associated latency incurred) before the final response is generated.
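A minimal sketch of that sequential flow (standard pyautogen two-agent setup; the model name is just an example):

```python
# initiate_chat blocks while each agent generates its reply in full; a reply
# is only written to the shared history (and visible anywhere) once complete,
# so the final answer cannot begin until every earlier turn has finished.
from autogen import AssistantAgent, UserProxyAgent

assistant = AssistantAgent("assistant", llm_config={"model": "gpt-4"})
user_proxy = UserProxyAgent(
    "user_proxy", human_input_mode="NEVER", code_execution_config=False
)
user_proxy.initiate_chat(assistant, message="Research the topic and summarize.")
```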
In terms of UX, what could help might be showing users the intermediate messages as they are generated on the way to a final answer.
Happy to hear more thoughts here.

@tyler-suard-parker
Contributor Author

@victordibia Thank you for your input. I understand that the interactions between agents are sequential. Our agent interaction is something like this:

  1. Agent receives question (0 seconds)
  2. Agent generates a query (1 second)
  3. Search is performed using query and results are returned (1 second)
  4. Answer to user question is generated using the query results (30 seconds)

I am hoping to stream just step 4 to my frontend, because users are not willing to wait those 30 seconds for an answer, and it would be great if they could at least see the first few words immediately, as would be the case with streaming.
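One way to approximate that today, sketched under the assumption that step 4 can be issued as a direct call with the openai>=1.0 Python client (outside the agent loop) once steps 1-3 have produced the search results:

```python
# Hypothetical sketch: after the agents finish steps 1-3, make the final
# completion call directly with stream=True and forward tokens to the
# frontend as they arrive, instead of waiting ~30s for the full answer.
from openai import OpenAI

client = OpenAI()

def stream_final_answer(question: str, search_results: str):
    stream = client.chat.completions.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": "Answer using the search results."},
            {"role": "user", "content": f"{question}\n\nResults:\n{search_results}"},
        ],
        stream=True,  # yields incremental chunks instead of one blocking response
    )
    for chunk in stream:
        if chunk.choices and chunk.choices[0].delta.content:
            yield chunk.choices[0].delta.content  # push each token to the frontend
```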

@victordibia
Collaborator

victordibia commented Jan 5, 2024

Ah ... got it. You want to stream responses (in your case, just the last message).
I recall there was a PR for streaming.
@ragyabraham has extensive experience in that area (he's built a tool that implements this functionality).
@ragyabraham, any pointers you can share would be appreciated!

@tyler-suard-parker
Contributor Author

Thank you @victordibia! @ragyabraham, I am sure this is a common use case. I want to be able to stream just the last message to my front end as it is being created. Do you have any suggestions on how I could do that?

@ragyabraham
Collaborator

Hey @tyler-suard-parker, sure. We utilise sockets to stream messages to the frontend: we instantiate a socket client, pass it as a callable in the agent config, and then use it to emit each message to the frontend. If you want more detail, check out our fork of autogen.
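A rough sketch of that pattern using python-socketio (the event name, URL, and the exact hook into the agent config are assumptions about the fork, not upstream autogen API):

```python
# Socket client that pushes each agent message to the frontend as it is
# produced. How the callable is wired into the agent config is specific to
# the fork and only sketched here.
import socketio

sio = socketio.Client()
sio.connect("http://localhost:5000")  # assumed address of the frontend's socket server

def emit_to_frontend(message: str) -> None:
    """Emit one agent message (or token chunk) to the frontend."""
    sio.emit("agent_message", {"content": message})

# The fork reportedly accepts a callable like emit_to_frontend in the agent
# config and invokes it whenever a message is generated.
```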

@tyler-suard-parker
Contributor Author

tyler-suard-parker commented Jan 6, 2024

@ragyabraham Thank you so much for your help! I was not able to get your branch to run, so I opened an issue. For my use case, I am using a frontend, an Azure Functions app for the backend, and OpenAI. My main concern is the OpenAI generation time: some answers take up to 2 minutes to generate and users are complaining, so I want every word to hit my frontend as it is generated by OpenAI. Would I be able to do that using your fork?

@lordlinus
Collaborator

+1, looking for the same.
How can I stream the final message as it is generated? (Ideally, we could also stream the intermediate messages.)
