
# core[minor]: Add v2 implementation of astream events #21638

Merged: 34 commits merged into master from eugene/async_astream_events on May 15, 2024

## Conversation

eyurtsev (Collaborator) commented May 13, 2024

This PR introduces a v2 implementation of astream events that removes intermediate abstractions and fixes some issues with the v1 implementation.

The v2 implementation significantly reduces the amount of code associated with the astream events implementation, along with its overhead.

After this PR, the astream events implementation:

- Uses an async callback handler
- No longer relies on BaseTracer
- No longer relies on json patch

As a result of this rewrite, a number of issues were discovered in the existing implementation.
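For reference, the new implementation is opted into via the `version` parameter of `astream_events`; a minimal sketch (the toy chain here is purely illustrative):

```python
import asyncio

from langchain_core.runnables import RunnableLambda

async def main() -> None:
    # Any Runnable works; a trivial lambda keeps the sketch self-contained.
    chain = RunnableLambda(lambda x: x + 1)

    # version="v2" selects the implementation added by this PR.
    async for event in chain.astream_events(1, version="v2"):
        print(event["event"], event["name"], event["data"])

asyncio.run(main())
```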

## Changes in V2 vs. V1

### on_chat_model_end `output`

The outputs associated with `on_chat_model_end` changed depending on whether the model was within a chain or not.

As a root-level runnable, the output was:

```python
"data": {"output": AIMessageChunk(content="hello world!", id='some id')}
```

As part of a chain, the output was:

```
"data": {
    "output": {
        "generations": [
            [
                {
                    "generation_info": None,
                    "message": AIMessageChunk(
                        content="hello world!", id=AnyStr()
                    ),
                    "text": "hello world!",
                    "type": "ChatGenerationChunk",
                }
            ]
        ],
        "llm_output": None,
    }
},
```

After this PR, we will always use the simpler representation:

```python
"data": {"output": AIMessageChunk(content="hello world!", id='some id')}
```

**NOTE:** Non-chat models (i.e., regular LLMs) still use the more verbose format.
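For illustration, a minimal sketch of consuming the new shape, using `GenericFakeChatModel` (as in the tests; the input string and fixed message are arbitrary):

```python
import asyncio
from itertools import cycle

from langchain_core.language_models import GenericFakeChatModel
from langchain_core.messages import AIMessage

async def main() -> None:
    model = GenericFakeChatModel(messages=cycle([AIMessage(content="hello world!")]))

    async for event in model.astream_events("hi", version="v2"):
        if event["event"] == "on_chat_model_end":
            # v2: always an AIMessageChunk, never the nested generations dict.
            print(event["data"]["output"])

asyncio.run(main())
```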

### Remove some `_stream` events

The `on_retriever_stream` and `on_tool_stream` events were removed -- they were not real events, but an artifact of implementing on top of `astream_log`.

The same information is already available in the corresponding `on_x_end` events.
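Code that previously listened for `on_tool_stream` can read the final output from the end event instead; a sketch with a hypothetical tool:

```python
import asyncio

from langchain_core.tools import tool

@tool
def reverse(text: str) -> str:
    """Reverse the input text."""
    return text[::-1]

async def main() -> None:
    async for event in reverse.astream_events({"text": "olleh"}, version="v2"):
        # No on_tool_stream events fire in v2; the result arrives here.
        if event["event"] == "on_tool_end":
            print(event["data"]["output"])

asyncio.run(main())
```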

### Propagating Names

Names of runnables have been updated to be more consistent.

```python
model = GenericFakeChatModel(messages=infinite_cycle).configurable_fields(
    messages=ConfigurableField(
        id="messages",
        name="Messages",
        description="Messages returned by the LLM",
    )
)
```

Before:

```python
"name": "RunnableConfigurableFields",
```

After:

```python
"name": "GenericFakeChatModel",
```

### on_retriever_end

`on_retriever_end` will always return an `output` that is a list of documents (rather than a dict containing a key called `"documents"`).
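A sketch with a toy retriever (a hypothetical `BaseRetriever` subclass) showing the new shape:

```python
import asyncio
from typing import List

from langchain_core.callbacks import CallbackManagerForRetrieverRun
from langchain_core.documents import Document
from langchain_core.retrievers import BaseRetriever

class ToyRetriever(BaseRetriever):
    """Hypothetical retriever returning a fixed document."""

    def _get_relevant_documents(
        self, query: str, *, run_manager: CallbackManagerForRetrieverRun
    ) -> List[Document]:
        return [Document(page_content=f"about {query}")]

async def main() -> None:
    async for event in ToyRetriever().astream_events("cats", version="v2"):
        if event["event"] == "on_retriever_end":
            # v2: a plain list of Documents, not {"documents": [...]}.
            print(event["data"]["output"])

asyncio.run(main())
```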

### Retry events

Removed the `on_retry` callback handler. It was incorrectly showing that the failed function being retried had invoked `on_chain_end`:

https://github.com/langchain-ai/langchain/pull/21638/files#diff-e512e3f84daf23029ebcceb11460f1c82056314653673e450a5831147d8cb84dL1394


eyurtsev changed the title on May 14, 2024: core[minor]: Replace implementation of astream events → core[major]: Replace implementation of astream events → core[minor]: Replace implementation of astream events → core[minor]: Add v2 implementation of astream events
eyurtsev marked this pull request as ready for review May 14, 2024 20:11
eyurtsev requested a review from nfcampos May 14, 2024 20:11
eyurtsev assigned efriis and nfcampos and unassigned efriis May 14, 2024
dosubot added the size:XXL (This PR changes 1000+ lines, ignoring generated files) and 🤖:improvement (Medium size change to existing code to handle new use-cases) labels May 14, 2024
A reviewer (Collaborator) commented on this hunk:

```python
) -> None:
    """End a trace for an LLM run."""
    # "chat_model" is only used for the experimental new streaming_events format.
    # This change should not affect any existing tracers.
```

this comment shouldn't be here?

A reviewer (Contributor) commented on this hunk:

```
@@ -1936,7 +1800,8 @@ async def _atransform_stream_with_config(
    """Helper method to transform an Async Iterator of Input values into an Async
    Iterator of Output values, with callbacks.
    Use this to implement `astream()` or `atransform()` in Runnable subclasses."""
    from langchain_core.tracers.log_stream import LogStreamCallbackHandler
    # Mixing that is used by both astream log and astream events implementation
```

Typo ("Mixing" should presumably be "Mixin")

eyurtsev (Collaborator, Author)

@jacoblee93 / @nfcampos

`on_llm_end` still has a very verbose output.

Options are:

  1. Keep as is
  2. Output only the text (but that makes it impossible to pluck out metadata)
  3. Output only the text, and start feeding metadata into a separate key in `data`
  4. Do a hack and start outputting AIMessageChunk

jacoblee93 (Contributor), quoting the options above:
I'd opt for making as few transformations as possible, which I think would be 1 or 4?

eyurtsev (Collaborator, Author)

We can keep (1) as-is for now until we decide what to do. (4) might be a bad idea since we don't use AIMessage in regular LLMs currently.
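To make the tradeoff concrete, a sketch of what a plain LLM emits today on `on_llm_end` (using `FakeListLLM`; the output keeps the verbose generations format shown earlier, i.e., option 1):

```python
import asyncio

from langchain_core.language_models import FakeListLLM

async def main() -> None:
    llm = FakeListLLM(responses=["hello world!"])

    async for event in llm.astream_events("hi", version="v2"):
        if event["event"] == "on_llm_end":
            # Still the verbose format for plain LLMs; callers pluck text
            # and metadata out of the nested structure themselves.
            print(event["data"]["output"])

asyncio.run(main())
```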

eyurtsev merged commit 5c2cfab into master on May 15, 2024; all 96 checks passed.
eyurtsev deleted the eugene/async_astream_events branch May 15, 2024 15:48
hinthornw pushed a commit referencing this pull request on Jun 20, 2024 (its commit message repeats the PR description above).