core[minor]: Add llm cache in stream #15575

pprados · 2024-01-05T09:33:11Z

Description:
The current implementation does not use the LLM cache when streaming.

This PR adds support for the LLM cache in `stream`.

Twitter handle:
@pprados

baskaryan

Could we add some unit tests?

baskaryan · 2024-01-15T19:39:59Z

libs/core/langchain_core/language_models/llms.py

+                    assert generation is not None
+                else:
+                    generation = GenerationChunk(text=existing_prompts[0][0].text)
+                    yield generation


We should be yielding strings here, not `GenerationChunk`s.
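To make the review point concrete, here is a minimal sketch of a cache-aware `stream` that yields plain strings on both the cache-hit and cache-miss paths. It is not the code from this PR or from `langchain_core`: `CachedStreamingLLM`, `fake_backend`, the in-memory dict cache, and the local `GenerationChunk` dataclass are all hypothetical stand-ins used only for illustration.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Iterator, List


@dataclass
class GenerationChunk:
    """Hypothetical stand-in for langchain_core.outputs.GenerationChunk."""

    text: str


class CachedStreamingLLM:
    """Toy wrapper (an assumption, not the langchain_core implementation)."""

    def __init__(self, backend: Callable[[str], Iterator[GenerationChunk]]) -> None:
        self._backend = backend
        self._cache: Dict[str, str] = {}  # prompt -> full generated text
        self.backend_calls = 0

    def stream(self, prompt: str) -> Iterator[str]:
        cached = self._cache.get(prompt)
        if cached is not None:
            # Cache hit: yield the stored text as a plain string,
            # not wrapped in a GenerationChunk.
            yield cached
            return

        # Cache miss: forward each chunk's text and collect it for the cache.
        self.backend_calls += 1
        parts: List[str] = []
        for chunk in self._backend(prompt):
            parts.append(chunk.text)
            yield chunk.text

        # Store only once the stream has been fully consumed.
        self._cache[prompt] = "".join(parts)


def fake_backend(prompt: str) -> Iterator[GenerationChunk]:
    """Hypothetical model that always streams the same three chunks."""
    for piece in ("Hello", ", ", "world"):
        yield GenerationChunk(text=piece)


llm = CachedStreamingLLM(fake_backend)
first = list(llm.stream("hi"))   # streamed from the backend
second = list(llm.stream("hi"))  # served from the cache as one string
```

In this sketch a cache hit arrives as a single string rather than as the original chunk sequence. Whether cached output should be replayed chunk by chunk is a design choice the thread does not settle.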

@baskaryan

…22065)

# package community: Fix SQLChatMessageHistory

## Description

Here is a rewrite of `SQLChatMessageHistory` to properly implement the asynchronous approach. The code circumvents [issue 22021](#22021) by accepting a synchronous call to `def add_messages()` in an asynchronous scenario. This bypasses the bug.

For the same reasons as in [PR 32](langchain-ai/langchain-postgres#32) of `langchain-postgres`, we use a lazy strategy for table creation. Indeed, the promise of the constructor cannot be fulfilled without this: it is not possible to invoke an asynchronous call in a constructor. We compensate for this by waiting for the next asynchronous method call to create the table.

The goal of the `PostgresChatMessageHistory` class (in `langchain-postgres`) is, among other things, to be able to recycle database connections. The implementation of the class is problematic, as we have demonstrated in [issue 22021](#22021).

Our new implementation of `SQLChatMessageHistory` achieves this by using a singleton of type (`Async`)`Engine` for the database connection. The connection pool is managed by this singleton, and the code is then reentrant.

We also accept the type `str` (optionally complemented by `async_mode`. I know you don't like this much, but it's the only way to allow an asynchronous connection string).

In order to unify the different classes handling database connections, we have renamed `connection_string` to `connection`, and `Session` to `session_maker`.

Now, a single transaction is used to add a list of messages. Thus, a crash during this write operation will not leave the database in an unstable state with a partially added message list. This makes the code resilient.

We believe that the `PostgresChatMessageHistory` class is no longer necessary and can be replaced by:

```
PostgresChatMessageHistory = SQLChatMessageHistory
```

This also fixes the bug.

## Issue

- [issue 22021](#22021)
- Bug in _exit_history()
- Bugs in PostgresChatMessageHistory and sync usage
- Bugs in PostgresChatMessageHistory and async usage
- [issue 36](langchain-ai/langchain-postgres#36)

## Twitter handle:

pprados

## Tests

- libs/community/tests/unit_tests/chat_message_histories/test_sql.py (add async test)

@baskaryan, @eyurtsev or @hwchase17 can you check this PR? Also, I've been waiting a long time for validation of other PRs. Can you take a look?

- [PR 32](langchain-ai/langchain-postgres#32)
- [PR 15575](#15575)
- [PR 13200](#13200)

---------

Co-authored-by: Eugene Yurtsev <eyurtsev@gmail.com>
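The lazy table creation and single-transaction write described in that commit message can be illustrated with a small sketch. It is not the `SQLChatMessageHistory` code: it swaps in the synchronous standard-library `sqlite3` module for the SQLAlchemy (`Async`)`Engine`, and `LazySQLiteHistory`, its `messages` table, and its column names are hypothetical.

```python
import sqlite3
from typing import List


class LazySQLiteHistory:
    """Hypothetical illustration; not the langchain SQLChatMessageHistory."""

    def __init__(self, conn: sqlite3.Connection, session_id: str) -> None:
        # The constructor does no database work; the table is created lazily.
        self._conn = conn
        self._session_id = session_id
        self._table_ready = False

    def _ensure_table(self) -> None:
        # Called by the first method that actually needs the table.
        if not self._table_ready:
            self._conn.execute(
                "CREATE TABLE IF NOT EXISTS messages ("
                "id INTEGER PRIMARY KEY, "
                "session_id TEXT NOT NULL, "
                "body TEXT NOT NULL)"
            )
            self._table_ready = True

    def add_messages(self, bodies: List[str]) -> None:
        self._ensure_table()
        # Using the connection as a context manager commits on success and
        # rolls back on an exception, so the list is written all-or-nothing.
        with self._conn:
            for body in bodies:
                self._conn.execute(
                    "INSERT INTO messages (session_id, body) VALUES (?, ?)",
                    (self._session_id, body),
                )

    def messages(self) -> List[str]:
        self._ensure_table()
        rows = self._conn.execute(
            "SELECT body FROM messages WHERE session_id = ? ORDER BY id",
            (self._session_id,),
        ).fetchall()
        return [row[0] for row in rows]


conn = sqlite3.connect(":memory:")
history = LazySQLiteHistory(conn, "session-1")
history.add_messages(["hello", "how are you?"])
```

If one insert in a batch fails (for example, a `NULL` body violating the `NOT NULL` constraint), the rollback discards the earlier inserts from the same call, which is the resilience property the commit message describes.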

eyurtsev · 2024-07-11T15:35:03Z

@pprados I'm going to close this PR for now due to lack of activity. If you want to make a fix that supports caching in streaming for LLMs or chat models, feel free to open a new PR!

Add llm cache in stream

155cc5b

dosubot bot added the size:L (this PR changes 100-499 lines, ignoring generated files) label Jan 5, 2024

dosubot bot added the Ɑ: models (related to LLMs or chat model modules), 🤖:improvement (medium size change to existing code to handle new use-cases), and 🔌: redis (primarily related to Redis integrations) labels Jan 5, 2024

fmt

d756a14

dosubot bot added the size:M (this PR changes 30-99 lines, ignoring generated files) label and removed the size:L (this PR changes 100-499 lines, ignoring generated files) label Jan 15, 2024

baskaryan reviewed Jan 15, 2024

hwchase17 closed this Jan 30, 2024

baskaryan reopened this Jan 30, 2024

pprados mentioned this pull request May 23, 2024

community[minor]: Add native async support to SQLChatMessageHistory #22065

Merged

pprados changed the title ~~Add llm cache in stream~~ langchain[minor]: Add llm cache in stream Jun 11, 2024

pprados changed the title ~~langchain[minor]: Add llm cache in stream~~ core[minor]: Add llm cache in stream Jun 11, 2024

pprados marked this pull request as draft June 19, 2024 05:46

ccurme added the Ɑ: core (related to langchain-core) label Jun 19, 2024

hwchase17 assigned eyurtsev Jul 8, 2024

eyurtsev closed this Jul 11, 2024
