
Add a conversation memory that combines a (optionally persistent) vectorstore history with a token buffer #22155

Merged
12 commits merged into langchain-ai:master on Jun 26, 2024

Conversation

lstein
Contributor

@lstein lstein commented May 25, 2024

langchain: ConversationVectorStoreTokenBufferMemory

-Description: This PR adds ConversationVectorStoreTokenBufferMemory. It is similar in concept to ConversationSummaryBufferMemory: it maintains an in-memory buffer of messages up to a preset token limit. After the limit is hit, timestamped messages are written into a vectorstore retriever rather than into a summary. The user's prompt is then used to retrieve relevant fragments of the previous conversation. By persisting the vectorstore, one can maintain memory from session to session.
-Issue: n/a
-Dependencies: none
-Twitter handle: Please no!!!

  • Add tests and docs: I looked to see how the unit tests were written for the other ConversationMemory modules, but couldn't find anything other than a test for successful import. I need to know whether you are using pytest.mock or another fixture to simulate the LLM and vectorstore. In addition, I would like guidance on where to place the documentation. Should it be a notebook file in docs/docs?

  • Lint and test: I am seeing some linting errors from a couple of modules unrelated to this PR.

If no one reviews your PR within a few days, please @-mention one of baskaryan, efriis, eyurtsev, ccurme, vbarda, hwchase17.
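A minimal, self-contained sketch of the mechanism the description outlines: keep recent messages in a token-bounded buffer, and when the buffer overflows, move the oldest timestamped messages into a searchable store that is later queried with the user's prompt. All names here are illustrative, not the actual langchain API, and the "vector store" is a toy bag-of-words index standing in for a real embedding-backed retriever.

```python
import time
from collections import Counter
from math import sqrt


def _vec(text):
    # toy bag-of-words "embedding"; a real setup would use an embedding model
    return Counter(text.lower().split())


def _cosine(a, b):
    dot = sum(a[t] * b.get(t, 0) for t in a)
    na = sqrt(sum(v * v for v in a.values()))
    nb = sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


class VectorStoreTokenBufferMemory:
    """Illustrative sketch, not the langchain class of the same name."""

    def __init__(self, max_tokens=50):
        self.max_tokens = max_tokens
        self.buffer = []   # recent (timestamp, text) pairs, oldest first
        self.store = []    # overflow: (timestamp, text, vector) triples

    def _token_count(self):
        # crude whitespace tokenizer stands in for a real token counter
        return sum(len(text.split()) for _, text in self.buffer)

    def add_message(self, text):
        self.buffer.append((time.time(), text))
        # evict oldest messages into the vector store once over the limit
        while self._token_count() > self.max_tokens and len(self.buffer) > 1:
            ts, old = self.buffer.pop(0)
            self.store.append((ts, old, _vec(old)))

    def load_context(self, prompt, k=2):
        # recalled fragments of past conversation, followed by the live buffer
        qv = _vec(prompt)
        ranked = sorted(self.store, key=lambda e: _cosine(qv, e[2]), reverse=True)
        recalled = [text for _, text, _ in ranked[:k]]
        recent = [text for _, text in self.buffer]
        return recalled + recent
```

For example, after three short messages overflow an 8-token buffer, a prompt mentioning a cat recalls the earlier cat-related message from the store while the most recent message stays in the buffer.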


@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. Ɑ: memory Related to memory module Ɑ: vector store Related to vector store module 🤖:improvement Medium size change to existing code to handle new use-cases labels May 25, 2024
@lstein lstein requested a review from isahers1 June 14, 2024 15:39
@lstein
Contributor Author

lstein commented Jun 14, 2024

BTW, I'm using this routinely with persistence in my personal chatbots and am constantly amazed at how frequently the chatbot recalls something relevant from a previous conversation. For small (7b) local models, this really works much better than the conversation summary chain.
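As described above, it is the overflow store, not the live buffer, that carries memory between sessions. A hedged sketch of that persistence step, with hypothetical helper names and JSON as the storage format (a real deployment would persist an actual vector database instead):

```python
import json
from pathlib import Path


def save_store(store, path):
    # entries are (timestamp, text, ...) tuples; vectors can be rebuilt on load,
    # so only the timestamp and text need to survive the round trip
    Path(path).write_text(json.dumps([[ts, text] for ts, text, *_ in store]))


def load_store(path):
    # returns [] for a fresh session with no prior history on disk
    p = Path(path)
    if not p.exists():
        return []
    return [(ts, text) for ts, text in json.loads(p.read_text())]
```

Saving at the end of a chat session and loading at the start of the next one is what lets the retriever surface fragments of earlier conversations.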

I think a maintainer needs to approve the required workflows?

@lstein lstein force-pushed the conversation_memory_vectorstore branch from f1e65aa to ba23101 Compare June 14, 2024 18:18

vercel bot commented Jun 18, 2024

Deployment failed with the following error:

The provided GitHub repository does not contain the requested branch or commit reference. Please ensure the repository is not empty.

@lstein
Contributor Author

lstein commented Jun 18, 2024

The memory/test_imports.py unit test was failing due to a circular import issue. I have fixed it by changing the import order.

@ccurme ccurme added the langchain Related to the langchain package label Jun 21, 2024
@lstein
Contributor Author

lstein commented Jun 23, 2024

@isahers1 All tests are passing and the documentation looks good. At what point does this get merged into the main branch, or does more need to be done?

@isahers1 isahers1 merged commit c314222 into langchain-ai:master Jun 26, 2024
73 checks passed
@isahers1
Collaborator

Just merged, sorry for the delay!

@lstein lstein deleted the conversation_memory_vectorstore branch June 26, 2024 03:38
hinthornw pushed a commit that referenced this pull request Jul 3, 2024
…torstore history with a token buffer (#22155)

Co-authored-by: Lincoln Stein <lstein@gmail.com>
Co-authored-by: isaac hershenson <ihershenson@hmc.edu>