Optional Document Storage? #1023
Unanswered
martinossx
asked this question in
1. Q&A
Replies: 1 comment 1 reply
-
The main reason for document storage is to allow updates. When a document is uploaded, the ID is persisted in document storage, using a folder name. The folder contains information used during the update process. This information would be hard to store in other places. If documents are never updated and KM runs in a single node, you can use SimpleDocumentStorage and keep data in memory (with the risk of losses in case of reboots though). Considering multiple nodes and reliability, data needs to be centralized somewhere and Azure Blobs is one of the options. It should not be that slow though. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
I have a similar use-case like this where I want to recognize a user's intent. For every intent I have some utterances. Basically I want to replace LUIS with KernelMemory.
Planned setup:
For my use case, I only want to find the name of the intent based on a user prompt. Utterances, intent name and vector data is all stored on Azure AI Search after ingestion.
So I asked myself, how does a document store helping me here? Everything I need can be pulled from the Vector store.
I started with SimpleDocumentStorage, but had to enable AllowMixingVolatileAndPersistentData.
Then I configured KM with WithAzureBlobsDocumentStorage but I noticed that this makes the ingestion super slow, even when I import the same set of data again.
I expected KM to at least skip updating intents that has not changed, but that didn't happen..
I guess I get something wrong.
Can someone explain me, why I need a DocumentStorage at all? Can I opt out from it, or is there an option on DocumentStorage that avoid re-importing unchanged data?
Beta Was this translation helpful? Give feedback.
All reactions