[Pinecone] Controlling index ID's #8118

After some extra research, I think I have a good solution that doesn't cost much in performance.

I'm now adding the external document ID, from whatever service we're using, to the metadata of the extracted document. This metadata is duplicated onto every chunk when chunking, embedding and inserting into the vector database. When I want to delete one full document, I can just filter on that metadata. Very powerful and quick.

docsearch.delete(filter={'external_id':'2'})
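
To make the idea concrete, here is a minimal in-memory sketch of the same pattern. It does not call Pinecone or LangChain: `Chunk`, `chunk_document` and `InMemoryStore` are hypothetical names made up for illustration, and the fixed-size splitter is just an assumption standing in for whatever text splitter you use. It only shows how the `external_id` ends up on every chunk and how a metadata filter then removes a whole document in one call.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    text: str
    metadata: dict


def chunk_document(text, external_id, chunk_size=20):
    """Split text into fixed-size chunks; every chunk gets its own copy of the metadata."""
    return [
        Chunk(text=text[i:i + chunk_size], metadata={"external_id": external_id})
        for i in range(0, len(text), chunk_size)
    ]


class InMemoryStore:
    """Hypothetical stand-in for a vector store; it only models metadata-filtered deletes."""

    def __init__(self):
        self.records = {}
        self._next_id = 0

    def add(self, chunks):
        # Each chunk is stored under its own generated ID, as a vector store would do.
        for chunk in chunks:
            self.records[f"vec-{self._next_id}"] = chunk
            self._next_id += 1

    def delete(self, filter):
        """Remove every record whose metadata matches all key/value pairs in `filter`."""
        doomed = [
            record_id
            for record_id, chunk in self.records.items()
            if all(chunk.metadata.get(key) == value for key, value in filter.items())
        ]
        for record_id in doomed:
            del self.records[record_id]
        return len(doomed)


store = InMemoryStore()
store.add(chunk_document("a" * 50, external_id="1"))  # split into 3 chunks
store.add(chunk_document("b" * 30, external_id="2"))  # split into 2 chunks

# Delete the whole document with external_id "2" without knowing its chunk IDs.
removed = store.delete(filter={"external_id": "2"})
remaining_ids = {chunk.metadata["external_id"] for chunk in store.records.values()}
```

The point of the design is that you never have to track the generated chunk IDs yourself: the external ID travels with every chunk, so one filter is enough to find them all.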

Answer selected by guidorietbroek
Category
Q&A