
Add vector store support (Weaviate, Pinecone, Faiss) #108

Merged
jerryjliu merged 19 commits into main from jerry/add_weaviate on Dec 19, 2022

Conversation

jerryjliu
Collaborator

GPT Index now offers multiple integration points with vector stores / vector databases:

  1. GPT Index can load data from vector stores, similar to any other data connector. The loaded data can then be used within GPT Index data structures (via PineconeReader, WeaviateReader, FaissReader).
  2. GPT Index can use a vector store itself (Faiss) as an index. Like any other index, it can store documents and be used to answer queries (via GPTFaissIndex); see the sketch below.
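
To make the second integration point concrete, here is a minimal sketch of building a GPTFaissIndex (assumptions: documents in a local `data/` directory and OpenAI ada-002 embeddings of dimension 1536; argument names may differ slightly from the final API in this PR):

```python
# Minimal sketch -- assumptions noted above; not the exact PR code.
import faiss
from gpt_index import GPTFaissIndex, SimpleDirectoryReader

# Load documents from any source (here, a local directory).
documents = SimpleDirectoryReader("data").load_data()

# Create an empty Faiss index; GPT Index fills it with chunk embeddings.
faiss_index = faiss.IndexFlatL2(1536)

# Build the index: GPT Index handles chunking, embedding, and storage.
index = GPTFaissIndex(documents, faiss_index=faiss_index)

# Query it like any other GPT Index data structure.
response = index.query("What did the author do growing up?")
print(response)
```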

@jerryjliu jerryjliu requested a review from teoh December 18, 2022 23:18
@@ -0,0 +1,46 @@
# Using Vector Stores
Collaborator

for a later PR: i think this page would benefit from some diagrams to show the differences between how gpt index interacts with the vector stores.

made an issue for later: #109

Collaborator Author

Yeah totally!

[Example notebooks can be found here](https://github.com/jerryjliu/gpt_index/tree/main/examples/data_connectors).


## Using a Vector Store as an Index
Collaborator

(possibly noob question, i'm still getting familiar with faiss and vector dbs)

is the difference between this vs. using faiss directly to store embeddings of the paul graham essay mainly that gpt index also generates a coherent answer? or are there other things going on?

Collaborator Author

Oh, so using Faiss as a data loader (the first section) means that you load documents from an existing Faiss index (say, one the user already has), and can then use a GPT Index structure on top of the retrieved documents - say you build a tree over them.

In this section it's saying that once you have documents, you can also build a GPT Index data structure, with Faiss under the hood, over those documents. So the documents could come from anywhere (e.g. Slack, Notion), and we'll create an index data structure over them, taking care of tokenization/chunking/querying.
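
For contrast, a rough sketch of the reader path described above (assumptions: a populated Faiss index and an id-to-text mapping already exist; import paths and argument names are approximate, not the exact PR code):

```python
# Rough sketch of the reader path -- assumptions noted above.
import faiss
import numpy as np
from gpt_index import GPTListIndex
from gpt_index.readers import FaissReader

# Stand-in for a Faiss index the user already populated, plus a mapping
# from vector ids back to the original text chunks.
d = 3  # toy embedding dimension for illustration
existing_index = faiss.IndexFlatL2(d)
existing_index.add(np.random.rand(10, d).astype("float32"))
id_to_text_map = {i: f"text chunk {i}" for i in range(10)}

# Retrieve the top-k nearest chunks for one or more query vectors.
reader = FaissReader(existing_index)
query_vectors = np.random.rand(1, d).astype("float32")
documents = reader.load_data(query=query_vectors, id_to_text_map=id_to_text_map, k=4)

# Build any GPT Index structure (e.g. list or tree) over the retrieved docs.
index = GPTListIndex(documents)
response = index.query("Summarize the retrieved passages.")
```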

Collaborator Author

this is something where a diagram absolutely would help!

from gpt_index.schema import Document


class WeaviateReader(BaseReader):
Collaborator

would it simplify things if we made one VectorDbReader base class that the faiss, pinecone, and weaviate readers all inherited from? i'm wondering if there's enough shared logic here to do that.

since people have been asking for vector db support, this might save us time in the future if we have to add more vector db readers

Collaborator Author

Good q. Tbh I thought about it, and the interfaces between these three are actually quite different; each one has different required args at load and query time. But definitely something to think about as I add more abstractions!
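
For concreteness, one hypothetical shape such a shared base could take (the class and method below are invented for illustration, not from this PR); note that the per-store required arguments still end up as free-form kwargs, which is the point about the interfaces differing:

```python
# Hypothetical sketch -- VectorStoreReader is invented for illustration
# and is not part of this PR.
from abc import ABC, abstractmethod
from typing import Any, List

from gpt_index.schema import Document


class VectorStoreReader(ABC):
    """Hypothetical shared base class for vector store readers."""

    @abstractmethod
    def load_data(self, **load_kwargs: Any) -> List[Document]:
        """Load documents from the vector store.

        Required kwargs differ per store, e.g. a Weaviate class name and
        properties vs. Pinecone/Faiss query vectors plus an id-to-text map.
        """
```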

@jerryjliu jerryjliu merged commit d421afa into main Dec 19, 2022
@jerryjliu jerryjliu deleted the jerry/add_weaviate branch December 19, 2022 04:16
viveksilimkhan1 pushed a commit to viveksilimkhan1/llama_index that referenced this pull request Oct 30, 2023