Vector Databases

Overview

A vector database is a type of database designed to store and query vector embeddings efficiently. Vector embeddings are numerical representations of data, such as text, images, or other types of information, transformed into a high-dimensional space by an embedding model. These databases are particularly useful for tasks involving similarity search, clustering, and other operations on high-dimensional data.

Key Concepts

Embedding Model: A machine learning model that transforms raw data (e.g., text, images) into vector embeddings. These vectors capture the semantic meaning of the data in a numerical form that can be processed by the vector database.
Vector Embeddings: High-dimensional vectors that represent the semantic meaning of the input data. These embeddings enable efficient similarity searches, as similar items will have similar vectors.
Indexing: The process of organizing vector embeddings in a way that allows for efficient querying. Various indexing techniques (e.g., KD-trees, HNSW) can be used to speed up search operations.

Diagram

The diagram illustrates the workflow of how data is processed and stored in a vector database:

Raw Text Inputs: Various texts (Text 1 to Text 5) are fed into the system.
Embedding Model: Each text is processed by an embedding model, which converts the text into a vector embedding.
Vector Embeddings: The embeddings for each text are displayed as lists of numerical values (e.g., [4.6, 6.1, 9.1, 7.2, ...]).
Indexing: Each vector embedding is indexed, making it easier to search and retrieve similar vectors.
Vector Database: The indexed vectors are stored in a vector database. This database allows for efficient querying and retrieval of vectors based on similarity.

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
LICENSE		LICENSE
README.md		README.md
diagram.png		diagram.png
postgres_as_a_vectordb.ipynb		postgres_as_a_vectordb.ipynb
vectordb_chroma.ipynb		vectordb_chroma.ipynb
vectordb_pinecone.ipynb		vectordb_pinecone.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vector Databases

Overview

Key Concepts

Diagram

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Vector Databases

Overview

Key Concepts

Diagram

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages