

## 🆓 Free / Open-Source Vector DBs

| Vector DB        | Type | Strengths | Limitations |
|------------------|------|-----------|-------------|
| **FAISS (Meta)** | Library | Fast, mature, supports CPU/GPU, great for prototyping | Persistence is manual (you save and load index files yourself, e.g. with `write_index` / `read_index`); no built-in distributed scaling; a library rather than a full DB. |
| **Annoy (Spotify)** | Library | Lightweight, great for approximate nearest neighbor search | Index is immutable once built, so adding items means rebuilding it; not ideal for dynamic datasets. |
| **Weaviate (OSS edition)** | Full DB | Rich features: hybrid search, schema, modules for text/image/audio | Free to self-host but heavier to set up; some modules call external model APIs that may be paid. |
| **Milvus** | Full DB | Scalable, distributed, supports billions of vectors | Requires cluster setup for scale; more ops overhead than FAISS. |
| **Qdrant** | Full DB | Rust-based, fast, user-friendly API, integrates well with LangChain | Managed cloud has paid tiers (self-hosting is free); fewer enterprise integrations than Pinecone. |
| **Chroma** | Lightweight DB | Simple, Python-native, great for small RAG projects | Not optimized for huge datasets; limited scaling/distributed features. |
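
Every tool in this table answers the same basic query: given a query vector, return the stored vectors closest to it. Below is a minimal pure-Python sketch of the exact (brute-force) version of that query, which libraries like FAISS and Annoy speed up with specialised index structures. The function names and the toy 3-dimensional vectors are made up for illustration and are not any library's API.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # convention chosen for this sketch: zero vectors match nothing
    return dot / (norm_a * norm_b)


def top_k(query, vectors, k=2):
    """Return the ids of the k stored vectors most similar to `query`.

    `vectors` maps an id to its embedding. Every stored vector is scored,
    so the cost grows linearly with the size of the collection.
    """
    scored = [(cosine_similarity(query, vec), doc_id) for doc_id, vec in vectors.items()]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [doc_id for _, doc_id in scored[:k]]


# Toy "embeddings" (hypothetical values, not produced by a real model).
store = {
    "doc_a": [1.0, 0.0, 0.0],
    "doc_b": [0.9, 0.1, 0.0],
    "doc_c": [0.0, 1.0, 0.0],
}
nearest = top_k([1.0, 0.0, 0.0], store, k=2)
print(nearest)
```

This is fine for a few thousand vectors. Beyond that, the linear scan is the reason to reach for an approximate index or a full vector DB.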

---

## 💰 Paid / Commercial Vector DBs

| Vector DB        | Type | Strengths | Limitations |
|------------------|------|-----------|-------------|
| **Pinecone** | Cloud DB | Fully managed, scalable, easy API, strong LangChain support | Paid usage scales with vector count and queries; no free self-hosting. |
| **Weaviate Cloud** | Managed DB | Same features as OSS + enterprise support | Subscription cost; vendor lock-in. |
| **Qdrant Cloud** | Managed DB | Hosted version with scaling and monitoring | Paid tiers; free tier limited in size. |
| **Zilliz Cloud (managed Milvus)** | Managed DB | Enterprise-grade scaling, monitoring, support | Paid; more complex pricing. |
| **Redis Enterprise (with vector search)** | General DB + vector search | Combines caching + vector search; enterprise-ready | Paid license; heavier infra if only vectors are needed. |
| **Elasticsearch + vector search (Elastic Cloud)** | General search engine with vector support | Combines keyword + vector search | Paid for cloud; heavier setup compared to pure vector DBs. |
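
Since managed pricing tends to track how much data you store, a quick size estimate is useful before comparing plans. The sketch below computes only the raw vector payload, assuming float32 values (4 bytes each) and a hypothetical 768-dimensional embedding model. It ignores index overhead, metadata, and replicas, and it does not model any vendor's actual pricing.

```python
def raw_vector_bytes(num_vectors, dimensions, bytes_per_value=4):
    """Bytes needed for the raw vectors alone (float32 by default)."""
    return num_vectors * dimensions * bytes_per_value


def to_gib(num_bytes):
    """Convert a byte count to gibibytes."""
    return num_bytes / (1024 ** 3)


# 768 dimensions is an assumed embedding size; substitute your model's value.
for n in (100_000, 10_000_000, 1_000_000_000):
    size = raw_vector_bytes(n, dimensions=768)
    print(f"{n:>13,} vectors x 768 dims ~ {to_gib(size):,.2f} GiB")
```

Real deployments need more than this figure, so treat it as a lower bound when sizing a plan or a self-hosted machine.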

---

## ⚡ Key Takeaways
- **Free / OSS** → FAISS, Annoy, and Chroma are great for small projects or prototyping. Weaviate, Milvus, and Qdrant scale better but need infrastructure setup.
- **Paid / Cloud** → Pinecone, Qdrant Cloud, and Weaviate Cloud are the easiest to use and fully managed, but cost grows with dataset size.
- **Hybrid search** (keyword + vector) → Weaviate, Elasticsearch, and Qdrant support this natively.
- **Charts & tables in PDFs** → Once their content is extracted and embedded, the embeddings are stored in one of these DBs and retrieved at query time in RAG.
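
Hybrid search has to merge a keyword ranking and a vector ranking into one result list. One widely used merging rule is reciprocal rank fusion (RRF), sketched below in plain Python. This is an illustration of the idea, not the internal implementation of Weaviate, Elasticsearch, or Qdrant. The document ids are made up, and `k=60` is an assumed constant (a commonly used default for RRF).

```python
def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of ids into a single ranked list.

    Each id earns 1 / (k + rank) for every list it appears in, with rank
    starting at 1, so ids ranked highly by several lists rise to the top.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties are broken by id to keep output stable.
    return sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))


# Hypothetical results from a keyword search and a vector search.
keyword_hits = ["doc_b", "doc_a", "doc_d"]
vector_hits = ["doc_a", "doc_c", "doc_b"]
fused = reciprocal_rank_fusion([keyword_hits, vector_hits])
print(fused)
```

Because RRF uses only ranks, it avoids having to put keyword scores and vector distances on a common scale.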

---

👉 For your **RAG pipeline**, a common free stack is:
- **Chroma or Qdrant (self-hosted)** for embedding storage.
- **FAISS** if you just need fast local similarity search.  

A common paid stack is:  
- **Pinecone** for managed scaling + monitoring.  
- **Weaviate Cloud** if you want hybrid search with enterprise support.  


