Commit 17427fc

docs: per-backend guides for Postgres, Qdrant, and Pinecone vector stores
Three new guides covering setup, configuration, and troubleshooting:

- POSTGRES_BACKEND.md: pgvector setup, hybrid search (HNSW + tsvector RRF), multi-tenancy, cloud providers
- QDRANT_BACKEND.md: Docker/Cloud setup, BM25 hybrid search, scaling (sharding, quantization), sidecar SQLite
- PINECONE_BACKEND.md: namespace isolation, metadata filtering, limitations, migration to self-hosted, cost comparison
1 parent a7191ae commit 17427fc

3 files changed

Lines changed: 512 additions & 0 deletions

docs/PINECONE_BACKEND.md

Lines changed: 153 additions & 0 deletions
# Pinecone Backend

The Pinecone backend stores embeddings in [Pinecone](https://www.pinecone.io/), a fully managed vector database. This is the simplest backend to set up — no infrastructure to manage — but it has limitations compared to self-hosted options.

## Prerequisites

| Requirement | Notes |
|---|---|
| Pinecone account | Free tier available at [pinecone.io](https://www.pinecone.io/) |
| API key | Found in the Pinecone console under "API Keys" |
| Index created | Create an index in the console or via the Pinecone API |
| Node.js 18+ | Uses native `fetch` (no SDK dependency) |
## Configuration

```typescript
import { PineconeVectorStore } from '@framers/agentos/rag/implementations/vector_stores/PineconeVectorStore';

const store = new PineconeVectorStore({
  id: 'my-pinecone',
  type: 'pinecone',
  apiKey: process.env.PINECONE_API_KEY!,
  indexHost: 'https://my-index-abc123.svc.aped-1234.pinecone.io',
  namespace: 'agent-default',
  defaultDimension: 1536,
});

await store.initialize();
```

### Configuration options

| Option | Type | Default | Description |
|---|---|---|---|
| `apiKey` | `string` | **required** | Pinecone API key |
| `indexHost` | `string` | **required** | Data plane URL for your index (from the Pinecone console) |
| `namespace` | `string` | `''` | Default namespace; collections map to namespaces |
| `defaultDimension` | `number` | `1536` | Embedding dimensions (must match the index) |

The `indexHost` is the **data plane** endpoint for a specific index — not the control plane URL. Find it in the Pinecone console under your index details. It looks like `https://my-index-abc123.svc.aped-1234.pinecone.io`.
## Namespace-based collection isolation

Pinecone namespaces are used as "collections". Each namespace is fully isolated within the same index:

```typescript
// Agent A's memories
await store.upsert('agent-alice', documents);

// Agent B's memories — completely separate namespace
await store.upsert('agent-bob', documents);

// Query only Agent A's namespace
await store.query('agent-alice', embedding, { topK: 10 });
```

Namespaces are created implicitly on first upsert. `createCollection()` is a no-op.
## Metadata filtering

Pinecone supports MongoDB-style metadata filter operators. AgentOS translates its unified `MetadataFilter` format to Pinecone's native syntax:

```typescript
const results = await store.query('my-namespace', embedding, {
  topK: 10,
  filter: {
    type: { $eq: 'semantic' },              // Equality
    importance: { $gte: 0.5 },              // Range
    tags: { $in: ['project', 'decision'] }, // Set membership
  },
});
```

### Supported operators

| Operator | Description | Example |
|---|---|---|
| `$eq` | Equal to | `{ status: { $eq: 'active' } }` |
| `$ne` | Not equal to | `{ status: { $ne: 'deleted' } }` |
| `$gt`, `$gte` | Greater than (or equal) | `{ score: { $gt: 0.8 } }` |
| `$lt`, `$lte` | Less than (or equal) | `{ age: { $lt: 30 } }` |
| `$in` | In set | `{ type: { $in: ['a', 'b'] } }` |
| `$nin` | Not in set | `{ type: { $nin: ['x'] } }` |
| `$exists` | Field exists | `{ tags: { $exists: true } }` |

Metadata values must be strings, numbers, booleans, or string arrays. Complex objects are JSON-stringified before storage.
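The value coercion described above can be sketched as a small helper. This is a hypothetical illustration — `sanitizeMetadata` is not part of the AgentOS API, and the real internals may differ:

```typescript
// Sketch of the metadata coercion rule: scalars and string arrays pass
// through unchanged; anything else is JSON-stringified so Pinecone
// will accept it. Hypothetical helper, not the actual AgentOS code.
type PineconeMeta = string | number | boolean | string[];

function sanitizeMetadata(
  metadata: Record<string, unknown>,
): Record<string, PineconeMeta> {
  const out: Record<string, PineconeMeta> = {};
  for (const [key, value] of Object.entries(metadata)) {
    if (
      typeof value === 'string' ||
      typeof value === 'number' ||
      typeof value === 'boolean'
    ) {
      out[key] = value;
    } else if (Array.isArray(value) && value.every((v) => typeof v === 'string')) {
      out[key] = value as string[];
    } else {
      // Complex objects (nested records, mixed arrays) become JSON strings.
      out[key] = JSON.stringify(value);
    }
  }
  return out;
}
```

Note that stringified objects can only be matched with `$eq` on the exact JSON string, so flatten any field you intend to filter on.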
## Limitations

### No hybrid search

Pinecone requires a separate sparse encoder (e.g., SPLADE) for hybrid search. The AgentOS `hybridSearch()` method falls back to dense-only search on Pinecone. For true hybrid search, use the Postgres or Qdrant backends.

### No knowledge graph

There is no sidecar storage for knowledge graph data. If you enable `graph: true` in `MemoryConfig` with the Pinecone backend, graph data is not persisted.

### Not self-hostable

Pinecone is a managed service only. You cannot run it on your own infrastructure. If self-hosting is a requirement, use Qdrant or Postgres.

### Batch size limit

Pinecone limits upserts to 100 vectors per request. AgentOS handles this automatically by splitting batches, but large ingestion jobs will make many sequential API calls.
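The splitting behavior can be approximated with a small helper — a sketch of the idea, not the actual AgentOS implementation:

```typescript
// Illustrative sketch of the batch splitting described above. Splits an
// array into chunks of at most `size` items so each upsert stays under
// the per-request vector limit. Hypothetical helper for illustration.
function chunk<T>(items: T[], size: number): T[][] {
  const batches: T[][] = [];
  for (let i = 0; i < items.length; i += size) {
    batches.push(items.slice(i, i + size));
  }
  return batches;
}

// 250 documents would become 3 sequential upserts of 100, 100, and 50:
// for (const batch of chunk(documents, 100)) {
//   await store.upsert('my-namespace', batch);
// }
```

Because the calls are sequential, ingestion time grows linearly with corpus size; budget accordingly for large jobs.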
### No `deleteAll` count

`delete({ deleteAll: true })` returns `deletedCount: -1` because Pinecone's API does not report how many vectors were deleted in a bulk operation.

## Migrating from Pinecone to self-hosted backends

Use the AgentOS migration engine to move data from Pinecone to Postgres or Qdrant:

```typescript
import { MigrationEngine } from '@framers/agentos/rag/migration/MigrationEngine';

await MigrationEngine.migrate({
  from: {
    type: 'pinecone',
    // PineconeSourceAdapter uses indexHost + apiKey + namespace
    url: 'https://my-index-abc123.svc.aped-1234.pinecone.io',
    apiKey: process.env.PINECONE_API_KEY!,
  },
  to: {
    type: 'postgres',
    connectionString: 'postgresql://postgres:wunderland@localhost:5432/agent_memory',
  },
  batchSize: 100,
  onProgress: (done, total, table) => {
    console.log(`[${table}] ${done}/${total}`);
  },
});
```

The migration reads vectors via Pinecone's `list` + `fetch` APIs and writes them to the target backend. Non-vector data (knowledge graph, conversations) is not stored in Pinecone and will not be migrated.
## Cost comparison

| Tier | Vectors | Monthly cost | Notes |
|---|---|---|---|
| **Starter (free)** | 100K | $0 | 1 index, 1 project, community support |
| **Standard** | 1M+ | ~$70+ | Multiple indexes, backup, 99.95% SLA |
| **Enterprise** | 10M+ | Custom | Dedicated infra, HIPAA, SOC2 |

Self-hosted alternatives for comparison:

| Backend | Vectors | Monthly cost | Notes |
|---|---|---|---|
| **Postgres + pgvector** | 10M+ | ~$15 (Neon free) to ~$50 (RDS) | Full SQL, hybrid search included |
| **Qdrant (Docker)** | 10M+ | Cost of your VM (~$5-20) | Built-in BM25, quantization |
| **Qdrant Cloud** | 1M+ | ~$25+ | Managed Qdrant, auto-scaling |

Pinecone is the easiest to start with but becomes expensive at scale. For production agents processing large knowledge bases, self-hosted Postgres or Qdrant offer better cost efficiency and more features (hybrid search, knowledge graph).

docs/POSTGRES_BACKEND.md

Lines changed: 173 additions & 0 deletions
# Postgres + pgvector Backend

The Postgres backend stores embeddings, metadata, and full-text content in a single relational database using the [pgvector](https://github.com/pgvector/pgvector) extension. This gives you ACID transactions, hybrid search (dense vectors + full-text ranking in one query), and JSONB metadata filtering — all without a separate vector service.

## Prerequisites

| Requirement | Minimum version |
|---|---|
| PostgreSQL | 14+ (15+ recommended for the `HNSW` index type) |
| pgvector extension | 0.5.0+ (`CREATE EXTENSION vector`) |
| Node.js | 18+ (uses the `pg` npm package) |
## Quick start — Docker

```bash
docker run -d \
  --name agentos-pgvector \
  -e POSTGRES_PASSWORD=wunderland \
  -p 5432:5432 \
  pgvector/pgvector:pg16

# Verify
psql postgresql://postgres:wunderland@localhost:5432/postgres \
  -c "CREATE EXTENSION IF NOT EXISTS vector; SELECT extversion FROM pg_extension WHERE extname='vector';"
```

The `pgvector/pgvector` image ships with the extension pre-installed. No manual compilation needed.
## Manual setup

If you are using an existing Postgres instance (self-hosted or managed), install pgvector manually:

```sql
-- Run as a superuser or a user with CREATE EXTENSION privilege.
CREATE EXTENSION IF NOT EXISTS vector;
```

AgentOS creates its own tables on first use. The schema looks like:

```sql
CREATE TABLE IF NOT EXISTS "<prefix>my_collection" (
  id TEXT PRIMARY KEY,
  embedding vector(1536),  -- pgvector column
  metadata_json JSONB,     -- GIN-indexed for filtering
  text_content TEXT,       -- raw text for hybrid search
  tsv tsvector GENERATED ALWAYS AS (to_tsvector('english', COALESCE(text_content, ''))) STORED,
  created_at BIGINT NOT NULL,
  updated_at BIGINT
);

-- Indexes created automatically:
-- 1. HNSW index for approximate nearest neighbor search
-- 2. GIN index on metadata_json for JSONB filtering
-- 3. GIN index on tsv for full-text search
```
## Configuration

```typescript
import { PostgresVectorStore } from '@framers/agentos/rag/implementations/vector_stores/PostgresVectorStore';

const store = new PostgresVectorStore({
  id: 'my-pg-store',
  type: 'postgres',
  connectionString: 'postgresql://postgres:wunderland@localhost:5432/agent_memory',
  poolSize: 10,               // Connection pool size (default: 10)
  defaultDimension: 1536,     // Default embedding dimensions (default: 1536)
  similarityMetric: 'cosine', // 'cosine' | 'euclidean' | 'dotproduct'
  tablePrefix: 'agent1_',     // Optional prefix for multi-tenancy
});

await store.initialize();
```

### Configuration options

| Option | Type | Default | Description |
|---|---|---|---|
| `connectionString` | `string` | **required** | Standard Postgres connection URI |
| `poolSize` | `number` | `10` | Max concurrent connections in the pool |
| `defaultDimension` | `number` | `1536` | Embedding vector dimensions for new collections |
| `similarityMetric` | `string` | `'cosine'` | Distance function: `cosine`, `euclidean`, or `dotproduct` |
| `tablePrefix` | `string` | `''` | Table name prefix for multi-tenant deployments |
## Hybrid search

The Postgres backend is the only backend that supports true **single-query hybrid search**: pgvector HNSW for dense vectors and PostgreSQL tsvector for lexical matching, fused with Reciprocal Rank Fusion (RRF) in a single SQL statement.

```typescript
const results = await store.hybridSearch(
  'my_collection',
  queryEmbedding,
  'natural language query text',
  {
    topK: 10,
    rrfK: 60, // RRF constant (default: 60)
  },
);
```

How it works internally:

1. **Dense CTE**: finds top candidates by pgvector HNSW distance (`<=>` for cosine).
2. **Lexical CTE**: finds top candidates by `ts_rank()` against the `tsvector` column.
3. **Fusion CTE**: merges both result sets with `1/(k + rank_dense) + 1/(k + rank_lexical)`.
4. **Final join**: fetches full documents for the top fused results.

This avoids two separate queries and application-level fusion.
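The RRF formula in step 3 can be illustrated in isolation — a sketch of the scoring math only, not the SQL AgentOS actually emits:

```typescript
// Sketch of Reciprocal Rank Fusion as described in step 3 above. Given a
// document's 1-based rank in each result list (dense, lexical), the fused
// score is the sum of 1/(k + rank) terms. A document absent from a list
// contributes no term for it. Hypothetical helper for illustration.
function rrfScore(
  ranks: Array<number | undefined>, // rank in each result list, if present
  k = 60,                           // the rrfK constant (default 60)
): number {
  return ranks.reduce<number>(
    (sum, rank) => (rank === undefined ? sum : sum + 1 / (k + rank)),
    0,
  );
}

// A doc ranked #1 in both lists outscores a doc ranked #1 in only one:
// rrfScore([1, 1]) = 2/61, rrfScore([1, undefined]) = 1/61.
```

The large constant `k` dampens the influence of top ranks, so a document that appears in both lists at moderate rank can beat one that tops a single list.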
## Multi-tenancy via table prefixes

For SaaS deployments where each tenant needs isolated data:

```typescript
// Tenant A
const storeA = new PostgresVectorStore({
  // ...
  tablePrefix: 'tenant_a_',
});

// Tenant B
const storeB = new PostgresVectorStore({
  // ...
  tablePrefix: 'tenant_b_',
});
```

Each prefix creates a separate set of tables: `"tenant_a_my_collection"`, `"tenant_a__collections"`, etc. Alternatively, use Postgres schemas (`SET search_path`) for stronger isolation.
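Deriving prefixes from tenant identifiers can be sketched as follows — `tenantPrefix` and `tableNameFor` are hypothetical helpers, not part of AgentOS:

```typescript
// Hypothetical helpers following the prefix convention above. Tenant IDs
// are normalized to safe lowercase identifiers, and table names follow
// the "<prefix><collection>" pattern the schema section describes.
function tenantPrefix(tenantId: string): string {
  // e.g. "Tenant-A" -> "tenant_a_"
  return `${tenantId.toLowerCase().replace(/[^a-z0-9]+/g, '_')}_`;
}

function tableNameFor(tenantId: string, collection: string): string {
  return `${tenantPrefix(tenantId)}${collection}`;
}

// tableNameFor('Tenant-A', 'my_collection') === 'tenant_a_my_collection'
```

Normalizing the prefix up front avoids quoting surprises, since the prefix becomes part of a SQL identifier.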
## Cloud providers

Any managed Postgres with pgvector works. Just set the connection string:

| Provider | Connection string example |
|---|---|
| **Neon** | `postgresql://user:pass@ep-cool-grass-123456.us-east-2.aws.neon.tech/neondb?sslmode=require` |
| **Supabase** | `postgresql://postgres:pass@db.xyzabc.supabase.co:5432/postgres` |
| **AWS RDS** | `postgresql://postgres:pass@mydb.cluster-xyz.us-east-1.rds.amazonaws.com:5432/mydb` |
| **Google Cloud SQL** | `postgresql://postgres:pass@/mydb?host=/cloudsql/project:region:instance` |
| **Azure Flexible Server** | `postgresql://postgres:pass@myserver.postgres.database.azure.com:5432/mydb?sslmode=require` |

All of these support pgvector. Neon and Supabase have it pre-installed. For RDS, enable the `pgvector` extension in the parameter group.
## Troubleshooting

### `ERROR: could not open extension control file "vector"`

pgvector is not installed. On managed services, check that the extension is enabled in your database configuration. For self-hosted:

```bash
# Ubuntu/Debian
sudo apt install postgresql-16-pgvector

# macOS (Homebrew)
brew install pgvector
```

Then run `CREATE EXTENSION vector;` as a superuser.

### `ERROR: different vector dimensions`

You changed `defaultDimension` after creating a collection. pgvector enforces dimension constraints at the column level. Drop and recreate the collection, or create a new collection with the correct dimension.

### Connection refused / timeout

- Verify the connection string host, port, and credentials.
- Check that `pg_hba.conf` allows connections from your IP.
- For Docker: ensure `-p 5432:5432` is set and the container is running.
- For cloud: check firewall / security group rules.

### Pool exhaustion (`too many clients already`)

Increase `poolSize` in the config, or reduce concurrent usage. The default of 10 is usually sufficient for single-agent deployments. Multi-agent setups may need 20-50.
