Embedding Services

**Referenced Files in This Document** - [service.ts](file://src/services/embedding/service.ts) - [providers.ts](file://src/services/embedding/providers.ts) - [config.ts](file://src/services/embedding/config.ts) - [types.ts](file://src/services/embedding/types.ts) - [bm25-tokenizer.ts](file://src/services/embedding/bm25-tokenizer.ts) - [health.ts](file://src/services/embedding/health.ts) - [audit.ts](file://src/services/embedding/audit.ts) - [config.ts](file://src/config.ts) - [embedding-metrics.ts](file://src/services/metrics/embedding-metrics.ts)

Introduction

This document describes the KAIROS MCP embedding services, focusing on the embedding provider architecture that supports multiple backends (OpenAI, TEI), configuration and model selection, batch processing, BM25 tokenizer for sparse vectors, and hybrid search readiness. It also covers service initialization, health monitoring, error handling, embedding quality assessment, provider fallback strategies, and practical configuration examples.

Project Structure

The embedding subsystem resides under src/services/embedding and integrates with configuration, metrics, and audit/logging utilities.

graph TB
subgraph "Embedding Service Layer"
SVC["service.ts<br/>EmbeddingService"]
CFG["config.ts<br/>Dimensions, Endpoints"]
TYPES["types.ts<br/>EmbeddingResult, BatchEmbeddingResult"]
HEALTH["health.ts<br/>runEmbeddingHealthCheck"]
AUDIT["audit.ts<br/>detectEmbeddingAnomalies"]
BM25["bm25-tokenizer.ts<br/>tokenizeToSparse"]
end
subgraph "Providers"
PROV["providers.ts<br/>postEmbeddings*"]
end
subgraph "Configuration"
CONF["config.ts<br/>Environment Variables"]
end
subgraph "Metrics"
MET["embedding-metrics.ts<br/>Prometheus Counters/Histograms"]
end
SVC --> PROV
SVC --> CFG
SVC --> TYPES
SVC --> HEALTH
SVC --> AUDIT
BM25 --> SVC
PROV --> CONF
SVC --> MET

Diagram sources

service.ts:38-286
providers.ts:251-278
config.ts:1-40
types.ts:1-17
health.ts:16-119
audit.ts:94-157
bm25-tokenizer.ts:37-56
config.ts:67-74
embedding-metrics.ts:11-47

Section sources

service.ts:1-293
providers.ts:1-280
config.ts:1-40
types.ts:1-17
bm25-tokenizer.ts:1-57
health.ts:1-121
audit.ts:1-197
config.ts:67-74
embedding-metrics.ts:1-51

Core Components

EmbeddingService: Orchestrates single and batch embedding generation, provider selection, dimension probing, cosine similarity computation, memory embedding composition, health checks, and configuration reporting.
Providers: Encapsulate OpenAI and TEI embedding endpoints, request retries, error classification, and response normalization.
Config: Resolves endpoints, caches embedding dimension, and validates runtime expectations.
Types: Defines standardized result structures for single and batch embeddings.
BM25 Tokenizer: Produces sparse vectors for BM25-style retrieval compatible with Qdrant sparse vectors.
Health: Performs runtime health checks for configured providers.
Audit: Detects anomalies (latency, norm, dimension mismatch) and logs structured audit events.
Metrics: Exposes counters and histograms for embedding requests, durations, errors, vector sizes, and batch sizes.

Section sources

service.ts:38-286
providers.ts:77-278
config.ts:12-36
types.ts:1-17
bm25-tokenizer.ts:27-56
health.ts:16-119
audit.ts:94-157
embedding-metrics.ts:11-47

Architecture Overview

The embedding service selects a provider based on configuration and availability, normalizes responses, validates dimensions, and emits metrics and audit logs. Batch processing aggregates statistics and applies anomaly detection.

sequenceDiagram
participant Caller as "Caller"
participant ESvc as "EmbeddingService"
participant Prov as "postEmbeddings*"
participant OA as "OpenAI Endpoint"
participant TEI as "TEI Endpoint"
Caller->>ESvc : generateEmbedding(text)
ESvc->>ESvc : getProvider(), getModelName()
ESvc->>Prov : postEmbeddings(normalizedText)
alt Provider=OpenAI
Prov->>OA : POST /v1/embeddings
OA-->>Prov : embeddings[]
Prov-->>ESvc : embeddings[]
else Provider=TEI
Prov->>TEI : POST /v1/embeddings
TEI-->>Prov : embeddings[]
Prov-->>ESvc : embeddings[]
end
ESvc->>ESvc : detectEmbeddingAnomalies()
ESvc-->>Caller : EmbeddingResult

Diagram sources

service.ts:47-127
providers.ts:251-278
config.ts:5-10

Section sources

service.ts:47-127
providers.ts:251-278

Detailed Component Analysis