Provider Configuration & Selection

**Referenced Files in This Document** - [config.ts](file://src/config.ts) - [providers.ts](file://src/services/embedding/providers.ts) - [service.ts](file://src/services/embedding/service.ts) - [config.ts](file://src/services/embedding/config.ts) - [health.ts](file://src/services/embedding/health.ts) - [audit.ts](file://src/services/embedding/audit.ts) - [README.md](file://README.md) - [docs/install/prerequisites.md](file://docs/install/prerequisites.md)

Introduction

This document explains how the embedding provider configuration and selection mechanism works in the system. It covers automatic provider detection logic that chooses between OpenAI, TEI, and local providers based on environment variables, the EMBEDDING_PROVIDER preference setting and fallback behavior, configuration parameters, practical configuration examples, environment variable precedence rules, troubleshooting provider selection issues, provider-specific settings, model name resolution, and runtime provider switching capabilities.

Project Structure

The embedding subsystem is organized around three main modules:

Configuration parsing and defaults for environment variables
Provider selection and invocation logic
Service wrapper that exposes embedding generation and health checks

graph TB
subgraph "Embedding Subsystem"
CFG["src/config.ts<br/>Environment variable parsing"]
SVC["src/services/embedding/service.ts<br/>EmbeddingService"]
PRV["src/services/embedding/providers.ts<br/>Provider selection & calls"]
CONF["src/services/embedding/config.ts<br/>Provider endpoints & dimension cache"]
HLTH["src/services/embedding/health.ts<br/>Health checks"]
AUD["src/services/embedding/audit.ts<br/>Audit & anomaly detection"]
end
CFG --> PRV
CFG --> CONF
PRV --> SVC
CONF --> SVC
HLTH --> SVC
AUD --> SVC

Diagram sources

config.ts:67-74
providers.ts:1-280
service.ts:1-293
config.ts:1-40
health.ts:1-121
audit.ts:1-197

Section sources

config.ts:67-74
providers.ts:251-278
service.ts:38-284

Core Components

Environment configuration: centralizes all environment variable parsing and defaults for embedding-related settings.
Provider selection: encapsulates provider detection logic and invokes the appropriate provider implementation.
Service wrapper: exposes embedding generation APIs, dimension probing, health checks, and runtime provider switching.

Key configuration parameters:

OPENAI_API_KEY: OpenAI API key
OPENAI_EMBEDDING_MODEL: OpenAI embedding model name
OPENAI_API_URL: Base URL for OpenAI-compatible endpoints (e.g., Azure or Ollama)
EMBEDDING_PROVIDER: Provider preference ('auto', 'openai', 'tei')
TEI_BASE_URL: Base URL for TEI endpoint
TEI_MODEL: TEI model name
TEI_API_KEY: Optional API key header for TEI

Section sources

config.ts:67-74
providers.ts:251-278
service.ts:258-265

Architecture Overview

The provider selection and invocation flow is centralized in the provider module and consumed by the service wrapper. The service also manages dimension probing and runtime provider switching.

sequenceDiagram
participant Client as "Caller"
participant Service as "EmbeddingService"
participant Providers as "Providers Module"
participant OpenAI as "OpenAI Endpoint"
participant TEI as "TEI Endpoint"
Client->>Service : generateEmbedding()/generateBatchEmbeddings()
Service->>Service : getProvider()
Service->>Providers : postEmbeddings(input)
alt EMBEDDING_PROVIDER == "openai"
Providers->>OpenAI : POST /v1/embeddings
OpenAI-->>Providers : embeddings[]
Providers-->>Service : embeddings[]
else EMBEDDING_PROVIDER == "tei"
Providers->>TEI : POST /v1/embeddings or model-specific
TEI-->>Providers : embeddings[]
Providers-->>Service : embeddings[]
else auto-detection
Providers->>OpenAI : Try OpenAI first
alt OpenAI fails
Providers->>TEI : Fallback to TEI
TEI-->>Providers : embeddings[]
else OpenAI succeeds
OpenAI-->>Providers : embeddings[]
end
Providers-->>Service : embeddings[]
end
Service->>Service : validate dimension & anomalies
Service-->>Client : embedding(s)

Diagram sources

providers.ts:251-278
service.ts:47-221
config.ts:5-10

Detailed Component Analysis

Provider Detection and Selection Logic

The selection logic prioritizes explicit preference over auto-detection:

If EMBEDDING_PROVIDER is set to 'openai' or 'tei', the system uses that provider exclusively.
In auto mode, the system prefers OpenAI when both OPENAI_API_KEY and OPENAI_EMBEDDING_MODEL are present; otherwise it falls back to TEI if TEI_BASE_URL and TEI_MODEL are configured.
If both providers are unavailable, the system returns a local provider type for compatibility.

flowchart TD
Start(["postEmbeddings()"]) --> Pref["Read EMBEDDING_PROVIDER"]
Pref --> IsOpenAI{"Pref == 'openai'?"}
IsOpenAI --> |Yes| CheckOpenAI["Validate OPENAI_API_KEY + MODEL"]
CheckOpenAI --> |OK| CallOpenAI["Call OpenAI"]
CheckOpenAI --> |Missing| ThrowOA["Throw error"]
IsOpenAI --> |No| IsTEI{"Pref == 'tei'?"}
IsTEI --> |Yes| CheckTEI["Validate TEI_BASE_URL + MODEL"]
CheckTEI --> |OK| CallTEI["Call TEI"]
CheckTEI --> |Missing| ThrowTEI["Throw error"]
IsTEI --> |No| Auto["Auto-detection"]
Auto --> HasOpenAI{"OPENAI_API_KEY + MODEL?"}
HasOpenAI --> |Yes| TryOpenAI["Try OpenAI"]
TryOpenAI --> OA_OK{"OpenAI OK?"}
OA_OK --> |Yes| ReturnOA["Return OpenAI embeddings"]
OA_OK --> |No| HasTEI{"TEI_BASE_URL + MODEL?"}
HasTEI --> |Yes| CallTEI
HasTEI --> |No| ThrowBoth["Throw no provider configured"]
Auto --> |No| HasTEI2{"TEI_BASE_URL + MODEL?"}
HasTEI2 --> |Yes| CallTEI
HasTEI2 --> |No| ThrowBoth

Diagram sources

providers.ts:251-278
service.ts:258-265

Section sources

providers.ts:251-278
service.ts:258-265

Configuration Parameters and Defaults

OPENAI_API_KEY: Required for OpenAI; empty disables OpenAI usage.
OPENAI_EMBEDDING_MODEL: Defaults to a common small model when unset.
OPENAI_API_URL: Defaults to the public OpenAI base URL; trailing slash is normalized.
EMBEDDING_PROVIDER: Defaults to 'auto'.
TEI_BASE_URL: Base URL for TEI; endpoint is constructed as BASE_URL + '/v1/embeddings' if provided.
TEI_MODEL: TEI model name; defaults to a commonly used model when unset.
TEI_API_KEY: Optional; if present, sent as 'x-api-key' header.

Provider endpoint construction:

OpenAI: OPENAI_API_URL + '/v1/embeddings'
TEI: TEI_BASE_URL + '/v1/embeddings' if TEI_BASE_URL ends with '/', the trailing slash is removed before appending the path.

Section sources

config.ts:67-74
config.ts:5-10

Practical Configuration Examples

OpenAI
- Set OPENAI_API_KEY and optionally OPENAI_EMBEDDING_MODEL.
- Example values:
  - OPENAI_API_KEY=sk-...
  - OPENAI_EMBEDDING_MODEL=text-embedding-3-small
Ollama (OpenAI-compatible)
- Set OPENAI_API_URL to the base URL (without /v1), OPENAI_EMBEDDING_MODEL to the local model, and OPENAI_API_KEY to 'ollama'.
- Example values:
  - OPENAI_API_URL=http://host.docker.internal:11434
  - OPENAI_EMBEDDING_MODEL=nomic-embed-text
  - OPENAI_API_KEY=ollama
TEI
- Set TEI_BASE_URL and optionally TEI_MODEL; TEI_API_KEY is optional.
- Example values:
  - TEI_BASE_URL=http://your-tei:8080
  - TEI_MODEL=Alibaba-NLP/gte-large-en-v1.5
  - TEI_API_KEY=secret
Preference override
- Set EMBEDDING_PROVIDER to 'openai' or 'tei' to force a specific provider regardless of environment variables.

Section sources

docs/install/prerequisites.md:101-107
docs/install/prerequisites.md:144-150
docs/install/prerequisites.md:177-182
config.ts:67-74

Environment Variable Precedence Rules

Explicit preference takes precedence:
- If EMBEDDING_PROVIDER is set to 'openai' or 'tei', the system uses that provider regardless of other variables.
Auto-detection rules:
- Prefer OpenAI when OPENAI_API_KEY and OPENAI_EMBEDDING_MODEL are both present.
- Otherwise, prefer TEI when TEI_BASE_URL and TEI_MODEL are both present.
- If neither is available, the system falls back to a local provider type for compatibility.
Provider-specific validation:
- OpenAI: requires both OPENAI_API_KEY and OPENAI_EMBEDDING_MODEL.
- TEI: requires both TEI_BASE_URL and TEI_MODEL.

Section sources

providers.ts:251-278
service.ts:258-265

Model Name Resolution

The service resolves the effective model name based on the selected provider:
- For TEI, the model is taken from TEI_MODEL.
- For OpenAI, the model is taken from OPENAI_EMBEDDING_MODEL.
The service also validates that the returned embedding dimension matches the previously resolved dimension, ensuring consistency across provider switches.

Section sources

service.ts:43-45
config.ts:12-36

Runtime Provider Switching Capabilities

The service exposes a runtime method to determine the current provider:
- getProvider() evaluates EMBEDDING_PROVIDER and environment variables to decide between 'openai', 'tei', or 'local'.
The service also exposes a configuration inspection method:
- getConfig() returns the current provider, model, dimension, and configuration status for diagnostics.

Section sources

service.ts:258-283

Health Checks and Audit

Health checks:
- runEmbeddingHealthCheck() validates the current provider configuration and performs a minimal embedding call to verify operation.
- It returns a structured result indicating whether the provider is healthy and a human-readable message.
Audit and anomaly detection:
- Embedding requests are audited with details such as provider, model, input counts, output dimensions, and latency.
- Anomaly detection flags unusual latencies, vector norms, and dimension mismatches.

Section sources

health.ts:16-119
audit.ts:60-157

Dependency Analysis

The embedding subsystem depends on configuration parsing and exposes a clean service interface. The provider module encapsulates network calls and retry logic, while the service module orchestrates dimension probing, validation, and diagnostics.

graph TB
CFG["src/config.ts"]
CONF["src/services/embedding/config.ts"]
PRV["src/services/embedding/providers.ts"]
SVC["src/services/embedding/service.ts"]
HLTH["src/services/embedding/health.ts"]
AUD["src/services/embedding/audit.ts"]
CFG --> PRV
CFG --> CONF
PRV --> SVC
CONF --> SVC
HLTH --> SVC
AUD --> SVC

Diagram sources

config.ts:67-74
providers.ts:1-280
service.ts:1-293
health.ts:1-121
audit.ts:1-197

Section sources

config.ts:67-74
providers.ts:1-280
service.ts:1-293

Performance Considerations

Retry strategy: The provider module implements a bounded retry mechanism for transient network errors and specific HTTP statuses (rate limits, gateway timeouts).
Latency monitoring: Embedding requests are timed and logged as part of audit events.
Vector size tracking: The service observes vector sizes for telemetry and anomaly detection.
Dimension caching: The system caches the resolved embedding dimension after the first successful embedding call to avoid repeated probing.

Section sources

providers.ts:31-47
service.ts:75-95
audit.ts:105-157
config.ts:12-36

Troubleshooting Guide

Common issues and resolutions:

No embedding provider configured
- Symptom: Error indicating that no provider is configured.
- Resolution: Set OPENAI_API_KEY and OPENAI_EMBEDDING_MODEL for OpenAI, or TEI_BASE_URL and TEI_MODEL for TEI.
OpenAI authentication failure
- Symptom: Authentication error or 401.
- Resolution: Verify OPENAI_API_KEY and ensure the key has permissions for embeddings.
OpenAI rate limit or gateway errors
- Symptom: 429, 502, 503, or 504 responses.
- Resolution: The system retries transient errors; if persistent, reduce request rate or switch providers.
TEI authentication or rate limit
- Symptom: 401 or 429.
- Resolution: Verify TEI_API_KEY and endpoint availability; retry logic applies similarly.
Dimension mismatch
- Symptom: Embedding dimension mismatch error.
- Resolution: Ensure consistent model selection across the deployment; re-run dimension probing if switching models.
Health check failures
- Symptom: Health endpoint indicates unhealthy provider.
- Resolution: Use runEmbeddingHealthCheck() to diagnose; verify endpoint URLs, keys, and model names.

Section sources

providers.ts:77-175
providers.ts:177-249
health.ts:16-119
README.md:372-378

Conclusion

The embedding subsystem provides a robust, configurable, and observable mechanism for selecting and invoking embedding providers. By combining explicit preferences with sensible auto-detection, it ensures reliable operation across diverse environments. The service layer adds dimension validation, audit logging, and health checks to improve reliability and observability. Proper configuration of environment variables and adherence to the precedence rules will minimize provider selection issues and enable smooth runtime switching when needed.

Provider Configuration & Selection

Provider Configuration & Selection

Table of Contents

Introduction

Project Structure

Core Components

Architecture Overview

Detailed Component Analysis

Provider Detection and Selection Logic

Configuration Parameters and Defaults

Practical Configuration Examples

Environment Variable Precedence Rules

Model Name Resolution

Runtime Provider Switching Capabilities

Health Checks and Audit

Dependency Analysis

Performance Considerations

Troubleshooting Guide

Conclusion

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!