
feat(embedding): add litellm as embedding provider#853

Merged
ZaynJarvis merged 2 commits into volcengine:main from stubbi:feat/litellm-embedding-provider
Mar 22, 2026

Conversation

@stubbi
Contributor

@stubbi stubbi commented Mar 21, 2026

Summary

Adds LiteLLM as a new embedding provider, closing the gap between the VLM layer (which already supports litellm) and the embedding layer. This enables users to route embedding requests through OpenRouter, Ollama, vLLM, and any OpenAI-compatible endpoint via litellm's unified interface.

  • New file: openviking/models/embedder/litellm_embedders.py — LiteLLMDenseEmbedder class extending DenseEmbedderBase
  • Updated: embedding_config.py — added "litellm" to the provider validation list, factory registry, and docstrings
  • Updated: embedder/__init__.py — added conditional import (graceful fallback if litellm not installed)
  • New tests: tests/unit/test_litellm_embedder.py — 15 tests covering embed, batch embed, non-symmetric mode, factory integration, and config validation

Usage example

"embedding": {
  "dense": {
    "provider": "litellm",
    "model": "openai/text-embedding-3-small",
    "api_base": "https://openrouter.ai/api/v1",
    "api_key": "<openrouter-key>",
    "dimension": 1536
  }
}
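
For illustration, a provider like this would translate the config block above into call kwargs roughly as follows. This is a hypothetical sketch, not the PR's actual code; config_to_embedding_kwargs is an invented helper name.

```python
def config_to_embedding_kwargs(dense_cfg: dict) -> dict:
    """Hypothetical helper: flatten the "dense" config block into kwargs
    for an embedding call. Only keys actually present are forwarded; the
    "provider" key is routing information and is dropped."""
    kwargs = {"model": dense_cfg["model"]}
    for key in ("api_base", "api_key", "dimension"):
        if dense_cfg.get(key) is not None:
            kwargs[key] = dense_cfg[key]
    return kwargs

cfg = {
    "provider": "litellm",
    "model": "openai/text-embedding-3-small",
    "api_base": "https://openrouter.ai/api/v1",
    "api_key": "<openrouter-key>",
    "dimension": 1536,
}
kwargs = config_to_embedding_kwargs(cfg)
```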

Closes #847

Test plan

  • All 15 new unit tests pass
  • All existing embedding tests still pass (no regressions)
  • Manual test with OpenRouter endpoint
  • Manual test with local Ollama embeddings

🤖 Generated with Claude Code

Adds LiteLLM as a new embedding provider, bringing embedding parity with
the VLM layer which already supports litellm. This enables users to route
embedding requests through OpenRouter, Ollama, vLLM, and any other
OpenAI-compatible endpoint via litellm's unified interface.

Closes volcengine#847

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@CLAassistant

CLA assistant check
Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
You have signed the CLA already but the status is still pending? Let us recheck it.

@github-actions
Copy link

Failed to generate code suggestions for PR

Collaborator

@ZaynJarvis ZaynJarvis left a comment


Critical Issues Found

1. Module-level side effect on import (litellm_embedders.py:13)

os.environ.setdefault("LITELLM_LOCAL_MODEL_COST_MAP", "True")

This mutates the process environment at import time — not when the embedder is used. Since __init__.py imports this module (behind try/except), this fires as soon as the embedder package is loaded, even if the user never uses litellm. This can interfere with other litellm usage in the same process that intentionally sets this env var differently.

Fix: Move this inside LiteLLMDenseEmbedder.__init__ or into _build_kwargs.
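
A minimal sketch of this fix (class body abbreviated; the names mirror the PR but the code here is illustrative, not the actual diff):

```python
import os

class LiteLLMDenseEmbedder:
    """Sketch: env var set at construction time, not at module import."""

    def __init__(self, model: str, dimension: int):
        # Only mutate the environment when an embedder is actually created,
        # so merely importing the package leaves other litellm users in the
        # same process unaffected.
        os.environ.setdefault("LITELLM_LOCAL_MODEL_COST_MAP", "True")
        self.model = model
        self.dimension = dimension
```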

2. Probe API call during __init__ / silent fallback (litellm_embedders.py:89-93)

def _detect_dimension(self) -> int:
    try:
        result = self.embed("test")
        return len(result.dense_vector) if result.dense_vector else 1536
    except Exception:
        return 1536

When dimension=None, the constructor calls _detect_dimension() which calls self.embed("test"). This means:

  1. A billable API call the user didn't ask for, fired during object construction
  2. A network side effect inside __init__ — surprising and hard to test
  3. Silent fallback to 1536 on any exception (auth failure, network error, wrong model) — the user gets no indication their config is wrong

Fix: Require dimension as a mandatory parameter (other providers do this), or at minimum make the probe call lazy and log a warning on fallback instead of silently swallowing errors.

3. Missing None guard for LiteLLMDenseEmbedder in factory

The factory in embedding_config.py imports LiteLLMDenseEmbedder but doesn't check if it's None (which happens when litellm isn't installed). Other providers like Gemini have this guard. If a user configures provider: "litellm" without litellm installed, they'll get a confusing TypeError: 'NoneType' object is not callable instead of a clear error message.
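
The guard being asked for could look like this. A self-contained sketch: here LiteLLMDenseEmbedder is bound to None directly, simulating what the conditional import in embedder/__init__.py does when litellm is not installed; build_dense_embedder stands in for the real factory.

```python
# Simulates the conditional import: the name is bound to None when the
# optional litellm dependency is missing.
LiteLLMDenseEmbedder = None

def build_dense_embedder(provider: str, **kwargs):
    """Hypothetical factory sketch with the missing None guard added."""
    if provider == "litellm":
        if LiteLLMDenseEmbedder is None:
            # Clear, actionable error instead of
            # "TypeError: 'NoneType' object is not callable".
            raise ValueError(
                "LiteLLM is not installed. Install it with: pip install litellm"
            )
        return LiteLLMDenseEmbedder(**kwargs)
    raise ValueError(f"unknown embedding provider: {provider!r}")
```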

1. Move os.environ.setdefault from module-level to __init__ to avoid
   mutating the process environment on import
2. Require dimension as mandatory — removes the probe API call during
   construction that caused surprise billable requests and silent fallbacks
3. Add None guard for LiteLLMDenseEmbedder in factory to give a clear
   error when litellm is not installed

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>
@stubbi
Contributor Author

stubbi commented Mar 22, 2026

Thanks for the review @ZaynJarvis! All three issues have been addressed in the latest push:

  1. Module-level side effect — os.environ.setdefault("LITELLM_LOCAL_MODEL_COST_MAP", "True") moved from module scope into LiteLLMDenseEmbedder.__init__, so it only fires when an embedder is actually instantiated.

  2. Probe API call / silent fallback — Removed _detect_dimension() entirely. dimension is now required at both the config validation layer (EmbeddingModelConfig) and the embedder constructor, with clear error messages if missing.

  3. Missing None guard — The factory now checks if LiteLLMDenseEmbedder is None before attempting to instantiate, raising ValueError("LiteLLM is not installed. Install it with: pip install litellm") instead of a confusing TypeError.

Added 3 new tests covering the new validation paths (18 total, all passing).

Collaborator

@ZaynJarvis ZaynJarvis left a comment


lgtm

@ZaynJarvis ZaynJarvis merged commit 49a50fd into volcengine:main Mar 22, 2026
4 of 6 checks passed
@github-project-automation github-project-automation bot moved this from Backlog to Done in OpenViking project Mar 22, 2026

Successfully merging this pull request may close these issues.

Support litellm (or OpenRouter) as embedding provider
