ExEmbed

Elixir-native text embeddings via Ortex (ONNX Runtime) and Tokenizers, with a FastEmbed-compatible model registry backed by HuggingFace.

No Python. No PyTorch. Runs entirely inside the BEAM.

Features

Tier 1 — Raw ONNX pipeline: tokenize → infer → mean pool → L2 normalize
Tier 2 — Nx.Serving wrapper for batching and backpressure
Tier 3 — B+C hybrid registry: vendored metadata + HuggingFace file resolution

Installation

def deps do
  [{:ex_embed, "~> 0.1"}]
end

Quick start

# Embed a single text (downloads model on first use)
{:ok, tensor} = ExEmbed.embed("Hello, world!")
# => {:ok, #Nx.Tensor<f32[1][384]>}

# Embed a batch with a specific model
{:ok, tensor} = ExEmbed.embed(["text one", "text two"], model: "BAAI/bge-base-en-v1.5")

# List available models
ExEmbed.list_models()

Production: Nx.Serving

# In your supervision tree:
{Nx.Serving,
  serving: ExEmbed.Serving.new("BAAI/bge-small-en-v1.5"),
  name: MyApp.EmbeddingServing,
  batch_size: 32,
  batch_timeout: 100}

# At call time (e.g. on note save in LiveView):
{:ok, vec} = Nx.Serving.run(MyApp.EmbeddingServing, note.content)

Mix tasks

mix ex_embed.list                           # show all registered models
mix ex_embed.download bge-small-en-v1.5    # prefetch a model
mix ex_embed.check_registry                # diff against FastEmbed upstream

Supported models

Model	Dim	Size	Notes
BAAI/bge-small-en-v1.5 (default)	384	67 MB	Fast, English
BAAI/bge-base-en-v1.5	768	210 MB	Balanced, English
BAAI/bge-large-en-v1.5	1024	590 MB	High quality, English
BAAI/bge-m3	1024	1.2 GB	Multilingual, 100+ langs
sentence-transformers/all-MiniLM-L6-v2	384	90 MB	Popular general-purpose
nomic-ai/nomic-embed-text-v1.5	768	130 MB	Long context (8192 tokens)
intfloat/multilingual-e5-small	384	120 MB	Multilingual
intfloat/multilingual-e5-base	768	270 MB	Multilingual
mixedbread-ai/mxbai-embed-large-v1	1024	560 MB	Strong MTEB scores
Alibaba-NLP/gte-base-en-v1.5	768	210 MB	Strong English

Run mix ex_embed.check_registry to check for new models in the FastEmbed upstream.

Configuration

# config/config.exs
config :ex_embed,
  cache_dir: "/path/to/model/cache"  # default: ~/.cache/ex_embed

License

Apache 2.0

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
config		config
lib		lib
priv/registry		priv/registry
test		test
.formatter.exs		.formatter.exs
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
SECURITY.md		SECURITY.md
mix.exs		mix.exs
mix.lock		mix.lock
setup.sh		setup.sh
setup_ex_embed.sh		setup_ex_embed.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

ExEmbed

Features

Installation

Quick start

Production: Nx.Serving

Mix tasks

Supported models

Configuration

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

ExEmbed

Features

Installation

Quick start

Production: Nx.Serving

Mix tasks

Supported models

Configuration

License

About

Resources

License

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages