Add MNEMON_EMBED_DIMENSIONS for Matryoshka dimension truncation by achinth-b · Pull Request #5 · mnemon-dev/mnemon

achinth-b · 2026-05-02T00:22:45Z

Closes #4

Support Matryoshka Representation Learning by passing the dimensions parameter to Ollama's /api/embed endpoint. When MNEMON_EMBED_DIMENSIONS is set (e.g., 256), Ollama truncates and re-normalizes the embedding vector, giving faster similarity search with minimal quality loss on MRL-trained models like nomic-embed-text.

Backward compatible: when unset, behavior is identical to before.

What

Added dims field to the Client struct (0 = use native dimensions)
NewClient() reads the MNEMON_EMBED_DIMENSIONS env var (parsed as int, ignores invalid/negative values)
Added Dimensions field to embedRequest with json:"dimensions,omitempty" so it's only included when set
Embed() conditionally populates req.Dimensions when dims > 0
Updated CHANGELOG.md under [Unreleased]
Updated docs/USAGE.md Configuration table with the new env var

Why

Here is the issue which describes the problem.

Matryoshka-trained models encode the most important semantic signal in the first N dimensions. Using 256 dims instead of 768 gives ~95% retrieval quality at 3× less storage and faster cosine similarity. This is especially impactful for large knowledge bases where similarity search dominates recall latency.

Verified locally

Tested against Ollama nomic-embed-text on macOS:

# Without dimensions → 768-dim vector
curl -s http://localhost:11434/api/embed \
  -d '{"model":"nomic-embed-text","input":"test query"}' 
# → Vector length: 768

# With dimensions: 256 → 256-dim vector
curl -s http://localhost:11434/api/embed \
  -d '{"model":"nomic-embed-text","input":"test query","dimensions":256}'
# → Vector length: 256

Warning

Re-indexing caveat
If users change dimensions on an existing store, old embeddings (768-dim) and new embeddings (256-dim) will have incompatible vector sizes. For this PR, the env var only affects newly generated embeddings. A follow-up could add a mnemon embed --reindex command.

Support Matryoshka Representation Learning by passing the dimensions parameter to Ollama's /api/embed endpoint. When MNEMON_EMBED_DIMENSIONS is set (e.g., 256), Ollama truncates and re-normalizes the embedding vector, giving faster similarity search with minimal quality loss on MRL-trained models like nomic-embed-text. Backward compatible: when unset, behavior is identical to before.

Grivn · 2026-05-03T09:26:35Z

LGTM. Thanks for the clean, focused change.

Grivn merged commit 29e0cc2 into mnemon-dev:master May 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add MNEMON_EMBED_DIMENSIONS for Matryoshka dimension truncation#5

Add MNEMON_EMBED_DIMENSIONS for Matryoshka dimension truncation#5
Grivn merged 1 commit into
mnemon-dev:masterfrom
achinth-b:achinth/matryoshka-dimensions

achinth-b commented May 2, 2026

Uh oh!

Grivn commented May 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

achinth-b commented May 2, 2026

What

Why

Verified locally

Uh oh!

Grivn commented May 3, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants