Skip to content

.Net: New Feature: AddBertOnnxEmbeddingGenerator should support multi-langugage embedding models which do not have vocab.txt files #12571

@philipag

Description

@philipag

name: Feature request
about: Suggest an idea for this project


Multi-language text embeddings models like arctic_embed_m_v2_0 do not use vocab.txt files are currently not supported by AddBertOnnxEmbeddingGenerator(). The following throws an exception:

		builder.Services.AddBertOnnxEmbeddingGenerator(
			onnxModelPath: "....\arctic_embed_m_v2_0_model_int8.onnx",
			vocabPath: null
			);

The embeddings generator should support these newer models.

Metadata

Metadata

Assignees

Labels

.NETIssue or Pull requests regarding .NET codeai connectorAnything related to AI connectors

Projects

Status

Backlog: Planned

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions