.Net: New Feature: AddBertOnnxEmbeddingGenerator should support multi-language embedding models which do not have vocab.txt files #12571

@philipag

Description

Multi-language text embedding models such as arctic_embed_m_v2_0, which do not use vocab.txt files, are currently not supported by AddBertOnnxEmbeddingGenerator(). The following call throws an exception:

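		builder.Services.AddBertOnnxEmbeddingGenerator(
			// Verbatim string so that "\a" is not read as an escape sequence.
			onnxModelPath: @"....\arctic_embed_m_v2_0_model_int8.onnx",
			// The model has no vocab.txt, so there is no path to pass here.
			vocabPath: null
			);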

The embeddings generator should support these newer models.

Metadata

Labels

.NET: Issue or Pull requests regarding .NET code
ai connector: Anything related to AI connectors
needs more info: Any issue that requires more information from contributors

Projects

Status

Backlog: Planned
