bge-small-en ONNX component fails to load with "Opset 19 is under development" #28161

eostis opened this issue Aug 27, 2023 · 11 comments

@eostis
eostis commented Aug 27, 2023

From https://blog.vespa.ai/bge-embedding-models-in-vespa-using-bfloat16/

Vespa 8.216.8

(no issues with model multilingual-e5-small)

The ONNX export trace (I also tried with --opset 17):

  • optimum-cli export onnx --task sentence-similarity -m BAAI/bge-small-en --optimize O3 wpsolr/models/bge-small-en-onnx
    Framework not specified. Using pt to export to ONNX.
    Downloading model.safetensors: 100% 133M/133M [00:06<00:00, 20.6MB/s]
    Downloading (…)okenizer_config.json: 100% 366/366 [00:00<00:00, 1.89MB/s]
    Using framework PyTorch: 2.0.1
    Overriding 1 configuration item(s)
    - use_cache -> False
    ================ Diagnostic Run torch.onnx.export version 2.0.1 ================
    verbose: False, log level: Level.ERROR
    ======================= 0 NONE 0 NOTE 0 WARNING 0 ERROR ========================

/usr/local/lib/python3.11/site-packages/optimum/onnxruntime/configuration.py:765: FutureWarning: disable_embed_layer_norm will be deprecated soon, use disable_embed_layer_norm_fusion instead, disable_embed_layer_norm_fusion is set to True.
warnings.warn(
Optimizing model...
symbolic shape inference disabled or failed.
Configuration saved in wpsolr/models/bge-small-en-onnx/ort_config.json
Optimized model saved at: wpsolr/models/bge-small-en-onnx (external data format: False; saved all tensor to one file: True)
Post-processing the exported models...
Validating models in subprocesses...
Validating ONNX model wpsolr/models/bge-small-en-onnx/model.onnx...
-[✓] ONNX model output names match reference model (last_hidden_state)
- Validating ONNX Model output "last_hidden_state":
-[✓] (2, 16, 384) matches (2, 16, 384)
-[x] values not close enough, max diff: 1.8654606342315674 (atol: 0.0001)
The ONNX export succeeded with the warning: The maximum absolute difference between the output of the reference model and the ONNX exported model is not within the set tolerance 0.0001:

  • last_hidden_state: max diff = 1.8654606342315674.
    The exported model was saved at: wpsolr/models/bge-small-en-onnx

The component definition:

  <component id="wpsolr_bge_small_en_onnx" type="hugging-face-embedder">
    <transformer-model url="https://www.dropbox.com/scl/fi/91x8qmxsq87plberfv238/model.onnx?rlkey=(...)&amp;dl=1"/>
    <tokenizer-model url="https://www.dropbox.com/scl/fi/nmhz3pzwhcc13ypz33kh1/tokenizer.json?rlkey=(...)&amp;dl=1"/>
    <pooling-strategy>cls</pooling-strategy>
    <normalize>true</normalize>
  </component>

The error in the Vespa logs:

Container.com.yahoo.jdisc.core.StandaloneMain	JDisc exiting: Throwable caught:
exception=
com.yahoo.container.di.componentgraph.core.ComponentNode$ComponentConstructorException: Error constructing 'wpsolr_bge_small_en_onnx' of type 'ai.vespa.embedding.huggingface.HuggingFaceEmbedder': null
Caused by: java.lang.RuntimeException: ONNX Runtime exception
    at ai.vespa.modelintegration.evaluator.OnnxEvaluator.createSession(OnnxEvaluator.java:161)
    at ai.vespa.modelintegration.evaluator.OnnxEvaluator.createSession(OnnxEvaluator.java:156)
    at ai.vespa.modelintegration.evaluator.OnnxEvaluator.<init>(OnnxEvaluator.java:36)
    at ai.vespa.modelintegration.evaluator.OnnxRuntime.evaluatorOf(OnnxRuntime.java:81)
Caused by: ai.onnxruntime.OrtException: Error code - ORT_FAIL - message: Load model from /opt/vespa/var/db/vespa/download/-1287799194143085460/contents failed:/builddir/build/BUILD/vespa-onnxruntime-1.13.1/onnxruntime/core/graph/model_load_utils.h:47 void onnxruntime::model_load_utils::ValidateOpsetForDomain(const std::unordered_map<std::__cxx11::basic_string, int>&, const onnxruntime::logging::Logger&, bool, const string&, int) ONNX Runtime only guarantees support for models stamped with official released onnx opset versions. Opset 19 is under development and support for this is limited. The operator schemas and or other functionality may change before next ONNX release and in this case ONNX Runtime will not guarantee backward compatibility. Current official support for domain com.ms.internal.nhwc is till opset 17.
    at ai.onnxruntime.OrtSession.createSession(Native Method)
    at ai.onnxruntime.OrtSession.<init>(OrtSession.java:73)
    at ai.onnxruntime.OrtEnvironment.createSession(OrtEnvironment.java:222)
    at ai.onnxruntime.OrtEnvironment.createSession(OrtEnvironment.java:208)
    at ai.vespa.modelintegration.evaluator.OnnxRuntime$1.create(OnnxRuntime.java:46)
    at ai.vespa.modelintegration.evaluator.OnnxRuntime.acquireSession(OnnxRuntime.java:149)
    at ai.vespa.modelintegration.evaluator.OnnxEvaluator.createSession(OnnxEvaluator.java:144)
    ... 3 more

@jobergum
Member

Hey - this is likely because you exported the model with a newer version of onnxruntime than the one Vespa bundles (vespa-onnxruntime 1.13.1). Try to downgrade onnxruntime to 1.13.1.
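
For reference, a minimal sketch of that downgrade, assuming a pip-based environment (the 1.13.1 version matches the vespa-onnxruntime build path visible in the stack trace above), followed by re-running the export command from the original report:

    pip install "onnxruntime==1.13.1"
    optimum-cli export onnx --task sentence-similarity -m BAAI/bge-small-en --optimize O3 wpsolr/models/bge-small-en-onnx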

@jobergum jobergum self-assigned this Aug 28, 2023
@jobergum jobergum added this to the soon milestone Aug 28, 2023
@jobergum
Member

We should document an easy way to find which onnxruntime version is used in any given Vespa version, to avoid exporting a model with a newer version than what Vespa uses.
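
Until that is documented, one possible workaround (an assumption, not an official interface: it relies on the runtime shipping as the vespa-onnxruntime RPM, as the build path in the stack trace suggests, and on a container named vespa) is to query the package inside the running container:

    docker exec vespa rpm -q vespa-onnxruntime
    # expected to print something like vespa-onnxruntime-1.13.1-... if the assumption holds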

@eostis
Author

eostis commented Aug 28, 2023

Would you know why exporting the model "multilingual-e5-small" works, while "bge-small-en" does not?

@jobergum
Member

They have slightly different model architectures, which can produce different compute graphs; that is what triggers this forward-compatibility issue with onnxruntime. What I want this ticket to be about is making it easy to target a specific Vespa version with a matching onnxruntime version.

@eostis
Author

eostis commented Aug 28, 2023

I will wait for the FR :)

@baldersheim
Member

We have fallen behind on onnxruntime versions; 1.15 is on the way.

@eostis
Author

eostis commented Aug 28, 2023

This makes sense to me now: multilingual-e5-small is an older model than bge-small-en.

@jobergum
Member

@arnej27959 or @lesters, does the ONNX file include which version was used to export it? If so, we could potentially sniff that and reject the model if it was exported with a newer runtime than the one Vespa uses.

@lesters
Member

lesters commented Sep 1, 2023

ONNX files contain both the opset version and the IR version of the file.
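
For illustration, a minimal sketch of reading both fields with the onnx Python package (the model path is a placeholder):

    import onnx

    # Load the model proto; ir_version and opset_import are top-level fields.
    model = onnx.load("model.onnx")

    print("IR version:", model.ir_version)
    for opset in model.opset_import:
        # An empty domain string means the default "ai.onnx" operator set.
        print("domain:", opset.domain or "ai.onnx", "opset:", opset.version)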

@jobergum
Member

jobergum commented Sep 1, 2023

What is the IR version?

@lesters
Member

lesters commented Sep 4, 2023

The IR (intermediate representation) version refers to the representation of the graph and the model structure, i.e. the overall computation that should be done and how it is laid out in the file. The opset version refers to the versions of the individual operators, which can change behavior or gain optimizations between releases. When new operators are introduced, the opset version must increase, and the IR version may increase along with it if the file format changes. So the opset version is in general the most important one to follow.
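
Based on that, a sketch of the reject-on-load check suggested earlier in this thread (the ceiling of 17 is taken from the error message above and is a stand-in for whatever limit the bundled onnxruntime actually supports; a real check would need per-domain limits rather than one global value):

    import onnx

    # Hypothetical ceiling: the error above reports support "till opset 17"
    # for the failing domain in vespa-onnxruntime 1.13.1.
    MAX_SUPPORTED_OPSET = 17

    def validate_opsets(path: str) -> None:
        model = onnx.load(path)
        for opset in model.opset_import:
            domain = opset.domain or "ai.onnx"
            if opset.version > MAX_SUPPORTED_OPSET:
                raise ValueError(
                    f"{path}: domain {domain} is stamped with opset {opset.version}, "
                    f"newer than the supported maximum {MAX_SUPPORTED_OPSET}"
                )

    validate_opsets("model.onnx")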
