
[Web] Error using opus-mt-mul fp16 models with WebGPU #25125

Open
@mram0509

Description

@mram0509

Describe the issue

An unknown error occurs when loading the "https://huggingface.co/Xenova/opus-mt-mul-en/resolve/main/onnx/encoder_model_fp16.onnx" and "https://huggingface.co/Xenova/opus-mt-mul-en/resolve/main/onnx/decoder_model_merged_fp16.onnx" models. The opus-mt-mul-en fp32 models load correctly with no errors.

To reproduce

Create a translation inference pipeline using "@huggingface/transformers" version 3.3.3, which uses onnxruntime-web version 1.21.0-dev.20250206-d981b153d3. The error occurs when loading https://huggingface.co/Xenova/opus-mt-mul-en/resolve/main/onnx/encoder_model_fp16.onnx
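The report does not include a code sample; a minimal sketch of the pipeline setup described above, assuming the standard transformers.js `pipeline` API with its `device` and `dtype` options, would look like:

```javascript
import { pipeline } from "@huggingface/transformers";

// Requesting the fp16 ONNX weights on the WebGPU execution provider
// is the configuration that triggers the error; dtype: "fp32" loads fine.
const translator = await pipeline("translation", "Xenova/opus-mt-mul-en", {
  device: "webgpu",
  dtype: "fp16", // fails to load; "fp32" works
});

const output = await translator("Bonjour le monde");
console.log(output);
```

This is an illustrative reproduction under the assumptions above, not the reporter's exact code; it must run in a WebGPU-capable browser context.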

Urgency

No response

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.21.0-dev.20250206-d981b153d3

Execution Provider

'webgpu' (WebGPU)

Metadata

Assignees

No one assigned

    Labels

    ep:WebGPU (ort-web webgpu provider)
    model:transformer (issues related to a transformer model: BERT, GPT2, Hugging Face, Longformer, T5, etc.)
    platform:web (issues related to ONNX Runtime web; typically submitted using template)
