
[Web] Can't use Conv3DNaive on webgpu #22974

Open
@pdufour

Describe the issue

When a q4f16-quantized ONNX model contains a Conv3d op, running it throws the following error:

An uncaught WebGPU validation error was raised: Error while parsing WGSL: :51:15 error: return statement type must match its function return type, returned 'f16', expected 'f32'
              return get_xByIndices(aIndices);
              ^^^^^^


 - While validating [ShaderModuleDescriptor "Conv3DNaive"]
 - While calling [Device].CreateShaderModule([ShaderModuleDescriptor "Conv3DNaive"]).
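
The message reads as a mismatch between a helper function's declared return type (f32) and the type of the value it returns (f16). A minimal sketch of that failure mode follows, assuming a hypothetical shader-source generator. The names `buildGetter`, `returnTypeMatches`, `getX`, and `ElemType`, as well as the parameter type in the emitted signature, are illustrative and are not taken from ONNX Runtime's source; only `get_xByIndices` and `aIndices` come from the error text.

```typescript
// Hypothetical illustration only: this is NOT ONNX Runtime's actual shader generator.
type ElemType = 'f32' | 'f16';

// Emits a WGSL-like helper that forwards to get_xByIndices (name from the error message).
// `declaredReturn` is the type written in the function signature; `inputType` is the
// element type that get_xByIndices is assumed to yield for the bound tensor.
function buildGetter(inputType: ElemType, declaredReturn: ElemType): string {
  return [
    `fn getX(aIndices: array<u32, 5>) -> ${declaredReturn} {`,
    `  return get_xByIndices(aIndices); // yields ${inputType}`,
    `}`,
  ].join('\n');
}

// True when the declared return type agrees with the type of the returned value,
// which is the condition the WGSL validator is complaining about.
function returnTypeMatches(inputType: ElemType, declaredReturn: ElemType): boolean {
  return inputType === declaredReturn;
}

// A hard-coded f32 return type is fine for f32 inputs but mismatches f16 inputs.
const brokenForF16 = buildGetter('f16', 'f32');

// Deriving the return type from the input type keeps the two in sync.
const consistent = buildGetter('f16', 'f16');
```

If the real shader template hard-codes `f32` in a similar way, deriving the return type from the input tensor's element type would be one way to resolve the mismatch; that is a guess based on the error text alone.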

I think it's this op here specifically that is causing the error:
Image

This does not happen with the WASM execution provider, only with the WebGPU execution provider.
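
Because the failure is reported only on WebGPU, a possible stopgap until the shader is fixed is to route the q4f16 variant to WASM. A minimal sketch follows. `pickExecutionProvider`, `Dtype`, and `Provider` are hypothetical names, not part of onnxruntime-web or Transformers.js, and the dtype strings other than `q4f16` are assumptions.

```typescript
// Hypothetical helper, not part of onnxruntime-web or Transformers.js.
type Dtype = 'fp32' | 'fp16' | 'q4' | 'q4f16';
type Provider = 'webgpu' | 'wasm';

// Assumption: only the q4f16 variant is known to hit the Conv3DNaive
// f16/f32 mismatch (it is the only one reported in this issue), so every
// other dtype is left on WebGPU.
function pickExecutionProvider(dtype: Dtype): Provider {
  return dtype === 'q4f16' ? 'wasm' : 'webgpu';
}

const providerForQuantized = pickExecutionProvider('q4f16');
const providerForFull = pickExecutionProvider('fp32');
```

The returned string would then be passed wherever the execution provider is chosen, for example in the `executionProviders` option of `ort.InferenceSession.create`.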

To reproduce

Visit https://huggingface.co/spaces/pdufour/Qwen2VL_TransformersJS_Demo (a Space I set up to use this model).
Select the q4f16 model to load.
Select an example image, type in some text, and press Enter.

Actual results
See error mentioned above

Expected results
Should process query

ONNX Runtime Version or Commit ID

1.20.1

Execution Provider

'webgpu' (WebGPU)

Labels

ep:WebGPU: ort-web webgpu provider
platform:web: issues related to ONNX Runtime web; typically submitted using template
