Description
Describe the issue
When a q4f16-quantized ONNX model contains a Conv3d op, running it throws the following error:
An uncaught WebGPU validation error was raised: Error while parsing WGSL: :51:15 error: return statement type must match its function return type, returned 'f16', expected 'f32'
return get_xByIndices(aIndices);
^^^^^^
- While validating [ShaderModuleDescriptor "Conv3DNaive"]
- While calling [Device].CreateShaderModule([ShaderModuleDescriptor "Conv3DNaive"]).
I think it's this op here specifically that is causing the error:
This happens only with the WebGPU execution provider; the same model runs without this error on WASM.
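To make the validation error concrete, here is a minimal sketch of how this kind of mismatch can arise in a shader generator. It is not ONNX Runtime's actual code: the accessor name `getX`, both builder functions, and `declaredReturnType` are hypothetical, and only `get_xByIndices` is taken from the error message. The assumption is that the generated WGSL declares an `f32` return type while the input tensor, and therefore `get_xByIndices`, uses `f16`.

```typescript
// Hypothetical illustration only: NOT ONNX Runtime's actual shader generator.
type WgslScalar = 'f32' | 'f16';

// Builds a WGSL accessor with a hard-coded f32 return type (the suspected bug pattern).
function buildAccessorHardcoded(): string {
  return [
    'fn getX(aIndices: array<u32, 5>) -> f32 {',
    '  return get_xByIndices(aIndices);',
    '}',
  ].join('\n');
}

// Builds the same accessor, but derives the return type from the input's element type.
function buildAccessorTyped(elementType: WgslScalar): string {
  return [
    `fn getX(aIndices: array<u32, 5>) -> ${elementType} {`,
    '  return get_xByIndices(aIndices);',
    '}',
  ].join('\n');
}

// Extracts the declared return type from the generated function signature.
function declaredReturnType(wgsl: string): string | null {
  const match = /->\s*(\w+)\s*\{/.exec(wgsl);
  return match ? match[1] : null;
}

// In a q4f16 model the input tensor is f16, so get_xByIndices returns f16.
const inputType: WgslScalar = 'f16';

// Hard-coded variant: declared f32 vs. returned f16, so validation would fail.
console.log(declaredReturnType(buildAccessorHardcoded()) === inputType); // false

// Typed variant: declared type follows the input type, so the types agree.
console.log(declaredReturnType(buildAccessorTyped(inputType)) === inputType); // true
```

If the real generator follows a similar pattern, deriving the return type from the input's data type rather than fixing it at `f32` would be one plausible fix. This would need to be confirmed against the Conv3DNaive shader source.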
To reproduce
Visit https://huggingface.co/spaces/pdufour/Qwen2VL_TransformersJS_Demo (a Space I set up to use this model)
Select the q4f16 model to load
Select an example image, type in some text, and hit Enter
Actual results
See error mentioned above
Expected results
The query should be processed without errors
ONNX Runtime Version or Commit ID
1.20.1
Execution Provider
'webgpu' (WebGPU)