Open
Description
Essentially, we are upstreaming https://github.com/microsoft/onnxruntime/blob/main/onnxruntime/python/tools/transformers/fusion_constant_fold.py
https://github.com/microsoft/onnxruntime/blob/838b97e73289cd11caf969f9f5c01ce153d6069f/onnxruntime/python/tools/transformers/dynamo_onnx_helper.py#L182
If initializer is not consumed by other inputs, we can transpose the initializer in advance.
Metadata
Metadata
Assignees
Labels
No labels
Activity
Support Gemma3 with Clip fused attention (#24280)
Support Gemma3 with Clip fused attention (microsoft#24280)
titaiwangms commentedon Apr 30, 2025
Related https://github.com/microsoft/onnxscript/pull/2025/files