Extend optimize_for_ort to cover passes #2274

titaiwangms · 2025-05-05T22:11:43Z

A draft for discussion. We should cover all post-processing the model shipping needs

codecov · 2025-05-05T22:15:09Z

Codecov Report

Attention: Patch coverage is 33.33333% with 4 lines in your changes missing coverage. Please review.

Project coverage is 73.75%. Comparing base (2ae13be) to head (fd1a225).

Files with missing lines	Patch %	Lines
onnxscript/rewriter/ort_fusions/_core.py	33.33%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2274      +/-   ##
==========================================
- Coverage   73.76%   73.75%   -0.02%     
==========================================
  Files         239      239              
  Lines       30904    30907       +3     
  Branches     3494     3494              
==========================================
- Hits        22797    22796       -1     
- Misses       6907     6911       +4     
  Partials     1200     1200

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

gramalingam · 2025-05-05T23:14:34Z

Please also consider whether this method should be optimize in-place or not. I think we can make it in-place now that shape-inference itself is in-place.

onnxscript/rewriter/ort_fusions/_core.py

justinchuby · 2025-05-07T16:06:55Z

Please also consider whether this method should be optimize in-place or not. I think we can make it in-place now that shape-inference itself is in-place.

I think making it out-of-place is safer, in case we have passes in the future that need to be functional?

titaiwangms · 2025-05-08T22:19:04Z

onnxscript/rewriter/ort_fusions/_core.py

+        # https://github.com/microsoft/onnxruntime/blob/74dcf7e296639095dfa55d31336998b6f719ed76/onnxruntime/python/tools/transformers/dynamo_onnx_helper.py#L172
+        common_passes.ClearMetadataAndDocStringPass(),
+        # https://github.com/microsoft/onnxruntime/blob/74dcf7e296639095dfa55d31336998b6f719ed76/onnxruntime/python/tools/transformers/dynamo_onnx_helper.py#L139
+        common_passes.LiftConstantsToInitializersPass(lift_all_constants=False, size_limit=1),


We have another pass called LiftSubgraphInitializersToMainGraphPass. Do we know if it's needed in genAI? @kunal-vaishnavi

If the pass logic is in DynamoOnnxHelper, then it is used for ONNX Runtime GenAI.

We don't really produce graphs with subgraph initializers. I think we are ok either way

justinchuby · 2025-05-16T00:53:02Z

onnxscript/rewriter/ort_fusions/_core.py

@@ -135,4 +135,18 @@ def optimize_for_ort(
    )
    # Apply the ORT pattern rewrite rules.
    rewrite(model, ORT_PATTERN_REWRITE_RULES)
-    return model, fusion_count
+
+    passes = [


Suggested change

passes = [

passes = ir.passes.Sequential(

justinchuby · 2025-05-16T00:53:11Z

onnxscript/rewriter/ort_fusions/_core.py

+    ]
+    optimize_for_ort_passes = ir.passes.Sequential(*passes)


Suggested change

]

optimize_for_ort_passes = ir.passes.Sequential(*passes)

)

titaiwangms · 2025-05-16T18:16:09Z

I will set up Whisper and test it before merge this PR microsoft/onnxruntime#24382

draft

322851a

github-project-automation bot added this to ONNX Script Review Board May 5, 2025

github-project-automation bot moved this to Todo in ONNX Script Review Board May 5, 2025

titaiwangms requested review from gramalingam, shubhambhokare1, justinchuby and xadupre May 5, 2025 22:11

titaiwangms added the topic: api label May 5, 2025

justinchuby reviewed May 7, 2025

View reviewed changes

onnxscript/rewriter/ort_fusions/_core.py Outdated Show resolved Hide resolved

use pass manager

0226ed7

titaiwangms commented May 8, 2025

View reviewed changes

titaiwangms added 3 commits May 8, 2025 15:20

Merge branch 'main' into titaiwang/add_default_passes_to_ort_fusion

42ecb33

Merge branch 'main' into titaiwang/add_default_passes_to_ort_fusion

aa1e4a3

Merge branch 'main' into titaiwang/add_default_passes_to_ort_fusion

fd1a225

justinchuby reviewed May 16, 2025

View reviewed changes

justinchuby approved these changes May 16, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Extend optimize_for_ort to cover passes #2274

Extend optimize_for_ort to cover passes #2274

Uh oh!

titaiwangms commented May 5, 2025 •

edited

Loading

Uh oh!

codecov bot commented May 5, 2025 •

edited

Loading

Uh oh!

gramalingam commented May 5, 2025

Uh oh!

Uh oh!

justinchuby commented May 7, 2025

Uh oh!

titaiwangms May 8, 2025

Uh oh!

kunal-vaishnavi May 8, 2025

Uh oh!

justinchuby May 9, 2025

Uh oh!

justinchuby May 16, 2025

Uh oh!

justinchuby May 16, 2025

Uh oh!

titaiwangms commented May 16, 2025

Uh oh!

Uh oh!

Extend optimize_for_ort to cover passes #2274

Are you sure you want to change the base?

Extend optimize_for_ort to cover passes #2274

Uh oh!

Conversation

titaiwangms commented May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented May 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

gramalingam commented May 5, 2025

Uh oh!

Uh oh!

justinchuby commented May 7, 2025

Uh oh!

titaiwangms May 8, 2025

Choose a reason for hiding this comment

Uh oh!

kunal-vaishnavi May 8, 2025

Choose a reason for hiding this comment

Uh oh!

justinchuby May 9, 2025

Choose a reason for hiding this comment

Uh oh!

justinchuby May 16, 2025

Choose a reason for hiding this comment

Uh oh!

justinchuby May 16, 2025

Choose a reason for hiding this comment

Uh oh!

titaiwangms commented May 16, 2025

Uh oh!

Uh oh!

titaiwangms commented May 5, 2025 •

edited

Loading

codecov bot commented May 5, 2025 •

edited

Loading