Pass for Nvidia ModelOpt graph surgery framework by hthadicherla · Pull Request #2377 · microsoft/Olive

hthadicherla · 2026-03-31T10:08:03Z

Describe your changes

Add NVModelOptGraphSurgery pass to integrate NVIDIA ModelOpt graph surgeries into Olive. Supports all existing surgeries in ModelOpt like GQA fusion, DQ-Transpose and all future surgeries that will be added in ModelOpt

Changes:

New pass: olive/passes/onnx/nvmo_graph_surgery.py
Pass registration in olive_config.json
Unit tests: test/passes/onnx/test_nvmo_graph_surgery.py
Documentation in pass.rst and onnx-transformations.md

Usage

Example

{
    "type": "NVModelOptGraphSurgery",
    "config": {
        "surgery_type": "replace-gqa" # Surgery-key in Modelopt,
        "surgery_params": {
             # Surgery specific parameters of that particular surgery, example below
            "hf_model_id": "meta-llama/Llama-2-7b-hf",
            "io_dtype": "float16"
        }
    }
}

Checklist before requesting a review

Add unit tests for this change.
Make sure all tests can pass.
Update documents if necessary.
Lint and apply fixes to your code by running lintrunner -a
Is this a user-facing change? If yes, give a description of this change to be included in the release notes.

Release note: Added NVModelOptGraphSurgery pass for running NVIDIA ModelOpt graph surgeries on ONNX models.

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>

hthadicherla · 2026-04-06T06:27:25Z

@jambayk Can you review this PR? This is graph surgery pass for NVIDIA stack. The implementation of the surgeries will be in modelopt and essentially we are calling them through the olive pass.

docs/source/features/onnx-transformations.md

olive/passes/onnx/nvmo_graph_surgery.py

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>

Pipeline is failing because graph_surgeon dependency is using onnx.helper.bfloat.float32_to_bfloat16 which has been deprecated in onnx since version 1.18 and removed in 1.20.

hthadicherla · 2026-04-08T12:40:52Z

This is a conflict between onnx_graphsurgeon and onnx and needs to be resolved within onnx_graphsurgeon. I have raised a PR in ModelOpt for a workaround temporarily. NVIDIA/Model-Optimizer#1204.

After it is merged, this failure should resolve itself

hthadicherla · 2026-04-08T13:08:45Z

The PR has been merged. Can you re review it again @jambayk ?

docs/source/features/onnx-transformations.md

tests need to be skipped until new release of modelopt due to incompatibility with latest onnx versions

xiaoyu-work · 2026-04-08T22:55:11Z

@hthadicherla can you update this PR? we plan to release new Olive version this Friday and this PR will be in the new release

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>

hthadicherla · 2026-04-09T08:28:51Z

@xiaoyu-work @jambayk I have added skip to the test. Can you reapprove it now ?

hthadicherla added 3 commits March 31, 2026 14:37

Added ModelOpt graph surgery pass in Olive

4dfbbdb

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>

Added ModelOpt graph surgery pass in Olive

4adc8bb

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>

Updated documentation with usage

51a22c6

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>

jambayk reviewed Apr 6, 2026

View reviewed changes

docs/source/features/onnx-transformations.md Show resolved Hide resolved

jambayk reviewed Apr 6, 2026

View reviewed changes

olive/passes/onnx/nvmo_graph_surgery.py Outdated Show resolved Hide resolved

changed to pathlib.Path from os.path.join()

5b287d5

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>

hthadicherla force-pushed the hthadicherla/graph-surgery-pass branch from 6e92bd3 to 5b287d5 Compare April 7, 2026 08:46

hthadicherla requested a review from jambayk April 7, 2026 08:47

jambayk previously approved these changes Apr 7, 2026

View reviewed changes

jambayk enabled auto-merge (squash) April 7, 2026 18:39

jambayk disabled auto-merge April 7, 2026 21:10

hthadicherla requested a review from jambayk April 8, 2026 13:08

jambayk previously approved these changes Apr 8, 2026

View reviewed changes

jambayk reviewed Apr 8, 2026

View reviewed changes

docs/source/features/onnx-transformations.md Show resolved Hide resolved

added skip for tests

ba3f2a7

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>

hthadicherla force-pushed the hthadicherla/graph-surgery-pass branch from 2b09f13 to ba3f2a7 Compare April 9, 2026 08:24

Merge branch 'main' into hthadicherla/graph-surgery-pass

a05d657

hthadicherla requested a review from jambayk April 9, 2026 08:29

jambayk approved these changes Apr 9, 2026

View reviewed changes

jambayk enabled auto-merge (squash) April 9, 2026 16:18

jambayk merged commit 27f477f into microsoft:main Apr 9, 2026
14 of 15 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Pass for Nvidia ModelOpt graph surgery framework#2377

Pass for Nvidia ModelOpt graph surgery framework#2377
jambayk merged 6 commits intomicrosoft:mainfrom
hthadicherla:hthadicherla/graph-surgery-pass

hthadicherla commented Mar 31, 2026 •

edited

Loading

Uh oh!

hthadicherla commented Apr 6, 2026

Uh oh!

Uh oh!

Uh oh!

hthadicherla commented Apr 8, 2026 •

edited

Loading

Uh oh!

hthadicherla commented Apr 8, 2026

Uh oh!

Uh oh!

xiaoyu-work commented Apr 8, 2026

Uh oh!

hthadicherla commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

hthadicherla commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe your changes

Usage

Checklist before requesting a review

Uh oh!

hthadicherla commented Apr 6, 2026

Uh oh!

Uh oh!

Uh oh!

hthadicherla commented Apr 8, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

hthadicherla commented Apr 8, 2026

Uh oh!

Uh oh!

xiaoyu-work commented Apr 8, 2026

Uh oh!

hthadicherla commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

hthadicherla commented Mar 31, 2026 •

edited

Loading

hthadicherla commented Apr 8, 2026 •

edited

Loading