Added workaround to onnx.helper function deprecated in onnx 1.21 #1204
hthadicherla merged 1 commit into main from
Conversation
…this is deprecated in onnx 1.21 Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>
📝 Walkthrough
Two requirements files have unpinned the ONNX dependency from version 1.19.0 to accept any version, and a compatibility shim was added to modelopt/onnx/__init__.py.
Estimated code review effort: 🎯 2 (Simple) | ⏱️ ~12 minutes
🚥 Pre-merge checks: ✅ 3 passed | ❌ 1 failed (warning)
🧹 Nitpick comments (3)
examples/windows/onnx_ptq/whisper/requirements.txt (1)

7-7: Consider adding a lower version bound for `onnx`.

Completely unpinning the dependency could lead to installing incompatible older versions or future breaking versions. Consider specifying at minimum a lower bound (e.g., `onnx>=1.19.0`) to ensure compatibility with the codebase while still allowing 1.21+.

💡 Suggested constraint

```diff
-onnx
+onnx>=1.19.0
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@examples/windows/onnx_ptq/whisper/requirements.txt` at line 7: the requirements file currently lists an unpinned dependency "onnx"; update examples/windows/onnx_ptq/whisper/requirements.txt to add a sensible lower bound (e.g., change "onnx" to "onnx>=1.19.0" or "onnx>=1.21.0" depending on tested compatibility) so older incompatible versions are not installed; ensure the new constraint is the only change and matches the tested runtime for the whisper/onnx_ptq example.

modelopt/onnx/__init__.py (1)
21-31: Add a comment explaining this workaround.

The monkey-patch works, but future maintainers would benefit from a brief comment explaining why this exists and linking to the upstream issue (onnx_graphsurgeon needs to be updated).

📝 Suggested documentation

```diff
 import onnx.helper

+# Workaround: onnx 1.21 removed onnx.helper.float32_to_bfloat16, but onnx_graphsurgeon
+# still depends on it. Provide a compatible implementation until onnx_graphsurgeon is updated.
+# See: https://github.com/NVIDIA/TensorRT/issues/... (link to upstream issue if available)
 if not hasattr(onnx.helper, "float32_to_bfloat16"):
     import ml_dtypes
     import numpy as np

     def _float32_to_bfloat16(value):
+        """Convert a scalar float32 value to its bfloat16 representation as uint16."""
         arr = np.array(value, dtype=np.float32)
         return int(arr.astype(ml_dtypes.bfloat16).view(np.uint16))

     onnx.helper.float32_to_bfloat16 = _float32_to_bfloat16
```

🤖 Prompt for AI Agents
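As a cross-check of the suggested shim, here is a dependency-free sketch of the same conversion using only the standard library. Note that it truncates the low 16 bits rather than rounding, which is an assumption about the desired semantics; the diff above relies on ml_dtypes instead, and the function name here is illustrative.

```python
import struct


def float32_to_bfloat16_trunc(value: float) -> int:
    """Return the bfloat16 bit pattern of a float as an unsigned 16-bit int.

    bfloat16 keeps the sign bit, all 8 exponent bits, and the top 7 mantissa
    bits of IEEE-754 float32, so dropping the low 16 bits is sufficient
    (truncation rounding).
    """
    (bits,) = struct.unpack("<I", struct.pack("<f", value))
    return bits >> 16


if __name__ == "__main__":
    print(hex(float32_to_bfloat16_trunc(1.0)))   # 0x3f80
    print(hex(float32_to_bfloat16_trunc(-2.0)))  # 0xc000
```

Truncation matches the bit-level layout of bfloat16 exactly for representable values; ml_dtypes may apply round-to-nearest-even on the discarded mantissa bits, so the two can differ by one unit in the last place.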
Verify each finding against the current code and only fix it if needed. In `@modelopt/onnx/__init__.py` around lines 21-31: add a short explanatory comment above the monkey-patch in modelopt/onnx/__init__.py describing that onnx.helper.float32_to_bfloat16 is missing in some ONNX builds, that this runtime fallback (_float32_to_bfloat16 using ml_dtypes and numpy) restores expected behavior for callers, and reference the upstream onnx_graphsurgeon issue/PR that requires updating; place the comment adjacent to the onnx.helper.float32_to_bfloat16 assignment and mention the involved symbols (_float32_to_bfloat16, ml_dtypes, onnx_graphsurgeon) so future maintainers know why and where to remove it.

examples/windows/onnx_ptq/genai_llm/requirements.txt (1)
2-2: Consider adding a lower version bound for `onnx`.

Same as the whisper requirements file: consider specifying at minimum a lower bound to ensure compatibility while allowing newer versions.

💡 Suggested constraint

```diff
-onnx
+onnx>=1.19.0
```

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed. In `@examples/windows/onnx_ptq/genai_llm/requirements.txt` at line 2, The requirements line currently lists only "onnx" with no lower bound; update the dependency entry in requirements.txt to include a minimum compatible version (for example change "onnx" to "onnx>=1.14.0" or match the lower bound used in the whisper requirements) so consumers get a known-compatible ONNX release while still allowing newer versions.
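For illustration, a minimal stdlib-only way to express the lower-bound check discussed in these nitpicks. The helper names are hypothetical, and real tooling should use `packaging.version`, which also handles pre-release suffixes that this naive parser does not.

```python
def version_tuple(version: str) -> tuple:
    """Parse a plain dotted version string like "1.19.0" into a comparable tuple."""
    return tuple(int(part) for part in version.split("."))


def meets_lower_bound(installed: str, minimum: str) -> bool:
    """Return True if `installed` satisfies a `>=minimum` constraint."""
    return version_tuple(installed) >= version_tuple(minimum)


if __name__ == "__main__":
    # Mirrors the suggested `onnx>=1.19.0` constraint.
    print(meets_lower_bound("1.21.0", "1.19.0"))  # True
    print(meets_lower_bound("1.14.1", "1.19.0"))  # False
```

Tuple comparison is lexicographic, so `(1, 21, 0) >= (1, 19, 0)` holds while `(1, 14, 1)` fails at the second component, which is exactly the behavior a `>=` pin asks pip for.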
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.
Nitpick comments:
In `@examples/windows/onnx_ptq/genai_llm/requirements.txt`:
- Line 2: The requirements line currently lists only "onnx" with no lower bound;
update the dependency entry in requirements.txt to include a minimum compatible
version (for example change "onnx" to "onnx>=1.14.0" or match the lower bound
used in the whisper requirements) so consumers get a known-compatible ONNX
release while still allowing newer versions.
In `@examples/windows/onnx_ptq/whisper/requirements.txt`:
- Line 7: The requirements file currently lists an unpinned dependency "onnx";
update examples/windows/onnx_ptq/whisper/requirements.txt to add a sensible
lower bound (e.g., change "onnx" to "onnx>=1.19.0" or "onnx>=1.21.0" depending
on tested compatibility) so older incompatible versions are not installed;
ensure the new constraint is the only change and matches the tested runtime for
the whisper/onnx_ptq example.
In `@modelopt/onnx/__init__.py`:
- Around line 21-31: Add a short explanatory comment above the monkey-patch in
modelopt/onnx/__init__.py describing that onnx.helper.float32_to_bfloat16 is
missing in some ONNX builds, that this runtime fallback (_float32_to_bfloat16
using ml_dtypes and numpy) restores expected behavior for callers, and reference
the upstream onnx_graphsurgeon issue/PR that requires updating; place the
comment adjacent to the onnx.helper.float32_to_bfloat16 assignment and mention
the involved symbols (_float32_to_bfloat16, ml_dtypes, onnx_graphsurgeon) so
future maintainers know why and where to remove it.
ℹ️ Review info
⚙️ Run configuration
Configuration used: Path: .coderabbit.yaml
Review profile: CHILL
Plan: Pro
Run ID: 8003d185-395c-417c-afb2-f5eefda83238
📒 Files selected for processing (3)
- examples/windows/onnx_ptq/genai_llm/requirements.txt
- examples/windows/onnx_ptq/whisper/requirements.txt
- modelopt/onnx/__init__.py
Codecov Report
❌ Patch coverage is
Additional details and impacted files@@ Coverage Diff @@
## main #1204 +/- ##
==========================================
+ Coverage 71.86% 76.94% +5.08%
==========================================
Files 352 352
Lines 40343 40351 +8
==========================================
+ Hits 28992 31050 +2058
+ Misses 11351 9301 -2050
Flags with carried forward coverage won't be shown. ☔ View full report in Codecov by Sentry.
```diff
@@ -1,4 +1,4 @@
 datasets>=2.14.5
-onnx==1.19.0
+onnx
```
Should we keep this unconstrained (no version bound at all), or should it be something like `onnx>=1.19.0`? I'm not so sure of the downside, unless it pulls in an older incompatible version.
(Same for other such places.)
I think if onnx is not installed, then at least for Python >=3.10 it will install the latest version when the user installs via requirements.txt for the first time. If onnx is installed via pyproject.toml, it will currently install onnx 1.19, which I think is fine.
Just to note, another older onnx-incompatibility issue in TRT onnx-graphsurgeon: NVIDIA/TensorRT#4653
Yeah, I saw this, but they reverted the change in PR NVIDIA/TensorRT#4688. There seem to be some test failures there.
Merging this PR
### What does this PR do?

**Type of change:** Bug Fix

With onnx 1.21 and onnx-graphsurgeon there is an incompatibility: onnx discontinued the float32-to-bfloat16 helper, causing an import failure when importing onnx-graphsurgeon, which depends on that function.

```
Traceback (most recent call last):
  File "C:\Users\local-hthadicherla\Downloads\ModelOpt_Bench\Model-Optimizer\examples\windows\onnx_ptq\genai_llm\quantize.py", line 29, in <module>
    from modelopt.onnx.quantization.int4 import quantize as quantize_int4
  File "C:\Users\local-hthadicherla\Downloads\ModelOpt_Bench\Model-Optimizer\modelopt\onnx\__init__.py", line 36, in <module>
    from . import quantization
  File "C:\Users\local-hthadicherla\Downloads\ModelOpt_Bench\Model-Optimizer\modelopt\onnx\quantization\__init__.py", line 19, in <module>
    from .int4 import quantize as quantize_int4
  File "C:\Users\local-hthadicherla\Downloads\ModelOpt_Bench\Model-Optimizer\modelopt\onnx\quantization\int4.py", line 30, in <module>
    import onnx_graphsurgeon as gs
  File "C:\Users\local-hthadicherla\Downloads\ModelOpt_Bench\venv\Lib\site-packages\onnx_graphsurgeon\__init__.py", line 1, in <module>
    from onnx_graphsurgeon.exporters.onnx_exporter import export_onnx
  File "C:\Users\local-hthadicherla\Downloads\ModelOpt_Bench\venv\Lib\site-packages\onnx_graphsurgeon\exporters\onnx_exporter.py", line 134, in <module>
    np.uint16, onnx.helper.float32_to_bfloat16
               ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: module 'onnx.helper' has no attribute 'float32_to_bfloat16'
```

Added an ml_dtypes-based converter and overrode the helper so that this workaround function is used as onnx.helper.float32_to_bfloat16. The long-term fix is an action item for the TensorRT team to address this within onnx-graphsurgeon.

### Testing

Tested that quantization works after the fix with onnx 1.21.


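The guarded override described above can be illustrated in isolation. This sketch uses a throwaway stand-in module instead of the real onnx.helper, and a stdlib-only fallback instead of the ml_dtypes-based one, so the names and the rounding behavior here are illustrative assumptions, not the PR's exact code.

```python
import struct
import types

# Stand-in for onnx.helper: a module object that may or may not
# already provide float32_to_bfloat16.
helper = types.ModuleType("fake_onnx_helper")


def _float32_to_bfloat16(value: float) -> int:
    """Fallback converter: top 16 bits of the IEEE-754 float32 encoding."""
    (bits,) = struct.unpack("<I", struct.pack("<f", value))
    return bits >> 16


# Patch only when the attribute is missing, so versions of the real
# module that still ship the helper keep their own implementation.
if not hasattr(helper, "float32_to_bfloat16"):
    helper.float32_to_bfloat16 = _float32_to_bfloat16

print(hex(helper.float32_to_bfloat16(1.0)))  # 0x3f80
```

The `hasattr` guard is what makes the shim safe across versions: on onnx <= 1.19 the real attribute exists and the fallback is never installed, while on 1.21 the missing attribute is filled in before onnx-graphsurgeon imports it.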
## Summary by CodeRabbit

* **Chores**
  * Relaxed ONNX version constraints in example configurations to support a wider range of compatible versions.
* **Bug Fixes**
  * Added compatibility support for ONNX to ensure proper functionality across different library versions.

Signed-off-by: Hrishith Thadicherla <hthadicherla@nvidia.com>