
fix: handling of default attrs in SimplifiedLayerNormalization + LayerNormalization 🐛 #2396


Open · wants to merge 7 commits into main

Conversation

KarelZe (Contributor) commented Jun 17, 2025

SkipLayerNormFusion currently does not fuse ops if stash_type is at its default (=1) or epsilon is at its default (=1e-5) for LayerNormalization and SimplifiedLayerNormalization.

This PR:

  • fixes the handling of default attrs in LayerNormalization and SimplifiedLayerNormalization (a sketch of the approach follows below)
  • adds a BART encoder as a new test model. I added this model because some of its stash_type attributes are at default. The model is versatile and can also be used to test other fusions, e.g. EmbedLayerNormalization.
  • allows for commuted inputs.

Closes #2378.
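
For context, a minimal sketch of the approach, assuming onnxscript's pattern-rewriter API; the callback signature, variable names, and the fused-op call are illustrative, not the exact code in this PR. Instead of binding epsilon in the pattern signature, the rewrite function reads it off the matched node and falls back to the ONNX default when the attribute was omitted:

    # Hypothetical rewrite callback; names are illustrative.
    def rewrite(op, input, skip, gamma, beta, normalized, **_):
        # normalized is the value produced by the matched LayerNormalization node.
        layer_norm_node = normalized.producer()
        # Assumes get_float returns None when the attribute is absent.
        epsilon = layer_norm_node.attributes.get_float("epsilon")
        if epsilon is None:
            epsilon = 1e-5  # ONNX default for LayerNormalization's epsilon
        return op.SkipLayerNormalization(
            input,
            skip,
            gamma,
            beta,
            epsilon=epsilon,
            _domain="com.microsoft",
        )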

@shubhambhokare1 @justinchuby Could you please review? Any feedback is greatly appreciated.


codecov bot commented Jun 17, 2025

Codecov Report

Attention: Patch coverage is 22.22222% with 203 lines in your changes missing coverage. Please review.

Project coverage is 62.37%. Comparing base (59340c6) to head (edad0ca).

Files with missing lines                                 Patch %   Lines
...cript/rewriter/ort_fusions/models/_bart_encoder.py    16.80%    201 Missing and 2 partials ⚠️

❗ There is a different number of reports uploaded between BASE (59340c6) and HEAD (edad0ca): HEAD has 19 fewer uploads than BASE (20 vs. 1).
Additional details and impacted files
@@            Coverage Diff             @@
##             main    #2396      +/-   ##
==========================================
- Coverage   70.37%   62.37%   -8.01%     
==========================================
  Files         199      200       +1     
  Lines       25216    25473     +257     
  Branches     2686     2688       +2     
==========================================
- Hits        17747    15888    -1859     
- Misses       6540     8762    +2222     
+ Partials      929      823     -106     


skip_sum_pattern_2 = op.Add(input, skip)
skip_sum = pattern.OrValue([skip_sum_pattern_1, skip_sum_pattern_2], name="skip_sum")

skip_sum = op.Add(input, skip)
if self._has_bias and not self._bias_pre_add:
    skip_sum = op.Add(skip_sum, bias)
KarelZe (Contributor, Author) commented Jun 17, 2025

I chose to enable commute(...), as we only checked for all variants in the Add above, not in this later addition.
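
For illustration, a hedged sketch of the two options (rule names are hypothetical; assumes RewriteRule.commute() returns the input-reordered variants of a rule):

    # Option 1: enumerate Add's input orderings explicitly in the pattern,
    # as already done for the first Add (see the snippet above).
    skip_sum_pattern_1 = op.Add(skip, input)
    skip_sum_pattern_2 = op.Add(input, skip)
    skip_sum = pattern.OrValue([skip_sum_pattern_1, skip_sum_pattern_2], name="skip_sum")

    # Option 2: derive the commuted variants of the whole rule, which also
    # covers the later bias Add without enumerating orderings by hand.
    rules = pattern.RewriteRuleSet([*_skip_layer_norm_rule.commute()])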

@KarelZe KarelZe marked this pull request as ready for review June 17, 2025 12:13
@KarelZe KarelZe marked this pull request as draft June 17, 2025 12:47
@justinchuby justinchuby requested review from gramalingam and Copilot and removed request for gramalingam June 17, 2025 15:57
justinchuby (Collaborator) commented:

@gramalingam

    encoder_layers_0_self_attn_layer_norm_weight
)

encoder_layers_1_fc2_bias = opset20.Identity(encoder_layers_0_self_attn_k_proj_bias)

Check warning (Code scanning / CodeQL): Variable defined multiple times

This assignment to 'encoder_layers_1_fc2_bias' is unnecessary as it is redefined before this value is used.

encoder_layers_1_fc2_bias = opset20.Identity(encoder_layers_0_self_attn_k_proj_bias)
encoder_layers_1_fc1_bias = opset20.Identity(encoder_layers_0_fc1_bias)
encoder_layers_1_self_attn_layer_norm_bias = opset20.Identity(

Check warning (Code scanning / CodeQL): Variable defined multiple times

This assignment to 'encoder_layers_1_self_attn_layer_norm_bias' is unnecessary as it is redefined before this value is used.
Copilot AI (Contributor) left a comment

Pull Request Overview

This PR fixes how default attributes (epsilon, stash_type) are handled in both LayerNormalization and SimplifiedLayerNormalization fusions, adds a BART encoder model to the fusion tests, and introduces commuted-input support for SkipLayerNormalization rules.

  • Extract default epsilon from the matched node instead of requiring it in the pattern signature
  • Add test_bart_encoder to validate fusion with default-attribute cases
  • Enable commuted-input variants by applying .commute() to fusion rules

Reviewed Changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.

File                         Description
skip_normalization_test.py   Added test_bart_encoder to cover default-attribute fusions
skip_normalization.py        Refactored patterns to drop default attrs, extract epsilon in rewrite, and apply rule commutation

Comments suppressed due to low confidence (2)

onnxscript/rewriter/ort_fusions/skip_normalization_test.py:73

  • The test uses fuse_skip_layer_normalization(model) but there is no import for that symbol in this file. Please add from onnxscript.rewriter.ort_fusions.skip_normalization import fuse_skip_layer_normalization (or adjust the import path) to ensure the function is available.
        fuse_skip_layer_normalization(model)

onnxscript/rewriter/ort_fusions/skip_normalization.py:231

  • The new .commute() calls are applied only to the full SkipLayerNormalization rules. To allow commuted inputs for SkipSimplifiedLayerNormalization as well, you should apply .commute() to the simplified-layer ruleset (if defined) or include those here before applying apply_fusion_rules.
skip_layer_normalization_ruleset = pattern.RewriteRuleSet(
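
A minimal sketch of what this comment suggests, with hypothetical rule names (assuming .commute() returns a sequence of commuted rule variants):

    skip_layer_normalization_ruleset = pattern.RewriteRuleSet(
        [
            # Include commuted variants of both rule families before
            # apply_fusion_rules consumes the ruleset.
            *_skip_layer_norm_rule.commute(),
            *_skip_simplified_layer_norm_rule.commute(),
        ]
    )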

    **_,
):
    epsilon = simplified_layer_norm.producer().attributes.get_float("epsilon")
Copilot AI commented Jun 17, 2025

You extract epsilon from the matched node but do not extract or forward stash_type. If a non-default stash_type was used, it will be lost in the fused op. Consider retrieving stash_type = simplified_layer_norm.producer().attributes.get_int("stash_type") and passing it into SkipSimplifiedLayerNormalization.

Suggested change:

  epsilon = simplified_layer_norm.producer().attributes.get_float("epsilon")
+ stash_type = simplified_layer_norm.producer().attributes.get_int("stash_type")
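
Completing the suggestion as a hedged sketch (illustrative only; whether the com.microsoft SkipSimplifiedLayerNormalization contract actually accepts a stash_type attribute should be verified against the op spec before forwarding it):

    epsilon = simplified_layer_norm.producer().attributes.get_float("epsilon")
    stash_type = simplified_layer_norm.producer().attributes.get_int("stash_type")
    return op.SkipSimplifiedLayerNormalization(
        input,
        skip,
        gamma,
        epsilon=epsilon,
        stash_type=stash_type,  # forward only if the fused op supports it
        _domain="com.microsoft",
    )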



if self._has_bias and not self._bias_pre_add:
    skip_sum = op.Add(skip_sum, bias)

normalized = op.LayerNormalization(
    skip_sum,
    gamma,
    beta,
KarelZe (Contributor, Author) commented:

@gramalingam beta is an optional input. I'd lean toward matching both variants (with and without beta).
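
For illustration, one way to match both variants, reusing the pattern.OrValue approach from the Add patterns above (names are illustrative; not necessarily how this PR resolves the question):

    # Match LayerNormalization with and without the optional beta input.
    normalized_with_beta = op.LayerNormalization(skip_sum, gamma, beta)
    normalized_without_beta = op.LayerNormalization(skip_sum, gamma)
    normalized = pattern.OrValue(
        [normalized_with_beta, normalized_without_beta], name="normalized"
    )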

@KarelZe KarelZe marked this pull request as ready for review June 18, 2025 04:14
Labels: None yet

Successfully merging this pull request may close these issues:

  • Handling of default attrs of LayerNormalization in SkipLayerNormFusion