Conversation
Pull request overview
This pull request improves the Multi-Head Attention (MHA) pattern optimization by adding support for the NoT (No Transpose) variant, which occurs when using FusedMatMul operators instead of regular MatMul with Transpose. The PR corrects the capitalization from "noT" to "NoT" to align with established naming conventions (SW for Switch Where, GQA for Group Query Attention), and adds a comprehensive test to verify the optimization works correctly with FusedMatMul.
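To make the NoT variant concrete, here is a minimal NumPy illustration (not the repository code) of the numerical equivalence it relies on: the `com.microsoft` FusedMatMul operator folds the transpose into the matrix product via its `transA`/`transB` attributes, so the explicit Transpose node disappears from the graph. The shapes below are illustrative assumptions.

```python
import numpy as np

# Illustration of the equivalence behind the NoT ("No Transpose") variant:
# FusedMatMul folds the Transpose into the MatMul, so no Transpose node
# appears in the graph. Shapes here are hypothetical (batch, seq, head_dim).
rng = np.random.default_rng(0)
q = rng.standard_normal((2, 4, 8))
k = rng.standard_normal((2, 4, 8))

# Regular pattern: an explicit Transpose(K) followed by MatMul.
scores_with_transpose = q @ k.transpose(0, 2, 1)

# FusedMatMul(transB=1) computes the same product with no Transpose node;
# einsum over the shared last axis models that fused contraction.
scores_fused = np.einsum("bij,bkj->bik", q, k)

assert np.allclose(scores_with_transpose, scores_fused)
```

Because the Transpose node is absent, the attention pattern matcher needs the dedicated NoT suffix to still recognize the subgraph.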
Changes:
- Fixed capitalization of the no-transpose suffix from "noT" to "NoT" in FunctionAttentionPattern.apply()
- Added "NoT_to" prefix to MultiHeadAttention3DPattern's _prefixes_operator_name to recognize LocalAttentionNoT operators
- Added test case test_multi_head_attention_fused_matmul to verify FusedMatMul-based attention patterns are correctly optimized to MultiHeadAttention
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 1 comment.
| File | Description |
|---|---|
| experimental_experiment/xoptim/patterns_ort/llm_optim.py | Added NoT_to prefix to MultiHeadAttention3DPattern to recognize LocalAttentionNoT variants created when FusedMatMul is used |
| experimental_experiment/xoptim/patterns/onnx_attention.py | Fixed capitalization from "noT" to "NoT" in FunctionAttentionPattern.apply() for consistency, added clarifying comment for FusedMatMul branch |
| _unittests/ut_xoptim/test_graph_pattern_optimization_ort.py | Added comprehensive test for FusedMatMul-based multi-head attention pattern optimization |
```python
_prefixes_operator_name = (
    f"{FunctionAttentionPattern._operator_name}_to",
    f"{FunctionAttentionPattern._operator_name}sQ_to",
    f"{FunctionAttentionPattern._operator_name}SW_to",
    f"{FunctionAttentionPattern._operator_name}SWsQ_to",
    f"{FunctionAttentionPattern._operator_name}NoT_to",
```
The `_prefixes_operator_name` tuple is missing combination prefixes. Since the SW and NoT suffixes can occur together (when `switch_where=True` and `transpose=None`), the pattern should also include:

```python
f"{FunctionAttentionPattern._operator_name}SWNoT_to",
```

Additionally, lines 1107 and 1109 appear incorrect. `FunctionAttentionPattern` never creates `LocalAttentionsQ` or `LocalAttentionSWsQ` nodes; the `sQ` suffix only appears in the GQA variants (`GQAsQ`), not as a standalone pattern. These lines should likely be removed or replaced with GQA-related prefixes.
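The suggested fix can be sketched as follows. This is a hypothetical illustration, not the repository code: it assumes `FunctionAttentionPattern._operator_name` is `"LocalAttention"` (consistent with the operator names discussed in this review), drops the standalone `sQ` entries, and adds the missing `SWNoT` combination.

```python
# Hypothetical stand-in for FunctionAttentionPattern._operator_name,
# assumed to be "LocalAttention" per the operator names in this review.
_operator_name = "LocalAttention"

# Suggested prefix tuple: standalone sQ variants dropped,
# the SWNoT combination (switch_where=True and transpose=None) added.
_prefixes_operator_name = (
    f"{_operator_name}_to",       # base pattern
    f"{_operator_name}SW_to",     # Switch Where variant
    f"{_operator_name}NoT_to",    # No Transpose variant (FusedMatMul)
    f"{_operator_name}SWNoT_to",  # both suffixes combined
)

def is_local_attention(op_type: str) -> bool:
    """Return True if op_type starts with one of the recognized prefixes."""
    # str.startswith accepts a tuple of prefixes directly.
    return op_type.startswith(_prefixes_operator_name)
```

A quick check: `is_local_attention("LocalAttentionSWNoT_to_MHA")` now matches, while a name such as `"LocalAttentionsQ_to_MHA"` does not.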
OK, if you can add a unit test for each of those.