Skip to content

Conversation

@whitneywhtsang
Copy link
Contributor

Please do not squash and merge this PR.

etiotto and others added 2 commits August 28, 2025 07:23
When the B operand of a chained dot operation is defined by a tt.load
followed by a tt.trans the current implementation fails to set the
"one-matric-per-load" attribute. This PR fixes this problem.

Signed-off-by: Ettore Tiotto <ettore.tiotto@intel.com>
(cherry picked from commit cd493ff)
`AccelerateMatmul` pass adds `one_matrix_per_load` attribute to chain
dot's B operand load.
We would like to keep the attribute when fusing with `tt.trans` in
`OptimizeDotOperands` pass.

Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>
(cherry picked from commit b335d39)
@whitneywhtsang whitneywhtsang merged commit 6020778 into release/3.5.x Aug 28, 2025
19 of 21 checks passed
@whitneywhtsang whitneywhtsang deleted the whitneywhtsang/release branch August 28, 2025 15:01
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

5 participants