[CUDA][cuBLASLt] addmm -- extend bias fusions to cases with (1 by n) shapes #166307

nikitaved · 2025-10-27T15:00:11Z

Stack from ghstack (oldest at bottom):

cc @ptrblck @msaroufim @eqy @jerryzh168

[ghstack-poisoned]

pytorch-bot · 2025-10-27T15:00:15Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166307

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit 2e05c55 with merge base 030de07 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

eqy

is it worth parametrizing the test(s) to try both 1xm and n-shaped bias or is that excessive?

eqy · 2025-10-27T15:04:37Z

aten/src/ATen/native/cuda/Blas.cpp

+    && (
+      self.is_contiguous() &&
+      // NOTE: fine to have 1-len dims to the left from the leading one
+      self.dim() <= result.dim() && self.squeeze().dim() == 1 &&


cool use of squeeze here

eqy · 2025-10-27T15:04:38Z

test/test_linalg.py

        self._test_addmm_addmv(func, M, m1, m2, activation=activation)

-        # vector-shaped bias and beta=1 result in epilogue fusion in CUDA
+        # vector-shaped bias (or with 1-len dims on the left from the leading dim)


is this an "or" or have we changed the case from vector-shaped bias to 1 x n one?

It is an or. Motivated by the "expected fusions" tests from Inductor -- and we can fuse these broadcast biases safely.

[ghstack-poisoned]

nikitaved · 2025-10-27T15:25:08Z

@eqy, thanks for the review! Do you mean in test_cuda_matmul.py? Yes, it would not hurt extending there -- will do. Otherwise test_linalg.py offers quite a comprehensive coverage.

[ghstack-poisoned]

nikitaved · 2025-10-28T14:54:03Z

@eqy, the testing is expanded. Let me know if there is anything else we'd rather do before I merge.

[ghstack-poisoned]

nikitaved · 2025-10-31T13:35:46Z

@pytorchbot merge

pytorchmergebot · 2025-10-31T13:37:37Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

…shapes (#166307) Pull Request resolved: #166307 Approved by: https://github.com/eqy

Update

a7ec71b

[ghstack-poisoned]

nikitaved requested review from IvanYashchuk, drisspg, lezcano and slayton58 as code owners October 27, 2025 15:00

pytorch-bot bot added ciflow/b200 ciflow/h100 ciflow/rocm Trigger "default" config CI on ROCm release notes: linalg_frontend release notes category labels Oct 27, 2025

This was referenced Oct 27, 2025

[Inductor] addmm with bias -> unfuse bias if there is a pointwise/reduction consumer #166165

Open

[Inductor] refine the logic in addmm -> mm + add #166170

Draft

[Inductor] refine the logic in (mm + bias) -> addmm #166300

Draft

nikitaved requested review from eqy and removed request for IvanYashchuk and lezcano October 27, 2025 15:00

nikitaved added ciflow/trunk Trigger trunk jobs on your pull request module: cuda Related to torch.cuda, and CUDA support in general labels Oct 27, 2025

pytorchbot added the open source label Oct 27, 2025

eqy approved these changes Oct 27, 2025

View reviewed changes

Update

ad49cc4

[ghstack-poisoned]

nikitaved added this to PyTorch + CUDA Oct 27, 2025

nikitaved moved this to In Progress in PyTorch + CUDA Oct 27, 2025

nikitaved self-assigned this Oct 27, 2025

nikitaved added 5 commits October 27, 2025 17:15

Update

4543377

[ghstack-poisoned]

Update

69555cf

[ghstack-poisoned]

Update

078f237

[ghstack-poisoned]

Update

74316d1

[ghstack-poisoned]

Update

e8ea596

[ghstack-poisoned]

Update

5cbd4bc

[ghstack-poisoned]

nikitaved mentioned this pull request Oct 28, 2025

[Inductor] refactoring: retire is_pointwise_use #166402

Draft

nikitaved added 11 commits October 28, 2025 15:08

Update

e6abd98

[ghstack-poisoned]

Update

20aa65a

[ghstack-poisoned]

Update

bb2c86f

[ghstack-poisoned]

Update

8602d03

[ghstack-poisoned]

Update

daf5a66

[ghstack-poisoned]

Update

15849fe

[ghstack-poisoned]

Update

2759ec4

[ghstack-poisoned]

Update

4ff38ff

[ghstack-poisoned]

Update

4db6a97

[ghstack-poisoned]

Update

51ef97d

[ghstack-poisoned]

Update

2e05c55

[ghstack-poisoned]

pytorchmergebot added the merging label Oct 31, 2025

pytorchmergebot added the Merged label Oct 31, 2025

pytorchmergebot closed this in 034e951 Oct 31, 2025

github-project-automation bot moved this from In Progress to Done in PyTorch + CUDA Oct 31, 2025

pytorchmergebot removed the merging label Oct 31, 2025

BoyuanFeng pushed a commit that referenced this pull request Oct 31, 2025

[CUDA][cuBLASLt] addmm -- extend bias fusions to cases with (1 by n) …

22e197e

…shapes (#166307) Pull Request resolved: #166307 Approved by: https://github.com/eqy

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[CUDA][cuBLASLt] addmm -- extend bias fusions to cases with (1 by n) shapes #166307

[CUDA][cuBLASLt] addmm -- extend bias fusions to cases with (1 by n) shapes #166307

Uh oh!

nikitaved commented Oct 27, 2025 •

edited

Loading

Uh oh!

pytorch-bot bot commented Oct 27, 2025 •

edited

Loading

Uh oh!

eqy left a comment •

edited

Loading

Uh oh!

eqy Oct 27, 2025

Uh oh!

eqy Oct 27, 2025 •

edited

Loading

Uh oh!

nikitaved Oct 27, 2025 •

edited

Loading

Uh oh!

nikitaved commented Oct 27, 2025

Uh oh!

nikitaved commented Oct 28, 2025

Uh oh!

nikitaved commented Oct 31, 2025

Uh oh!

pytorchmergebot commented Oct 31, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

[CUDA][cuBLASLt] addmm -- extend bias fusions to cases with (1 by n) shapes #166307

[CUDA][cuBLASLt] addmm -- extend bias fusions to cases with (1 by n) shapes #166307

Uh oh!

Conversation

nikitaved commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pytorch-bot bot commented Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/166307

✅ No Failures

Uh oh!

eqy left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

eqy Oct 27, 2025

Choose a reason for hiding this comment

Uh oh!

eqy Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nikitaved Oct 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

nikitaved commented Oct 27, 2025

Uh oh!

nikitaved commented Oct 28, 2025

Uh oh!

nikitaved commented Oct 31, 2025

Uh oh!

pytorchmergebot commented Oct 31, 2025

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

nikitaved commented Oct 27, 2025 •

edited

Loading

pytorch-bot bot commented Oct 27, 2025 •

edited

Loading

eqy left a comment •

edited

Loading

eqy Oct 27, 2025 •

edited

Loading

nikitaved Oct 27, 2025 •

edited

Loading