[Pytorch][Vulkan] Add baddbmm #109851

tina134 · 2023-09-22T02:57:58Z

Summary:
Similar implementation like BMM & ADDMM, the bias tensor is using the packed weights, similar to MM, but increases the index via the z-dim to get more matrices in the batch.

Packed bias (input of MM):

ivec3 pos(k_, j_, 0);
float v = texelFetch(uInput, pos, 0)
# v.xyzw are 4 numbers in one matrix
# no batch
# k_, j_ has only 1/4 of the range as the original matrix size (H*W matrix i=> H/2*W/2*1 3D Image).

Packed bias (input of BMM):

ivec3 pos(k_, j_, i);
float v = texelFetch(uInput, pos, 0)
# v.xyzw are 4 numbers in one matrix
# i as batch id

To support broadcasting, the bias packing of mm is slightly different than weight packing, which repeats the single element in height-dim twice to fill the 4 planes (see code for details). The width-dim doesn’t repeat twice, but the code still works, because stacking 3 planes together with the last one empty yields the same 3D image.
However, this doesn’t work for bmm, since it’s a series of {4 planes} {4 planes} … {4 planes}, and each {4 planes} represents a matrix, so only 3 planes completely mess up the indexing. Thus, I repeat the single element in width-dim as well to fill all 4 planes to have the correct indexing.

https://pytorch.org/docs/stable/generated/torch.baddbmm.html

Test Plan:

[ttingchulin@27298.od /data/sandcastle/boxes/fbsource (bmm)]$ LD_LIBRARY_PATH=third-party/swiftshader/lib/linux-x64/ buck run fbcode/mode/dev-nosan //xplat/caffe2:pt_vulkan_api_test_bin

Reviewed By: yipjustin

Differential Revision: D49402181

Summary: Similar implementation like BMM & ADDMM, the bias tensor is using the packed weights, similar to MM, but increases the index via the z-dim to get more matrices in the batch. Packed bias (input of MM): ``` ivec3 pos(k_, j_, 0); float v = texelFetch(uInput, pos, 0) # v.xyzw are 4 numbers in one matrix # no batch # k_, j_ has only 1/4 of the range as the original matrix size (H*W matrix i=> H/2*W/2*1 3D Image). ``` Packed bias (input of BMM): ``` ivec3 pos(k_, j_, i); float v = texelFetch(uInput, pos, 0) # v.xyzw are 4 numbers in one matrix # i as batch id ``` **To support broadcasting**, the bias packing of `mm` is slightly different than weight packing, which repeats the single element in height-dim twice to fill the 4 planes (see code for details). The width-dim doesn’t repeat twice, but the code still works, because stacking 3 planes together with the last one empty yields the same 3D image. However, this doesn’t work for `bmm`, since it’s a series of `{4 planes} {4 planes} … {4 planes}`, and each `{4 planes}` represents a matrix, so only 3 planes completely mess up the indexing. Thus, I repeat the single element in width-dim as well to fill all 4 planes to have the correct indexing. https://pytorch.org/docs/stable/generated/torch.baddbmm.html Test Plan: ``` [ttingchulin@27298.od /data/sandcastle/boxes/fbsource (bmm)]$ LD_LIBRARY_PATH=third-party/swiftshader/lib/linux-x64/ buck run fbcode/mode/dev-nosan //xplat/caffe2:pt_vulkan_api_test_bin ``` Reviewed By: yipjustin Differential Revision: D49402181

pytorch-bot · 2023-09-22T02:58:00Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/109851

📄 Preview Python docs built from this PR
📄 Preview C++ docs built from this PR
❓ Need help or want to give feedback on the CI? Visit the bot commands wiki or our office hours

Note: Links to docs will display an error until the docs builds have been completed.

✅ You can merge normally! (2 Unrelated Failures)

As of commit ff78668 with merge base e1d7123 ():

FLAKY - The following job failed but was likely due to flakiness present on trunk:

win-vs2019-cuda11.8-py3 / test (default, 4, 4, windows.g5.4xlarge.nvidia.gpu) (gh)

UNSTABLE - The following job failed but was likely due to flakiness present on trunk and has been marked as unstable:

linux-focal-cuda11.8-py3.9-gcc9 / test (multigpu, 1, 1, linux.g5.12xlarge.nvidia.gpu, unstable) (gh)

This comment was automatically generated by Dr. CI and updates every 15 minutes.

facebook-github-bot · 2023-09-22T02:58:08Z

This pull request was exported from Phabricator. Differential Revision: D49402181

facebook-github-bot · 2023-09-22T20:32:49Z

@pytorchbot merge

(Initiating merge automatically since Phabricator Diff has merged)

pytorchmergebot · 2023-09-22T20:34:30Z

Merge started

Your change will be merged once all checks pass (ETA 0-4 Hours).

Learn more about merging in the wiki.

Questions? Feedback? Please reach out to the PyTorch DevX Team

Advanced Debugging

Check the merge workflow status
here

pytorch-bot bot added ciflow/periodic Trigger jobs ran periodically on master (periodic.yml) on the PR module: vulkan release notes: vulkan release notes category labels Sep 22, 2023

facebook-github-bot added the fb-exported label Sep 22, 2023

yipjustin approved these changes Sep 22, 2023

View reviewed changes

pytorch-bot bot added the ciflow/trunk Trigger trunk jobs on your pull request label Sep 22, 2023

pytorchmergebot added the merging label Sep 22, 2023

pytorchmergebot added Merged and removed merging labels Sep 22, 2023

pytorchmergebot closed this in 411ca10 Sep 22, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Pytorch][Vulkan] Add baddbmm #109851

[Pytorch][Vulkan] Add baddbmm #109851

Uh oh!

tina134 commented Sep 22, 2023

Uh oh!

pytorch-bot bot commented Sep 22, 2023 •

edited

Loading

Uh oh!

facebook-github-bot commented Sep 22, 2023

Uh oh!

facebook-github-bot commented Sep 22, 2023

Uh oh!

pytorchmergebot commented Sep 22, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

[Pytorch][Vulkan] Add baddbmm #109851

[Pytorch][Vulkan] Add baddbmm #109851

Uh oh!

Conversation

tina134 commented Sep 22, 2023

Uh oh!

pytorch-bot bot commented Sep 22, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/109851

✅ You can merge normally! (2 Unrelated Failures)

Uh oh!

facebook-github-bot commented Sep 22, 2023

Uh oh!

facebook-github-bot commented Sep 22, 2023

Uh oh!

pytorchmergebot commented Sep 22, 2023

Merge started

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

pytorch-bot bot commented Sep 22, 2023 •

edited

Loading