[Topi] Allow batch_matmul to broadcast along batch dimension. #6616

jwfromm · 2020-10-02T18:44:46Z

We found that requiring explicit broadcasting along the batch dimension for batch_matmul could cause serious memory issues during constant folding, since it effectively multiplies the size of the weights by the input batch size. This PR allows implicit broadcasting along the batch dimension for batch_matmul without increasing compute or memory requirements. It should also give significant speedups in cases where we previously applied explicit broadcasting. I also noticed that we had an unused C++ definition of batch_matmul and removed it to prevent confusion.
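As a rough illustration of what implicit broadcasting along the batch dimension means, here is a minimal pure-Python sketch. It is not the code from this PR. It assumes the layout x: (XB, M, K) and y: (YB, N, K) with output (max(XB, YB), M, N), and the function name `batch_matmul_broadcast` is made up for this example.

```python
def batch_matmul_broadcast(x, y):
    """Reference batch matmul on nested lists (illustrative only).

    Assumed layout: x is (XB, M, K), y is (YB, N, K), and
    out[b][i][j] = sum over k of x[b][i][k] * y[b][j][k].
    A batch size of 1 on either side is reused for every output batch.
    """
    xb, yb = len(x), len(y)
    if xb != yb and xb != 1 and yb != 1:
        raise ValueError("batch dimensions must match or be 1")
    batch = max(xb, yb)
    out = []
    for b in range(batch):
        # Index 0 is reused when an operand has batch size 1, so no
        # broadcast copy of that operand is ever materialized.
        xm = x[b if xb != 1 else 0]
        ym = y[b if yb != 1 else 0]
        out.append(
            [[sum(xv * yv for xv, yv in zip(xrow, yrow)) for yrow in ym] for xrow in xm]
        )
    return out


# Two input batches share a single (1, N, K) weight.
x = [[[1, 2], [3, 4]], [[5, 6], [7, 8]]]  # shape (2, 2, 2)
w = [[[1, 0], [0, 1], [1, 1]]]            # shape (1, 3, 2)
result = batch_matmul_broadcast(x, w)
```

The weight is stored once and indexed at batch 0 for every output batch, which is why neither memory nor compute grows compared with the explicitly broadcast version.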

jwfromm · 2020-10-02T18:45:27Z

@mbrookhart, @csullivan, @rkimball can you guys take a look at this PR?

python/tvm/topi/x86/batch_matmul.py

src/relay/op/nn/nn.cc

tests/python/topi/python/test_topi_dense.py

csullivan

Thanks @jwfromm !

One comment regarding memory use. To get the best of both worlds when a vendor library (e.g. rocBLAS) is used for batch_matmul, there is one more case to handle: if the library primitive doesn't support implicit broadcast, we will still see excessive memory use from the folded constants. Can you think of a clean solution for that case? My only idea at the moment is to disable constant folding there, but that kind of coupling between optimization passes and the primitives supported by the codegen/runtime isn't great.
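To make the size of the problem concrete, the sketch below compares the bytes held by a folded weight constant with and without explicit broadcasting. The sizes (a 1024 x 1024 float32 weight, input batch of 64) and the helper name `folded_constant_bytes` are hypothetical and chosen only for illustration.

```python
def folded_constant_bytes(batch, n, k, dtype_bytes=4, explicit_broadcast=True):
    """Bytes held by a folded weight constant of logical shape (batch, N, K)."""
    # With explicit broadcasting, constant folding stores one copy per batch entry.
    copies = batch if explicit_broadcast else 1
    return copies * n * k * dtype_bytes


# Hypothetical sizes: 1024 x 1024 float32 weight, input batch of 64.
explicit = folded_constant_bytes(64, 1024, 1024, explicit_broadcast=True)
implicit = folded_constant_bytes(64, 1024, 1024, explicit_broadcast=False)
ratio = explicit // implicit
```

Under these assumptions the explicitly broadcast constant takes 256 MiB against 4 MiB for the single shared copy, so the overhead scales linearly with the batch size.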

tests/python/topi/python/test_topi_batch_matmul.py

tests/python/topi/python/test_topi_dense.py

tqchen · 2020-10-03T01:53:00Z

cc @yzhliu @icemelon9 @masahi

jwfromm · 2020-10-04T20:35:43Z

I need to make a few fixes after the merge with the dynamic shapes PR, which involved several changes to batch_matmul.

jwfromm · 2020-10-05T19:28:34Z

tests/python/frontend/onnx/test_forward.py

@@ -3628,7 +3628,6 @@ def verify_roi_align(
    test_clip_min_max_as_inputs()
    test_onehot()
    test_matmul()
-    test_batch_matmul()


Just wanted to note that I removed this since test_batch_matmul is now run with tvm.testing.parametrize, which means it will cause an error when run using python instead of pytest.
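The failure mode can be sketched without TVM or pytest. A parametrized test takes its parameters as function arguments, which the test runner supplies; a bare call from a `__main__` block supplies none. The function name `run_batch_matmul_case` and its `(target, ctx)` signature are assumptions standing in for the real test.

```python
def run_batch_matmul_case(target, ctx):
    # Stand-in body; the real test would build and run a model on the target.
    return f"ran on {target}"


# A test runner supplies the arguments, e.g. once per enabled target.
outputs = [run_batch_matmul_case(t, c) for t, c in [("llvm", "cpu(0)")]]

# A bare call, as in the removed line of the diff, supplies no arguments.
try:
    run_batch_matmul_case()
    error = None
except TypeError as exc:
    error = exc
```

Here `error` ends up holding a `TypeError` about missing positional arguments, which is the kind of error a direct `python` invocation would hit.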

mbrookhart

LGTM

jwfromm · 2020-10-06T15:23:46Z

@masahi can you take a quick look at this PR?

masahi · 2020-10-06T21:07:56Z

Thanks @jwfromm @mbrookhart @csullivan @rkimball

…#6616)

* Allow batch_matmul to broadcast along batch dimension.
* Added typerel checking.
* Fix style issue and respond to feedback.
* Fix style.
* More formatting issues :(
* Fix issues after merge.
* Comment update.
* Small tweak.

jwfromm added 2 commits October 2, 2020 18:34

Allow batch_matmul to broadcast along batch dimension.

87128ed

Added typerel checking.

e0841dd

rkimball suggested changes Oct 2, 2020

python/tvm/topi/x86/batch_matmul.py

src/relay/op/nn/nn.cc (outdated)

tests/python/topi/python/test_topi_dense.py (outdated)

csullivan reviewed Oct 2, 2020

tests/python/topi/python/test_topi_batch_matmul.py

tests/python/topi/python/test_topi_dense.py (outdated)

jwfromm added 3 commits October 2, 2020 19:16

Fix style issue and respond to feedback.

2307658

Fix style.

a3cce46

More formatting issues :(

a522773

tqchen added the status: need review label Oct 3, 2020

Merge branch 'master' into broadcast_matmul

9091a74

rkimball approved these changes Oct 5, 2020

jwfromm added 3 commits October 5, 2020 19:24

Fix issues after merge.

f4cc296

Comment update.

5c94796

Small tweak.

d6d7afe

jwfromm commented Oct 5, 2020

mbrookhart approved these changes Oct 5, 2020

masahi approved these changes Oct 6, 2020

masahi merged commit 889fac1 into apache:master Oct 6, 2020

masahi mentioned this pull request Jan 27, 2021

[Torch] Various updates for PyTorch frontend #7348

Merged

masahi mentioned this pull request Mar 24, 2021

[Bug] Missing broadcast_to before batch_matmul for CuBLAS #7730

Closed

junrushao mentioned this pull request Nov 1, 2021

Apache TVM v0.8 Release Note Candidate #9416

Closed

jwfromm deleted the broadcast_matmul branch April 12, 2023 15:54
