Skip to content

[Unity][BYOC] Support ND A and 2D B batched matmul for cublasLt#15499

Merged
csullivan merged 2 commits intoapache:unityfrom
masahi:cublaslt-batch
Aug 7, 2023
Merged

[Unity][BYOC] Support ND A and 2D B batched matmul for cublasLt#15499
csullivan merged 2 commits intoapache:unityfrom
masahi:cublaslt-batch

Conversation

@masahi
Copy link
Member

@masahi masahi commented Aug 7, 2023

In our cublas BYOC where we use the newer cuBLASLt API, due to an API limitation ND x 2D or 2D x ND batched matmul have not been supported. But the former one can be easily supported by flattening all batch axes into the M axis and using the regular, non-batched GEMM.

@vinx13 @yelite @csullivan @sunggg

@tvm-bot
Copy link
Collaborator

tvm-bot commented Aug 7, 2023

Thanks for contributing to TVM! Please refer to the contributing guidelines https://tvm.apache.org/docs/contribute/ for useful information and tips. Please request code reviews from Reviewers by @-ing them in a comment.

Generated by tvm-bot

@csullivan csullivan merged commit e7cb3a8 into apache:unity Aug 7, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants