-
Notifications
You must be signed in to change notification settings - Fork 239
tests: add unit tests for grouped_gemm_multi_d persistent kernels #2941
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
ThomasNing
merged 21 commits into
develop
from
aviralgoel/grouped_gemm_persistent_double_buffer_multi_d
Oct 1, 2025
Merged
Changes from all commits
Commits
Show all changes
21 commits
Select commit
Hold shift + click to select a range
87ef10f
feat(grouped_gemm_multi_d): add new example that integrates grouped_g…
AviralGoelAMD e256bff
refactor: grouped_gemm_multi_d relies on grouped_gemm_kernel
AviralGoelAMD 941192f
tests(grouped_gemm): grouped_gemm test suite passes with minor adjust…
AviralGoelAMD b1c3652
fix: segfault fix by passing correct parameters for d tensors
AviralGoelAMD 8a5d97a
style: clang format
AviralGoelAMD 36d37f8
WIP: host code for grouped_gemm_multi_d persistent kernel compiles bu…
AviralGoelAMD b9b470a
feat(grouped_gemm_multi_d): add functionality to run persistant kernel
AviralGoelAMD b1afff1
feat(grouped_gemm_multi_d): add new example that integrates grouped_g…
AviralGoelAMD 7ec1bfc
refactor: grouped_gemm_multi_d relies on grouped_gemm_kernel
AviralGoelAMD f6377b8
tests(grouped_gemm): grouped_gemm test suite passes with minor adjust…
AviralGoelAMD ef1bc62
fix: segfault fix by passing correct parameters for d tensors
AviralGoelAMD 64897ed
style: clang format
AviralGoelAMD 9955532
fix: incorrect validation method and Dtensor layout in test suite
AviralGoelAMD d9552c0
tests: add unit tests for grouped_gemm_multi_d persistent kernels
AviralGoelAMD 593724f
parent 5b0af640369b93849335b126d6826b204ccc43a3
AviralGoelAMD 90cb4da
feat(grouped_gemm_multi_d): add new example that integrates grouped_g…
AviralGoelAMD 01beeb3
WIP: host code for grouped_gemm_multi_d persistent kernel compiles bu…
AviralGoelAMD 0c939a5
feat(grouped_gemm_multi_d): add functionality to run persistant kernel
AviralGoelAMD 0a9be17
fix: parameterize NumDTensor in GroupedGemmHostArgs and remove lint
AviralGoelAMD 6adc764
style: clang format
AviralGoelAMD 23fbb61
refactor: removed unused file
AviralGoelAMD File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.