64-bit interface for L3 functions #855

Merged
merged 7 commits into from
Apr 24, 2024

Conversation

daineAMD
Contributor

Sorry for the giant PR. This adds the interfaces needed for 64-bit support for all L3 functions (with the exception of trtri). It adds rocBLAS backend support for dgmm, trmm, and trsm, and will need fairly minimal changes to include the other functions as they are added to rocBLAS.
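
For readers unfamiliar with why a separate 64-bit interface matters, here is a minimal sketch that is not taken from this PR. It assumes column-major storage and uses two hypothetical helpers, `span_elements` and `needs_64bit_index`, to show when the element count of a matrix no longer fits in a 32-bit `int`, which is the situation a 64-bit interface is presumably meant to cover.

```cpp
#include <cstdint>
#include <limits>

// Hypothetical helper (not part of hipBLAS): number of elements spanned by a
// column-major matrix with leading dimension lda and n columns, in 64 bits.
constexpr std::int64_t span_elements(std::int64_t lda, std::int64_t n)
{
    return lda * n;
}

// Hypothetical helper: true when the span exceeds what a 32-bit signed int
// can represent, so 32-bit size arguments could not address every element.
constexpr bool needs_64bit_index(std::int64_t lda, std::int64_t n)
{
    return span_elements(lda, n) > std::numeric_limits<std::int32_t>::max();
}

// A 50000 x 50000 matrix has 2.5e9 elements, more than INT32_MAX (2147483647).
static_assert(needs_64bit_index(50000, 50000), "large matrix needs 64-bit indexing");
static_assert(!needs_64bit_index(1000, 1000), "small matrix fits in 32 bits");
```

The helpers only illustrate the arithmetic; the actual function signatures added by this PR are not shown in the conversation.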

Contributor

NaveenElumalaiAMD left a comment

LGTM, if the tests pass.

@@ -17,7 +17,7 @@ Tests:
uplo: [ 'L', 'U' ]
diag: [ 'N', 'U' ]
matrix_size: *size_range
- api: [ FORTRAN, C ]
+ api: [ FORTRAN, C , FORTRAN_64, C_64]
Contributor

The ILP64 version of trtri isn't done in rocBLAS yet, right? If that is the case, could we comment this out, just like for the other functions?

Contributor Author

Good catch, I've removed this. Didn't make other changes for trtri since I'm not sure if we're planning on 64-bit support for it or not.

clients/include/blas3/testing_trsm.hpp (outdated, resolved)
beta,
dC,
ldc));
// DAPI_EXPECT(HIPBLAS_STATUS_INVALID_VALUE,
Contributor

Should the commented out code be removed?

Contributor Author

Uncommented it and moved it into the if(arg.bad_arg_all) block, since the cuBLAS documentation is unclear on whether or not this is supported.
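
As a rough illustration of the gating pattern described in this reply (not the actual test code), the sketch below runs one bad-argument check unconditionally and a second one only when a `bad_arg_all` flag is set. `Status`, `Arguments`, `fake_l3_call`, and `run_bad_arg_checks` are hypothetical stand-ins, assumed here only to keep the example self-contained.

```cpp
#include <string>
#include <vector>

// Hypothetical stand-ins; these are not the real hipBLAS test types.
enum class Status { success, invalid_value };

struct Arguments
{
    bool bad_arg_all = true; // also run checks whose behaviour may differ by backend
};

// Hypothetical function under test: rejects null pointers.
inline Status fake_l3_call(const double* alpha, const double* c)
{
    if(alpha == nullptr || c == nullptr)
        return Status::invalid_value;
    return Status::success;
}

// Returns the names of the checks that were run and passed.
inline std::vector<std::string> run_bad_arg_checks(const Arguments& arg)
{
    std::vector<std::string> passed;
    double alpha = 1.0, c = 0.0;

    // Always checked: assumed to behave the same on every backend.
    if(fake_l3_call(nullptr, &c) == Status::invalid_value)
        passed.push_back("null alpha");

    // Only checked on request: assumed to be backend-dependent.
    if(arg.bad_arg_all)
    {
        if(fake_l3_call(&alpha, nullptr) == Status::invalid_value)
            passed.push_back("null C");
    }
    return passed;
}
```

With the flag cleared, only the first check runs, so a backend whose behaviour is undocumented for the second case cannot fail the test.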

Contributor

amcamd left a comment

This is a huge change, and very thorough to include the functions not yet implemented! Although it takes time to review, it will save a lot of time by having all the work in one PR.

daineAMD merged commit f41024c into ROCm:develop Apr 24, 2024
14 of 20 checks passed
daineAMD deleted the l364 branch April 24, 2024 21:47
3 participants