[5.3.X] TRMM functions do not have correct correspondence in hipBLAS #524
Comments
+ Added tests for SYRK, SYMM, HEMM, and TRSM v2 functions
+ Added tests for HERKX (eXtended HERK) and GEAM functions

[ToDo]
+ Mark TRMM functions as UNSUPPORTED (both in `hip` and `roc`) till ROCm/hipBLAS#524 is fixed
+ File an analogous bug issue to rocBLAS
Hi @emankov, I've made some comments in ROCm/rocBLAS#1265 regarding the rocBLAS implementation of these functions. The reasoning behind the difference for hipBLAS/cuBLAS is the same. Moving forward, we plan to introduce out-of-place functionality support (with the extra C and ldc parameters) for the hipblasXtrmm API. We announced deprecation of the current trmm API in hipBLAS 0.49 for ROCm 5.0. There is an open pull request at #504 which makes these requested changes. We plan on pushing this change into a future release of hipBLAS. Thanks again,
+ Mark rocBLAS TRMM functions rocblas_(s|d|c|z)trmm_outofplace as supported only for TRMM v2 CUDA analogues
+ Mark hipBLAS TRMM functions hipblas(S|D|C|Z)trmm as HIP_UNSUPPORTED
+ Regenerate and update docs and hipify-perl accordingly

[Reasons]
+ hipBLAS TRMM functions hipblas(S|D|C|Z)trmm actually match neither the cuBLAS TRMM functions nor the cuBLAS TRMM _v2 functions: ROCm/hipBLAS#524
+ There is a correspondence between cuBLAS cublas_(s|d|c|z)trmm and rocBLAS rocblas_(s|d|c|z)trmm_outofplace, not rocblas_(s|d|c|z)trmm: fixed it

[ToDo]
+ Close ROCm/rocBLAS#1265 as erroneous
+ Remove HIP_UNSUPPORTED mark from hipblas(S|D|C|Z)trmm functions after merging ROCm/hipBLAS#504
+ Add cublas2rocblas and update cublas2hipblas synthetic tests
The hipBLAS TRMM functions `hipblasStrmm`, `hipblasDtrmm`, `hipblasCtrmm`, and `hipblasZtrmm` match neither the cuBLAS TRMM functions nor the cuBLAS TRMM _v2 functions. For instance:
cublasStrmm:
cublasStrmm_v2:
hipblasStrmm:
The same goes for the rocBLAS analogues `rocblas_strmm`, `rocblas_dtrmm`, `rocblas_ctrmm`, and `rocblas_ztrmm` (ROCm/rocBLAS#1265). So, the above 4 hipBLAS and 4 rocBLAS functions are marked as `HIP UNSUPPORTED`.

[Solution]
Since hipBLAS doesn't support `v1` BLAS functions, add the two missing arguments, `float* C` and `int ldc`, to the hipBLAS TRMM functions and revise the functions' logic.