Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TRMM, remove deprecated inplace trmm to favor outofplace/inplace trmm API #617

Merged
merged 2 commits into from
Jul 10, 2023

Conversation

amcamd
Copy link
Contributor

@amcamd amcamd commented Jun 21, 2023

  • this PR does 3 things
    • removes the macro HIPBLAS_V1 and code used for deprecating trmm_inplace
    • copies from daineAMD PR hipblasTrmm API change to allow inplace and outofplace behaviour #504 to switch from trmm_inplace to trmm_outofplace. This contains most of the changes
    • call rocblas_Xtrmm in place of rocblas_Xtrmm_outofplace. This needs the changes in rocBLAS-internal PR # 1926
  • This PR will fail tests untill rocBLAS-internal PR # 1926 is merged

@amcamd amcamd added the noCI Disable Jenkins for this PR. label Jun 21, 2023
Copy link
Contributor

@TorreZuk TorreZuk left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM but @daineAMD should review as had linked PR.

Comment on lines 1 to 4
/* ************************************************************************
* Copyright (C) 2016-2023 Advanced Micro Devices, Inc. All rights reserved.
*
* Permission is hereby granted, free of charge, to any person obtaining a copy
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why wasn't this just converted to show the new API use?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for catching this. Added back example_strmm.cpp in next commit.

Comment on lines -28 to -30
#ifndef HIPBLAS_NO_DEPRECATED_WARNINGS
#define HIPBLAS_NO_DEPRECATED_WARNINGS
#endif
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Maybe just comment out for next deprecation use?

@@ -90,3 +90,49 @@ Packed int8x4 was removed as support for arbitrary dimensioned int8_t data is a
* enum hipblasInt8Datatype_t was removed
* function hipblasSetInt8Datatype was removed
* function hipblasGetInt8Datatype was removed

Legacy BLAS in-place trmm
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are you adding another Removed in hipBLAS 2.0 section for the changes here?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for finding this, added Removed in hipBLAS 2.0 section in next commit.

Comment on lines 14573 to 14609
Note that trmm can provide in-place functionality by passing in the same address for both
matrices B and C.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

When B == C, rocBLAS requires that ldb == ldc, otherwise we return an error code. I think that cuBLAS doesn't do any of this checking, but would have to double check. Might be worth mentioning, or guide users to look at rocBLAS/cuBLAS documentation to avoid having to update this if backends ever change.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for finding this, added requires ldb == ldc in next commit.

@amcamd amcamd removed the noCI Disable Jenkins for this PR. label Jun 29, 2023
@amcamd amcamd merged commit 215876e into ROCm:develop Jul 10, 2023
9 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

3 participants