[ROCm] rocBLAS BLAS algorithms integration logic in ROCm StreamExecutor#20715
Closed
whchung wants to merge 4 commits intotensorflow:masterfrom
Closed
[ROCm] rocBLAS BLAS algorithms integration logic in ROCm StreamExecutor#20715whchung wants to merge 4 commits intotensorflow:masterfrom
whchung wants to merge 4 commits intotensorflow:masterfrom
Conversation
Contributor
Author
Contributor
|
cc @chsigg |
The commit contains following components to support TensorFlow on ROCm platform - bazel build system - continuous integration logic Authors: - Jack Chung: jack.chung@amd.com - Jeffrey Poznanovic: Jeffrey.Poznanovic@amd.com - Peng Sun: Peng.Sun@amd.com
4a09fa2 to
c12b7df
Compare
…ROCm 1) StreamInterface::CudaStreamMemberHack() Despite the fact that StreamExecutor and GPU common runtime are largely orthogonal, this particular routine in StreamExecutor is used in GPU common runtime and a couple of other operators. In this commit it's renamed as StreamInterface::GpuStreamMemberHack() and their call sites are also changed. 2) StreamExecutorInterface::CudaContextHack() This member is renamed to StramExecutorInterface::GpuContextHack(). Changes introduced in this commit includes: - some StreamExecutor interfaces and CUDA implementation - GPU common runtime related to interface changes in StreamExecutor - operators affected by interface changes in StreamExecutor
The commit contains StreamExecutor logic for ROCm platform. It includes integration logic with major components on ROCm platform: - HIP runtime APIs - rocRAND for RNG Also included are relevant changes to: - bazel script to build ROCm StreamExecutor on ROCm platform Authors: - Jack Chung: jack.chung@amd.com - Deven Desai: deven.desai@amd.com - Johannes M Dieterich: Johannes.Dieterich@amd.com - Peng Sun: Peng.Sun@amd.com - Jeffrey Poznanovic: Jeffrey.Poznanovic@amd.com
This commit contains integration logic for rocBLAS on ROCm platform.
c12b7df to
555a08a
Compare
Member
|
@whchung, is this PR still alive? Please resolve the merge conflict if this is the case. |
Contributor
Author
Member
|
It has been 14 days with no activity and the |
Member
|
It has been 43 days that this pull-request has stalled. Please create a new pull-request with the requested changes. |
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
Please focus on the content in commit c12b7df
The pull request contains rocBLAS integration logic for ROCm platform.
There are additional changes included in this pull request which are covered
in other pull requests for ROCm. In order to ensure each pull request is
self-contained and builds / runs on both CUDA & ROCm platform they are still
included here.
Authors: