Skip to content

Conversation

@PeixuanZuo
Copy link
Contributor

@PeixuanZuo PeixuanZuo commented Apr 27, 2023

ROCm CI batch size test occasionally fail. Try reduce batch size to fix it.

error log:
Non-zero status code returned while running FusedMatMul node. Name:'MatMul_2914_Grad/FusedMatMul_0' Status Message: HIP error hipErrorNotFound:named symbol not found
Non-zero status code returned while running Gemm node. Name:'MatMul_2891_Grad/Gemm_5' Status Message: HIP error hipErrorNotFound:named symbol not found

@PeixuanZuo PeixuanZuo requested review from cloudhan and mindest April 28, 2023 06:23
@PeixuanZuo PeixuanZuo merged commit e96f10d into main May 16, 2023
@PeixuanZuo PeixuanZuo deleted the peixuanzuo/fix_rocm_batch_size branch May 16, 2023 05:10
prathikr pushed a commit that referenced this pull request May 16, 2023
ROCm CI batch size test occasionally fail. Try reduce batch size to fix
it.

error log:
Non-zero status code returned while running FusedMatMul node.
Name:'MatMul_2914_Grad/FusedMatMul_0' Status Message: HIP error
hipErrorNotFound:named symbol not found
Non-zero status code returned while running Gemm node.
Name:'MatMul_2891_Grad/Gemm_5' Status Message: HIP error
hipErrorNotFound:named symbol not found
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants