[Navi3x] Add Device Operations #567
|
@aska-0096 Does this PR also pass with the compiler currently used by CI? If not, we may need to update the CI compiler again. cc @illsilin |
|
Looks like we got a couple of new test failures in CI for this branch: [2023-01-30T12:16:01.378Z] The following tests FAILED: |
…posable_kernel into navi3x_mD_batchedGEMM_GroupConvFwd
|
I have updated the CI compiler. Please sync your branch with the develop branch. |
…posable_kernel into navi3x_mD_batchedGEMM_GroupConvFwd
|
The docker I triggered still the |
|
What happened is that I changed the default compiler values in the Jenkins parameters. Jenkins usually takes 20-30 minutes to pick those up after the change has been merged, so if CI is launched before that, it still uses the old defaults. I'll restart your branch manually now, and it will use the new compiler defaults. |
|
OK, the results are in. There are 3 failures: [2023-02-08T17:13:42.060Z] The following tests FAILED: Test 150 seems sensitive: I re-ran it locally with your branch and it passed. In CI, its result differed from the baseline by just 1 (879 vs 880), so it is most likely a round-off error. The other two tests, however, should not have been launched on MI100/200. You need to add a check somewhere to make sure those tests are only triggered under "#if defined(gfx1100)". |
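As a complement to a compile-time guard, an example can also check the device architecture at run time and skip itself on unsupported GPUs. The following is a minimal sketch, not the check used in this PR: `is_wmma_capable` and `run_if_supported` are hypothetical helpers, the architecture string is assumed to come from the runtime (for instance the `gcnArchName` field of the HIP device properties, which may carry feature suffixes such as `:sramecc+`), and the supported list contains only gfx1100 because that is the only target named in this thread.

```cpp
#include <iostream>
#include <string>

// Hypothetical helper: returns true only for architectures assumed to
// support the WMMA examples. Only gfx1100 is listed, since it is the only
// target mentioned in this thread; extend the check as needed.
bool is_wmma_capable(const std::string& arch)
{
    // Drop any feature suffix, e.g. "gfx90a:sramecc+:xnack-" -> "gfx90a".
    const std::string base = arch.substr(0, arch.find(':'));
    return base == "gfx1100";
}

// Hypothetical wrapper: runs the example only on a supported architecture.
// Returns 0 on success or skip, 1 if the example reports failure.
template <typename Example>
int run_if_supported(const std::string& arch, Example&& example)
{
    if(!is_wmma_capable(arch))
    {
        std::cout << "skipping: WMMA example not supported on " << arch << '\n';
        return 0; // a skip is not reported to CI as a failure
    }
    return example() ? 0 : 1;
}
```

An example's `main` could then return `run_if_supported(arch, [] { return run_example(); })`, where `run_example` stands for the existing example body. Returning 0 on a skip keeps CI green on MI100/200, at the cost of still building the example for those targets.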
|
One option is to make sure those tests are only built if the appropriate GPU architecture is on the list of targets: diff --git a/example/02_gemm_bilinear/CMakeLists.txt b/example/02_gemm_bilinear/CMakeLists.txt
|
|
Interesting. I confirmed that CI failed because the examples ran on an unsupported GPU. However, 2 of the 4 examples that include WMMA passed without compile or runtime errors. |
…posable_kernel into navi3x_mD_batchedGEMM_GroupConvFwd
|
@asroy |
Three Device Operations Added
1. DeviceGemmMultipleD_Wmma_Cshuffle
2. DeviceBatchedContractionMultipleD_Wmma_Cshuffle
3. DeviceGroupedConvFwdMultipleD_Wmma_Cshuffle
The above examples passed with the latest version of the amd-stg-open compiler.