Skip to content

Commit

Permalink
fix ROCm tabulate_fusion_se_a_grad_fifth_order_polynomial (#2357)
Browse files Browse the repository at this point in the history
Follow #830 to fix #2303.

Signed-off-by: Jinzhe Zeng <jinzhe.zeng@rutgers.edu>
  • Loading branch information
njzjz committed Mar 1, 2023
1 parent a16ac9f commit 719005c
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion source/lib/src/rocm/tabulate.hip.cu
Original file line number Diff line number Diff line change
Expand Up @@ -162,7 +162,7 @@ __global__ void tabulate_fusion_se_a_grad_fifth_order_polynomial(
}
__syncthreads();
FPTYPE ago = __shfl(em_x[block_idx * nnei + nnei - 1], 0);
for (int ii = 0; ii < nnei; ii += KTILE) {
for (int ii = 0; ii < nnei - warp_idx; ii += KTILE) {
FPTYPE xx = em_x[block_idx * nnei + ii + warp_idx];
if (ago == xx) {
unloop = true;
Expand Down

0 comments on commit 719005c

Please sign in to comment.