Open
Description
The DLAMOV
call in the following implies the size of A
be DBPTR + ( LLDA - 1 ) * BN
, if the trailing LLDA - 1 - BM
elements are not accessed, this number is DBPTR + ( LLDA - 1 ) * BN - ( LLDA - 1 - BM )
.
Lines 817 to 837 in a23c2cd
From the assignment, assuming 1x1 grid, then LBWU = BWU
, LN = N - BW
, BN = BW
, BM = BW + BWU
.
Lines 699 to 718 in a23c2cd
The number is
BW + 1 + BWU + ( N - BW ) * LLDA + ( LLDA - 1 ) * BN - ( LLDA - 1 - BM ) - 1
= BW + 1 + BWU + N * LLDA - BW * LLDA + ( LLDA - 1 ) * BW - ( LLDA - 1 - BW - BWU ) - 1
= BW + 1 + BWU + N * LLDA - BW * LLDA + LLDA * BW - BW - LLDA + 1 + BW + BWU - 1
= BW + 1 + 2 * BWU - LLDA + N * LLDA
So apart from the block N * LLDA
, there is an additional part BW + 1 + 2 * BWU - LLDA
.
If LLDA = 2 * BW +1
, and BW = BWU + BWL
, then
BW + 1 + 2 * BWU - LLDA = BW + 1 + 2 * BWU - 2 * BW - 1 = 2 * BWU - BW = BWU - BWL
If BWU
is different from BWL
, A
needs to be extended to avoid buffer overrun.
Metadata
Metadata
Assignees
Labels
No labels