about zenMatMulSplit split strategy #1

MichoChan · 2021-08-17T03:20:10Z

" if m/6 < n/16, there is no benifit in splitting m, as it will
//make more skinny matix sizes."

i find that throughput is higher when no split in case (m/6 < n/16), but latency is higher than split, could you give suggestions about how to balance the throughput with latency ? is there some better split strategy ?

cramasam · 2021-10-25T04:57:49Z

We don't do split on m dimension in ZenDNN based on following conditions,

(m/6 < n/16) - Splitting here will make the matrix more skinny. BLIS internally handles in optimal way
(n,n,k >= 1024) - This makes matrix more suitable for BLIS to take decision of split/no-split internally
(m,n >= 4096) - Same for this condition like '2'
(m <= (thread_qty6) - m is not large enough to accommodate total threads6 size

For latency we have been observed that BLIS takes cares of it in the optimal way. We just need to bypass the call to BLIS sgemm.

If you can share the sizes of matrices that you are working on, It would help us to give you better insights.

ratan-prasad added the question Further information is requested label Aug 18, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

about zenMatMulSplit split strategy #1

about zenMatMulSplit split strategy #1

MichoChan commented Aug 17, 2021

cramasam commented Oct 25, 2021 •

edited

Loading

about zenMatMulSplit split strategy #1

about zenMatMulSplit split strategy #1

Comments

MichoChan commented Aug 17, 2021

cramasam commented Oct 25, 2021 • edited Loading

cramasam commented Oct 25, 2021 •

edited

Loading