Description
Currently, XgemmBatched::BatchedGemmDirect (routines/levelx) takes a single value each for m, n, and k.
This lets one process a batch of matrix multiplications only when m, n, and k are the same for every multiplication in the batch.
However, there may be use cases where m, n, and k vary from one multiplication to the next. This forces one to call the non-batched routine in a tight loop, which isn't ideal.
My suggestion is to make m, n, and k vectors so that you can specify the dimensions of each matrix individually.
Optionally, a vector of size 1 could be accepted to mean that the dimension is the same for every matrix in the batch, so a single function covers both the uniform and the variable case instead of needing two.
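
To make the suggestion concrete, here is a rough sketch of how the size-1 rule could be resolved before dispatching the batch. It is not CLBlast code: ExpandDims, GemmShape, and BatchShapes are hypothetical names made up for illustration, and it only works out the per-multiplication shapes on the host without touching any kernels.

```cpp
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical helper (not part of CLBlast): expands one dimension list to
// batch_count entries. A vector of size 1 is broadcast to the whole batch;
// any other size must match batch_count exactly.
std::vector<std::size_t> ExpandDims(const std::vector<std::size_t>& dims,
                                    std::size_t batch_count) {
  if (dims.size() == 1) {
    return std::vector<std::size_t>(batch_count, dims[0]);
  }
  if (dims.size() != batch_count) {
    throw std::invalid_argument("dimension vector must have size 1 or batch_count");
  }
  return dims;
}

// Hypothetical per-multiplication shape: C (m x n) = A (m x k) * B (k x n).
struct GemmShape {
  std::size_t m, n, k;
};

// Resolves the m, n, and k vectors into one shape per batch entry.
std::vector<GemmShape> BatchShapes(const std::vector<std::size_t>& ms,
                                   const std::vector<std::size_t>& ns,
                                   const std::vector<std::size_t>& ks,
                                   std::size_t batch_count) {
  const auto m = ExpandDims(ms, batch_count);
  const auto n = ExpandDims(ns, batch_count);
  const auto k = ExpandDims(ks, batch_count);
  assert(m.size() == batch_count && n.size() == batch_count &&
         k.size() == batch_count);

  std::vector<GemmShape> shapes;
  shapes.reserve(batch_count);
  for (std::size_t i = 0; i < batch_count; ++i) {
    shapes.push_back({m[i], n[i], k[i]});
  }
  return shapes;
}
```

With this, BatchShapes({64}, {32, 16, 8}, {128}, 3) would describe three multiplications that share m and k but differ in n, while passing three size-1 vectors reproduces today's uniform behaviour.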