Standalone sparse-dense matrix multiplication benchmark #390

marsupialtail · 2020-07-09T11:24:49Z

Hi, I am wondering if FBGEMM supports standalone sparse matrix dense matrix multiplication using the unrolling approach to get register blocking as mentioned in the new release notes. It seems like the test involves the operation fused with another matrix multiplication. I am wondering if an API similar to say MKL's SpMM exists for FBGEMM. Thank you!

dskhudia · 2020-07-09T16:35:41Z

@marsupialtail Such an API doesn't exist in FBGEMM at the moment.

marsupialtail · 2020-07-16T01:02:17Z

So is the easiest way of testing a SpMM fusing it with a quantized matmul at the moment?

dskhudia · 2020-07-17T16:51:45Z

@marsupialtail: Please take a look at the https://github.com/pytorch/FBGEMM/blob/master/bench/I8SpmdmBenchmark.cc

marsupialtail · 2020-07-18T08:14:38Z

So FBGEMM currently only supports int8 SpMM rn? Does it support fp32?

dskhudia · 2020-07-18T17:36:30Z

Currently it's int8 only.

dskhudia · 2021-03-17T18:03:22Z

fp32 and int8 dense-sparse exist at https://github.com/pytorch/FBGEMM/blob/master/bench/SparseDenseMMFP32Benchmark.cc and https://github.com/pytorch/FBGEMM/blob/master/bench/SparseDenseMMInt8Benchmark.cc

CorbinFoucart · 2024-05-16T03:06:57Z

Regarding the SparseDenseMMInt8Benchmark.cc example, it seems that both the input matrix and the output matrix must be transposed in order to use the API. Namely, if I have an input matrix A and am interested in the output matrix C, I must first transpose A to use the API and must transpose the output C^T as well to get C.

As these operations require memory copies which may be quite expensive, I've looked at using the CSC matrix format; for example the function doSpmdmOnInpBuffer. Is there a way to use this function, doSpmdmOnInpBuffer, standalone, followed by a requantization? Similar to the original poster, I've seen it only used in the context of an output pipeline.

dskhudia closed this as completed Mar 17, 2021

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Standalone sparse-dense matrix multiplication benchmark #390

Standalone sparse-dense matrix multiplication benchmark #390

marsupialtail commented Jul 9, 2020

dskhudia commented Jul 9, 2020

marsupialtail commented Jul 16, 2020

dskhudia commented Jul 17, 2020

marsupialtail commented Jul 18, 2020

dskhudia commented Jul 18, 2020

dskhudia commented Mar 17, 2021

CorbinFoucart commented May 16, 2024 •

edited

Standalone sparse-dense matrix multiplication benchmark #390

Standalone sparse-dense matrix multiplication benchmark #390

Comments

marsupialtail commented Jul 9, 2020

dskhudia commented Jul 9, 2020

marsupialtail commented Jul 16, 2020

dskhudia commented Jul 17, 2020

marsupialtail commented Jul 18, 2020

dskhudia commented Jul 18, 2020

dskhudia commented Mar 17, 2021

CorbinFoucart commented May 16, 2024 • edited

CorbinFoucart commented May 16, 2024 •

edited