CSR: Relaxing constraints to s_addmm_out_sparse_dense_cuda_worker #59099
Labels
module: sparse
Related to torch.sparse
open source
triaged
This issue has been looked at a team member, and triaged and prioritized into an appropriate module
Projects
馃殌 Feature
Given that cusparse APIs support setting ldb (leading dimension of B).
pytorch/aten/src/ATen/native/sparse/cuda/SparseCUDATensorMath.cu
Line 83 in 1bd22e2
The previous condition is too strict condition as
dense.stride(1)
can be larger thandense.size(0)
. The first stridestride(1)
can besize(0) * s
wheres
is some positive integer.Additional context
This function is used by
addmm_out_sparse_csr_dense_cuda
(CSR) ands_addmm_out_sparse_dense_cuda_worker
(COO) tensor's operation.Mentioned here:
#59012 (comment)
cc @aocsa @nikitaved @pearu @cpuhrsch @IvanYashchuk @mruberry
The text was updated successfully, but these errors were encountered: