
torch.mm(dense, sparse_csr) #73686

Closed · wants to merge 13 commits

Conversation

@cpuhrsch (Contributor) commented Mar 2, 2022

Fixes #68621
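
For context, a minimal Python sketch of the call this PR enables (shapes and values are illustrative, and it assumes a PyTorch build that includes this change):

import torch

# Dense left operand and a sparse CSR right operand, the combination named in the PR title.
dense = torch.randn(2, 3)
sparse_csr = torch.relu(torch.randn(3, 4)).to_sparse_csr()  # CSR layout with some zeros

out = torch.mm(dense, sparse_csr)
print(out.shape)  # torch.Size([2, 4])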

@pytorch-bot (bot) commented Mar 2, 2022

CI Flow Status

⚛️ CI Flow

Ruleset - Version: v1
Ruleset - File: https://github.com/cpuhrsch/pytorch/blob/c87578c0b7b996e7de6e5a1309275ab5f55aa176/.github/generated-ciflow-ruleset.json
PR ciflow labels: ciflow/default
Add ciflow labels to this PR to trigger more builds:

Workflow · Labels · Status
Triggered Workflows
linux-binary-conda ciflow/binaries, ciflow/binaries_conda, ciflow/default ✅ triggered
linux-binary-libtorch-cxx11-abi ciflow/binaries, ciflow/binaries_libtorch, ciflow/default ✅ triggered
linux-binary-libtorch-pre-cxx11 ciflow/binaries, ciflow/binaries_libtorch, ciflow/default ✅ triggered
linux-binary-manywheel ciflow/binaries, ciflow/binaries_wheel, ciflow/default ✅ triggered
linux-bionic-py3.7-clang9 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/noarch, ciflow/trunk ✅ triggered
linux-bionic-rocm4.5-py3.7 ciflow/all, ciflow/default, ciflow/linux, ciflow/rocm, ciflow/trunk ✅ triggered
linux-docs ciflow/all, ciflow/cpu, ciflow/default, ciflow/docs, ciflow/linux, ciflow/trunk ✅ triggered
linux-vulkan-bionic-py3.7-clang9 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk, ciflow/vulkan ✅ triggered
linux-xenial-cuda11.3-py3.7-gcc7 ciflow/all, ciflow/cuda, ciflow/default, ciflow/linux, ciflow/trunk ✅ triggered
linux-xenial-cuda11.3-py3.7-gcc7-bazel-test ciflow/all, ciflow/bazel, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk ✅ triggered
linux-xenial-py3-clang5-mobile-build ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile, ciflow/trunk ✅ triggered
linux-xenial-py3-clang5-mobile-custom-build-static ciflow/all, ciflow/default, ciflow/linux, ciflow/mobile, ciflow/trunk ✅ triggered
linux-xenial-py3.7-clang7-asan ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/sanitizers, ciflow/trunk ✅ triggered
linux-xenial-py3.7-clang7-onnx ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/onnx, ciflow/trunk ✅ triggered
linux-xenial-py3.7-gcc5.4 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk ✅ triggered
linux-xenial-py3.7-gcc5.4-mobile-lightweight-dispatch-build ciflow/all, ciflow/cpu, ciflow/default, ciflow/libtorch, ciflow/linux, ciflow/mobile, ciflow/trunk ✅ triggered
linux-xenial-py3.7-gcc7 ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk ✅ triggered
linux-xenial-py3.7-gcc7-no-ops ciflow/all, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk ✅ triggered
macos-arm64-binary-conda ciflow/binaries, ciflow/binaries_conda, ciflow/default ✅ triggered
macos-arm64-binary-wheel ciflow/binaries, ciflow/binaries_wheel, ciflow/default ✅ triggered
macos-binary-conda ciflow/binaries, ciflow/binaries_conda, ciflow/default ✅ triggered
macos-binary-libtorch-cxx11-abi ciflow/binaries, ciflow/binaries_libtorch, ciflow/default ✅ triggered
macos-binary-libtorch-pre-cxx11 ciflow/binaries, ciflow/binaries_libtorch, ciflow/default ✅ triggered
macos-binary-wheel ciflow/binaries, ciflow/binaries_wheel, ciflow/default ✅ triggered
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single ciflow/all, ciflow/android, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk ✅ triggered
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-gradle-custom-build-single-full-jit ciflow/all, ciflow/android, ciflow/cpu, ciflow/default, ciflow/linux, ciflow/trunk ✅ triggered
win-vs2019-cpu-py3 ciflow/all, ciflow/cpu, ciflow/default, ciflow/trunk, ciflow/win ✅ triggered
win-vs2019-cuda11.3-py3 ciflow/all, ciflow/cuda, ciflow/default, ciflow/trunk, ciflow/win ✅ triggered
windows-binary-libtorch-cxx11-abi ciflow/binaries, ciflow/binaries_libtorch, ciflow/default ✅ triggered
windows-binary-libtorch-pre-cxx11 ciflow/binaries, ciflow/binaries_libtorch, ciflow/default ✅ triggered
windows-binary-wheel ciflow/binaries, ciflow/binaries_wheel, ciflow/default ✅ triggered
Skipped Workflows
caffe2-linux-xenial-py3.7-gcc5.4 ciflow/all, ciflow/cpu, ciflow/linux, ciflow/trunk 🚫 skipped
docker-builds ciflow/all, ciflow/trunk 🚫 skipped
ios-12-5-1-arm64 ciflow/all, ciflow/ios, ciflow/macos, ciflow/scheduled 🚫 skipped
ios-12-5-1-arm64-coreml ciflow/all, ciflow/ios, ciflow/macos, ciflow/scheduled 🚫 skipped
ios-12-5-1-arm64-custom-ops ciflow/all, ciflow/ios, ciflow/macos, ciflow/scheduled 🚫 skipped
ios-12-5-1-arm64-metal ciflow/all, ciflow/ios, ciflow/macos, ciflow/scheduled 🚫 skipped
ios-12-5-1-x86-64 ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk 🚫 skipped
ios-12-5-1-x86-64-coreml ciflow/all, ciflow/ios, ciflow/macos, ciflow/trunk 🚫 skipped
libtorch-linux-xenial-cuda10.2-py3.7-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/trunk 🚫 skipped
libtorch-linux-xenial-cuda11.3-py3.7-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/trunk 🚫 skipped
linux-bionic-cuda10.2-py3.9-gcc7 ciflow/all, ciflow/cuda, ciflow/linux, ciflow/slow, ciflow/trunk 🚫 skipped
linux-docs-push ciflow/all, ciflow/cpu, ciflow/linux, ciflow/scheduled 🚫 skipped
linux-xenial-cuda11.3-py3.7-gcc7-no-ops ciflow/all, ciflow/cuda, ciflow/linux, ciflow/trunk 🚫 skipped
macos-10-15-py3-arm64 ciflow/all, ciflow/macos, ciflow/trunk 🚫 skipped
macos-10-15-py3-lite-interpreter-x86-64 ciflow/all, ciflow/macos, ciflow/trunk 🚫 skipped
macos-11-py3-x86-64 ciflow/all, ciflow/macos, ciflow/trunk 🚫 skipped
parallelnative-linux-xenial-py3.7-gcc5.4 ciflow/all, ciflow/cpu, ciflow/linux, ciflow/trunk 🚫 skipped
periodic-libtorch-linux-bionic-cuda11.5-py3.7-gcc7 ciflow/all, ciflow/cuda, ciflow/libtorch, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-linux-bionic-cuda11.5-py3.7-gcc7 ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled, ciflow/slow, ciflow/slow-gradcheck 🚫 skipped
periodic-linux-xenial-cuda11.3-py3.7-gcc7-debug ciflow/all, ciflow/cuda, ciflow/linux, ciflow/scheduled 🚫 skipped
periodic-win-vs2019-cuda11.5-py3 ciflow/all, ciflow/cuda, ciflow/scheduled, ciflow/win 🚫 skipped
pytorch-linux-xenial-py3-clang5-android-ndk-r19c-build ciflow/all, ciflow/android, ciflow/cpu, ciflow/linux, ciflow/trunk 🚫 skipped
pytorch-xla-linux-bionic-py3.7-clang8 ciflow/all, ciflow/cpu, ciflow/linux, ciflow/trunk, ciflow/xla 🚫 skipped

@facebook-github-bot (Contributor) commented Mar 2, 2022

💊 CI failures summary and remediations

As of commit 6ec1929 (more details on the Dr. CI page):


💚 💚 Looks good so far! There are no failures yet. 💚 💚


This comment was automatically generated by Dr. CI.

Please report bugs/suggestions to the (internal) Dr. CI Users group.


@cpuhrsch marked this pull request as draft March 3, 2022 00:40
@facebook-github-bot (Contributor) commented:
@cpuhrsch has imported this pull request. If you are a Meta employee, you can view this diff on Phabricator.

@cpuhrsch marked this pull request as ready for review March 29, 2022 00:35
@cpuhrsch marked this pull request as draft March 29, 2022 01:01
@cpuhrsch marked this pull request as ready for review March 29, 2022 21:26
@cpuhrsch changed the title from torch.mm(dense, sparse) to torch.mm(dense, sparse_csr) Mar 29, 2022
@cpuhrsch requested a review from bdhirsh as a code owner March 30, 2022 19:31
Comment on lines 92 to 94
convert_layout(t)
convert_layout(m)
convert_layout(v)
Collaborator:

These lines do nothing.

@cpuhrsch (Contributor Author), Mar 31, 2022:

Yes. It's just for intermediate testing. Will remove.
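
(Aside on the exchange above: a hypothetical sketch of why the calls have no effect, assuming convert_layout is a test helper that returns a converted copy rather than mutating its argument. The helper below is a stand-in, not the actual test code.)

import torch

# Hypothetical stand-in for the test helper referenced above: it returns a
# converted copy and does not modify its argument in place.
def convert_layout(t):
    return t.to_sparse_csr() if t.layout == torch.strided else t

t = torch.eye(3)
convert_layout(t)       # return value discarded, so t keeps its strided layout
t = convert_layout(t)   # assigning the result is what makes the conversion stick
print(t.layout)         # torch.sparse_csr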

@IvanYashchuk self-requested a review March 31, 2022 04:30
@cpuhrsch (Contributor Author) commented:
@IvanYashchuk - can you take another look?

@@ -806,19 +806,23 @@ void spgemm(
} // anonymous namespace

void addmm_out_sparse_csr(
-    const at::sparse_csr::SparseCsrTensor& mat1,
+    const Tensor& mat1,
Collaborator:

I agree that we should get rid of at::sparse_csr::SparseCsrTensor; this is really good.


// Returns true if all entries of self are zero
// TODO: This has potential to be a generic helper
inline bool _is_all_zero(const Tensor& self) {
Collaborator:

The strided path of this helper function introduces an unnecessary synchronization. I think it's possible to restructure the code and remove this check. Dense addmm doesn't have checks for zero. nnz == 0 is checked because the underlying backend libraries might not handle this case correctly and raise an error, so we need to fix up the results manually.
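
(For context, a hedged Python-level illustration of the synchronization concern: checking a strided CUDA tensor for all zeros means reducing on the device and reading the scalar result back on the host, which blocks the CPU until the GPU finishes its queued work. The helper name below is illustrative, not the ATen function.)

import torch

def is_all_zero(t: torch.Tensor) -> bool:
    # (t == 0).all() runs on the tensor's device; .item() copies the 0-dim
    # result to the host, forcing a device synchronization for CUDA tensors.
    return bool((t == 0).all().item())

if torch.cuda.is_available():
    x = torch.zeros(1024, 1024, device="cuda")
    print(is_all_zero(x))  # True, but only after waiting for the GPU queue to drain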


  if (&result != &self) {
    if (result.layout() == kStrided) {
      at::native::resize_output(result, self__sizes);
Collaborator:

For a strided result, at::native::resize_output should be used, because there's a deprecation warning:

TORCH_WARN(
    "An output with one or more elements was resized since it had ",
    "shape ", output.sizes(), ", which does not match the required ",
    "output shape ", shape, ". ",
    "This behavior is deprecated, and in a future PyTorch release outputs ",
    "will not be resized unless they have zero elements. You can explicitly ",
    "reuse an out tensor t by resizing it, inplace, to zero elements with ",
    "t.resize_(0).");

Comment on lines 141 to 135
  IntArrayRef self__sizes;
  c10::MaybeOwned<Tensor> self_;
  if (&result != &self && self.layout() == kStrided) {
    self_ = expand_size(self, {mat1_sizes[0], mat2_sizes[1]}, "addmm");
    self__sizes = self_->sizes();
  } else {
    self_ = c10::MaybeOwned<Tensor>::borrowed(self);
    self__sizes = self_->sizes();
    TORCH_CHECK(result.dim() == 2, "tensors must be 2-D");
    TORCH_CHECK(
        self__sizes[0] == mat1_sizes[0], "self_ dim 0 must match mat1 dim 0");
    TORCH_CHECK(
        self__sizes[1] == mat2_sizes[1], "self_ dim 1 must match mat2 dim 1");
  }
  c10::MaybeOwned<at::Tensor> self_ =
      expand_size(self, {mat1.size(0), mat2.size(1)}, "addmm");
Collaborator:

This code was copied from addmm_out_cuda_impl so that for in-place operations self is not expanded.
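
(A hedged Python-level illustration of the distinction being drawn, using dense addmm semantics rather than this PR's code: the out-of-place form broadcasts the added input to the result shape, while the in-place form uses self as both input and output, so it must already have the full result shape and is not expanded.)

import torch

mat1 = torch.randn(2, 3)
mat2 = torch.randn(3, 4)

bias = torch.randn(4)
y = torch.addmm(bias, mat1, mat2)   # out-of-place: bias is broadcast to (2, 4)

acc = torch.zeros(2, 4)             # in-place: self must already be (2, 4)
acc.addmm_(mat1, mat2)              # equivalent to acc += mat1 @ mat2 here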

Comment on lines +424 to +427
sparse::impl::_check_is_cpu(self, "self");
sparse::impl::_check_is_cpu(mat1, "mat1");
sparse::impl::_check_is_cpu(mat2, "mat2");
sparse::impl::_check_is_cpu(result, "result");
Collaborator:

This might already be in the generated code. An easy way to check is to pass tensors on different devices and see whether you get an error from these lines or from somewhere higher in the call chain.
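
(A hedged sketch of the suggested experiment, assuming a build that includes this PR: call the op with operands on different devices and inspect which layer raises the device-mismatch error, the manual checks here or the generated guards higher in the call chain.)

import torch

if torch.cuda.is_available():
    mat1 = torch.randn(3, 3)                            # CPU dense
    mat2 = torch.eye(3, device="cuda").to_sparse_csr()  # CUDA sparse CSR
    try:
        torch.mm(mat1, mat2)
    except RuntimeError as e:
        print(e)  # the message reveals which check fired first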

@cpuhrsch (Contributor Author):

Yes, but then we have to update all the error message tests, etc. Sounds like good follow-up work.

@IvanYashchuk (Collaborator) left a comment:

Thanks!

@cpuhrsch (Contributor Author) commented Apr 5, 2022

@pytorchbot merge this

@github-actions (bot) commented Apr 5, 2022

Hey @cpuhrsch.
You've committed this PR, but it does not have both a 'release notes: ...' and 'topics: ...' label. Please add one of each to the PR. The 'release notes: ...' label should represent the part of PyTorch that this PR changes (fx, autograd, distributed, etc) and the 'topics: ...' label should represent the kind of PR it is (not user facing, new feature, bug fix, perf improvement, etc). The list of valid labels can be found here for the 'release notes: ...' and here for the 'topics: ...'.
For changes that are 'topic: not user facing' there is no need for a release notes label.

@cpuhrsch added the 'release notes: sparse' and 'topic: new features' labels Apr 5, 2022
facebook-github-bot pushed a commit that referenced this pull request Apr 7, 2022
Summary:
Fixes #68621

Pull Request resolved: #73686
Approved by: https://github.com/IvanYashchuk, https://github.com/malfet

Test Plan: contbuild & OSS CI, see https://hud.pytorch.org/commit/pytorch/pytorch/f2a4d49174eb7b705c2c605c45f78d4fa6786be0

Reviewed By: malfet, b0noI

Differential Revision: D34648223

Pulled By: cpuhrsch

fbshipit-source-id: 08d62bd1006376d86ee9bb3162848f9f130598b7

Successfully merging this pull request may close these issues.

Missing operators for SparseCsrCUDA