Sparse CSR CUDA: fix input checks for `addmm` and `mm` #66485

IvanYashchuk · 2021-10-12T15:57:45Z

Stack from ghstack:

The errors for incorrectly sized inputs should match the dense variants
of functions.
Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and
removed unnecessary device check.

cc @nikitaved @pearu @cpuhrsch @IvanYashchuk

Differential Revision: D31764036

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. [ghstack-poisoned]

pytorch-probot · 2021-10-12T15:57:48Z

CI Flow Status

⚛️ CI Flow

Ruleset - Version: v1
Ruleset - File: https://github.com/pytorch/pytorch/blob/7949f07bd48ec1bf74ec67531399b1ebd7ce8930/.github/generated-ciflow-ruleset.json
PR ciflow labels: ciflow/default,ciflow/cuda

Workflows	Labels (bold enabled)	Status
Triggered Workflows
libtorch-linux-xenial-cuda10.2-py3.6-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/libtorch`, `ciflow/linux`	✅ triggered
libtorch-linux-xenial-cuda11.3-py3.6-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/libtorch`, `ciflow/linux`	✅ triggered
linux-bionic-cuda10.2-py3.9-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/slow`	✅ triggered
linux-bionic-py3.6-clang9	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/noarch`, `ciflow/xla`	✅ triggered
linux-vulkan-bionic-py3.6-clang9	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/vulkan`	✅ triggered
linux-xenial-cuda10.2-py3.6-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/slow`	✅ triggered
linux-xenial-cuda11.3-py3.6-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/default`, `ciflow/linux`	✅ triggered
linux-xenial-py3.6-clang7-asan	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/sanitizers`	✅ triggered
linux-xenial-py3.6-clang7-onnx	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`, `ciflow/onnx`	✅ triggered
linux-xenial-py3.6-gcc5.4	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`	✅ triggered
linux-xenial-py3.6-gcc7-bazel-test	`ciflow/all`, `ciflow/bazel`, `ciflow/cpu`, `ciflow/default`, `ciflow/linux`	✅ triggered
periodic-libtorch-linux-xenial-cuda11.1-py3.6-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/libtorch`, `ciflow/linux`, `ciflow/scheduled`	✅ triggered
periodic-linux-xenial-cuda10.2-py3-gcc7-slow-gradcheck	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/scheduled`, `ciflow/slow`, `ciflow/slow-gradcheck`	✅ triggered
periodic-linux-xenial-cuda11.1-py3.6-gcc7	`ciflow/all`, `ciflow/cuda`, `ciflow/linux`, `ciflow/scheduled`	✅ triggered
periodic-win-vs2019-cuda11.1-py3	`ciflow/all`, `ciflow/cuda`, `ciflow/scheduled`, `ciflow/win`	✅ triggered
win-vs2019-cpu-py3	`ciflow/all`, `ciflow/cpu`, `ciflow/default`, `ciflow/win`	✅ triggered
win-vs2019-cuda11.3-py3	`ciflow/all`, `ciflow/cuda`, `ciflow/default`, `ciflow/win`	✅ triggered
Skipped Workflows
parallelnative-linux-xenial-py3.6-gcc5.4	`ciflow/all`, `ciflow/cpu`, `ciflow/linux`	🚫 skipped
puretorch-linux-xenial-py3.6-gcc5.4	`ciflow/all`, `ciflow/cpu`, `ciflow/linux`	🚫 skipped

You can add a comment to the PR and tag @pytorchbot with the following commands:

# ciflow rerun, "ciflow/default" will always be added automatically
@pytorchbot ciflow rerun

# ciflow rerun with additional labels "-l <ciflow/label_name>", which is equivalent to adding these labels manually and trigger the rerun
@pytorchbot ciflow rerun -l ciflow/scheduled -l ciflow/slow

For more information, please take a look at the CI Flow Wiki.

facebook-github-bot · 2021-10-12T15:57:50Z

🔗 Helpful links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/66485
📄 Preview docs built from this PR
📄 Preview C++ docs built from this PR
🔧 Opt-in to CIFlow to control what jobs run on your PRs

💊 CI failures summary and remediations

As of commit 7949f07 (more details on the Dr. CI page):

1/1 failures introduced in this PR

1 failure not recognized by patterns:

Job	Step	Action
^{pytorch_ios_12_5_1_x86_64_coreml_build}	^{Run Simulator Tests}	🔁 rerun

This comment was automatically generated by Dr. CI (expand for details).

Follow this link to opt-out of these comments for your Pull Requests.

Please report bugs/suggestions to the (internal) Dr. CI Users group.

Click here to manually regenerate this comment.

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. ghstack-source-id: ba2b4efb1e9f092c897da0484caa8631fccc527a Pull Request resolved: pytorch#66485

@IvanYashchuk

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. cc nikitaved pearu cpuhrsch @IvanYashchuk [ghstack-poisoned]

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. ghstack-source-id: af6563cba012fc9eac3206f00037b731eefdd85e Pull Request resolved: pytorch#66485

@IvanYashchuk

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. cc nikitaved pearu cpuhrsch @IvanYashchuk [ghstack-poisoned]

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. ghstack-source-id: 8a32a9597e02bc05b1ac15752407e30a48381a69 Pull Request resolved: pytorch#66485

cpuhrsch · 2021-10-18T23:07:57Z

aten/src/ATen/native/sparse/cuda/SparseBlas.cpp

+    const Scalar& beta,
+    const Scalar& alpha,
+    Tensor& result) {
+  TORCH_INTERNAL_ASSERT_DEBUG_ONLY(mat1.is_sparse_csr());


As mentioned in #63583 TORCH_INTERNAL_ASSERT_DEBUG_ONLY doesn't run in CI or in the binaries we distribute. It might make sense to use a regular internal assert or explicitly throw an error if this condition isn't met.

TORCH_INTERNAL_ASSERT_DEBUG_ONLY is not something that should be tested usually. It could be all removed and nothing would change. It's just my style of warning some potential future developer (who surely develops with DEBUG=1) that this function is intended to be used only when mat1 is sparse csr matrix.
I'd rather remove the assert completely, or replace it with a comment, than turning into a regular assert that 100% of the time in the current code evaluates to assert(true);. It could be expensive to check is_sparse_csr, we never know without measuring!

cpuhrsch

Looks fine but take a look at the comment about TORCH_INTERNAL_ASSERT_DEBUG_ONLY

@IvanYashchuk

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. cc nikitaved pearu cpuhrsch @IvanYashchuk [ghstack-poisoned]

@IvanYashchuk

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. cc nikitaved pearu cpuhrsch @IvanYashchuk [ghstack-poisoned]

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. ghstack-source-id: bcdc7fc80b4c1fbb1cc43f4746d5e8b6f43d2e68 Pull Request resolved: pytorch#66485

anjali411 · 2021-10-19T16:23:37Z

@anjali411 has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

Summary: Pull Request resolved: #66485 The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. cc nikitaved pearu cpuhrsch IvanYashchuk Test Plan: Imported from OSS Reviewed By: jbschlosser Differential Revision: D31764036 Pulled By: cpuhrsch fbshipit-source-id: 76900fe9e4a49474695a01f34bad41cb3422321c

Sparse CSR CUDA: fix input checks for addmm and mm

20731db

The errors for incorrectly sized inputs should match the dense variants of functions. Moved addmm_out_sparse_csr_dense_cuda from SparseCsrTensorMath.cu and removed unnecessary device check. [ghstack-poisoned]

pytorch-probot bot added the ciflow/default label Oct 12, 2021

IvanYashchuk added ciflow/cuda module: sparse Related to torch.sparse labels Oct 12, 2021

pytorchbot added the open source label Oct 12, 2021

facebook-github-bot added the cla signed label Oct 13, 2021

IvanYashchuk requested a review from ngimel October 15, 2021 14:11

cpuhrsch reviewed Oct 18, 2021

View reviewed changes

cpuhrsch approved these changes Oct 18, 2021

View reviewed changes

facebook-github-bot closed this Oct 19, 2021

facebook-github-bot deleted the gh/ivanyashchuk/41/head branch November 19, 2021 15:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sparse CSR CUDA: fix input checks for `addmm` and `mm` #66485

Sparse CSR CUDA: fix input checks for `addmm` and `mm` #66485

IvanYashchuk commented Oct 12, 2021 •

edited by anjali411

pytorch-probot bot commented Oct 12, 2021 •

edited

⚛️ CI Flow

facebook-github-bot commented Oct 12, 2021 •

edited

cpuhrsch Oct 18, 2021

IvanYashchuk Oct 19, 2021

cpuhrsch left a comment

anjali411 commented Oct 19, 2021

Sparse CSR CUDA: fix input checks for addmm and mm #66485

Sparse CSR CUDA: fix input checks for addmm and mm #66485

Conversation

IvanYashchuk commented Oct 12, 2021 • edited by anjali411

pytorch-probot bot commented Oct 12, 2021 • edited

⚛️ CI Flow

facebook-github-bot commented Oct 12, 2021 • edited

🔗 Helpful links

💊 CI failures summary and remediations

1 failure not recognized by patterns:

cpuhrsch Oct 18, 2021

Choose a reason for hiding this comment

IvanYashchuk Oct 19, 2021

Choose a reason for hiding this comment

cpuhrsch left a comment

Choose a reason for hiding this comment

anjali411 commented Oct 19, 2021

Sparse CSR CUDA: fix input checks for `addmm` and `mm` #66485

Sparse CSR CUDA: fix input checks for `addmm` and `mm` #66485

IvanYashchuk commented Oct 12, 2021 •

edited by anjali411

pytorch-probot bot commented Oct 12, 2021 •

edited

facebook-github-bot commented Oct 12, 2021 •

edited