
Fix nonzero for scalars on cuda, to_sparse for scalars on cpu/cuda. #17406

Closed

Conversation

gchanan (Contributor) commented Feb 22, 2019:

I originally set out to fix to_sparse for scalars, which had some overly restrictive checking (sparse_dim > 0, which is impossible for a scalar).

This fix uncovered an issue with nonzero: it didn't properly return a size (z, 0) tensor for an input scalar, where z is the number of nonzero elements (i.e. 0 or 1).
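The intended shape contract can be sketched in NumPy terms. This is an illustrative helper, not PyTorch code; `nonzero_shape` is a made-up name for the shape that `torch.nonzero` should produce under the fixed semantics:

```python
import numpy as np

def nonzero_shape(t):
    # Shape torch.nonzero(t) should have after the fix: a (z, ndim) result,
    # so a 0-dim (scalar) input yields (z, 0), where z is the number of
    # nonzero elements (0 or 1 for a scalar).
    z = int(np.count_nonzero(t))
    return (z, t.ndim)

print(nonzero_shape(np.array(5.0)))        # (1, 0): one nonzero, zero dims
print(nonzero_shape(np.array(0.0)))        # (0, 0)
print(nonzero_shape(np.array([3, 0, 7])))  # (2, 1)
```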

facebook-github-bot (Contributor) commented:

@gchanan has imported this pull request. If you are a Facebook employee, you can view this diff on Phabricator.

fmassa (Member) commented Feb 22, 2019:

The current behavior of nonzero is a bit weird and differs from NumPy's.

```python
In [1]: import numpy as np

In [2]: a = np.array(5)

In [3]: a
Out[3]: array(5)

In [4]: a.nonzero()
Out[4]: (array([0]),)

In [5]: a.nonzero()[0]
Out[5]: array([0])

In [6]: b = np.array([5])

In [7]: b.nonzero()[0]
Out[7]: array([0])
```

So NumPy still returns the expected result for a scalar, while with the current patch scalars don't return any indices.
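For context, the two conventions are related by a transpose: NumPy's `nonzero` returns a tuple of per-dimension index arrays, while `np.argwhere` produces the (z, ndim) matrix form that `torch.nonzero` uses. A small sketch:

```python
import numpy as np

a = np.array([[0, 3], [4, 0]])
tup = a.nonzero()     # NumPy convention: tuple of 1-D index arrays, one per dim
mat = np.argwhere(a)  # (z, ndim) matrix convention, like torch.nonzero
print(mat)            # two nonzero elements at (0, 1) and (1, 0)
# The matrix form is exactly the stacked columns of the tuple form.
assert (mat == np.stack(tup, axis=1)).all()
```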

gchanan (Contributor, Author) commented Feb 22, 2019:

@fmassa Yes, that is true, but right now the CPU and CUDA implementations don't match each other, and the CUDA code doesn't match the documentation.

@@ -284,14 +284,16 @@ SparseTensor dense_to_sparse(const Tensor& self){

```cpp
SparseTensor dense_to_sparse(const Tensor& self, int64_t sparse_dim){
  int64_t dims = self.dim();
  AT_CHECK(sparse_dim > 0, "sparse_dim must be >0");
  // TODO: it seems like sparse_dim == 0 could be supported even if self.dim() > 0,
  // but this would take some work and doesn't seem particularly useful.
```
A contributor commented on the diff:

ISTR myself saying that in this sense, sparse tensors are a strict generalization of dense tensors :)

gchanan (Contributor, Author) replied:

Yep, feel free to implement it to make it true!

```cpp
Tensor values;
if (self.dim() > 0) {
  std::vector<Tensor> ix = indices.chunk(indices.size(0), 0);
  values = self.index(ix).squeeze(0).clone();
```
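The gather step here has a rough NumPy analogue. The following is an illustrative sketch of the same idea, not the ATen code; `dense_to_sparse_sketch` is a hypothetical name:

```python
import numpy as np

def dense_to_sparse_sketch(a):
    # COO-style conversion: idx is an (ndim, z) index matrix; values are
    # gathered by advanced indexing, mirroring self.index(ix) in the C++ above.
    idx = np.argwhere(a).T   # (ndim, z)
    values = a[tuple(idx)]   # gather the z nonzero values
    return idx, values

idx, vals = dense_to_sparse_sketch(np.array([[0, 3], [4, 0]]))
print(vals)  # the two nonzero values, 3 and 4
```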
Contributor commented:

It's kind of surprising this code doesn't generalize to the self.dim() == 0 case. Where exactly does it fail? index() with an empty vector?

gchanan (Contributor, Author) replied:

Yes.

gchanan (Contributor, Author) added:

Well, even before the index() call I don't believe you could chunk empty tensors, and that's consistent with NumPy.
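A rough NumPy analogue of why the chunk step fails for scalars (assuming `np.array_split` as the stand-in for `Tensor.chunk`): for a 0-dim input, the indices matrix has zero rows along dim 0, so the code asks for zero chunks, which raises.

```python
import numpy as np

# For a 0-dim input, the (sparse_dim, z) indices matrix has sparse_dim == 0,
# so indices.chunk(indices.size(0), 0) requests zero chunks along dim 0.
indices = np.empty((0, 1), dtype=np.int64)
try:
    np.array_split(indices, 0, axis=0)  # zero sections
except ValueError as e:
    print("fails:", e)
```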

Contributor replied:

I guess you could go one way or another, but chunking an empty tensor seems like a perfectly well defined thing to do :)

gchanan (Contributor, Author) replied:

Yeah, if that would have fixed the issue I would have looked into it more, but considering neither call worked, it didn't seem worth it.

zdevito pushed a commit to zdevito/ATen that referenced this pull request Feb 25, 2019
…#17406)

Summary:
I originally set out to fix to_sparse for scalars, which had some overly restrictive checking (sparse_dim > 0, which is impossible for a scalar).

This fix uncovered an issue with nonzero: it didn't properly return a size (z, 0) tensor for an input scalar, where z is the number of nonzero elements (i.e. 0 or 1).
Pull Request resolved: pytorch/pytorch#17406

Differential Revision: D14185393

Pulled By: gchanan

fbshipit-source-id: f37a6e1e3773fd9cbf69eeca7fdebb3caa192a19