fix off-by-one in findn and findnz #5386

mlubin · 2014-01-13T18:26:38Z

``count`` was equal to the number of nonzero elements plus one, so the output vectors had an extra element with junk.
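The patched code from Base is not quoted in this thread, so the following is only a minimal sketch of how this kind of off-by-one can arise. It assumes a 1-based write cursor named ``count`` that is later reused as a length. The function name `findnz_sketch`, the `offbyone` keyword, and the use of `NaN` to stand in for uninitialized junk are all hypothetical.

```julia
# Hypothetical sketch, not the code from Base.
# Works directly on the three CSC arrays: colptr, rowval, nzval.
function findnz_sketch(colptr, rowval, nzval; offbyone=false)
    ncols = length(colptr) - 1
    # One spare slot so that the buggy variant stays in bounds.
    I = zeros(Int, length(nzval) + 1)
    J = zeros(Int, length(nzval) + 1)
    V = fill(NaN, length(nzval) + 1)   # NaN plays the role of "junk"
    count = 1                          # next write position (1-based)
    for col in 1:ncols
        for k in colptr[col]:(colptr[col + 1] - 1)
            if nzval[k] != 0           # skip explicitly stored zeros
                I[count] = rowval[k]
                J[count] = col
                V[count] = nzval[k]
                count += 1
            end
        end
    end
    # Here count == (number of nonzeros) + 1, so the length is count - 1.
    len = offbyone ? count : count - 1
    return I[1:len], J[1:len], V[1:len]
end

# Same raw arrays as the matrix built in the test for this PR:
# a 2x1 matrix whose second stored entry is an explicit zero.
colptr, rowval, nzval = [1, 3], [1, 2], [1.0, 0.0]
I_ok, J_ok, V_ok = findnz_sketch(colptr, rowval, nzval)
I_bad, J_bad, V_bad = findnz_sketch(colptr, rowval, nzval; offbyone=true)
```

With `offbyone=true` the returned vectors carry one trailing element that was never written, which is the symptom described above.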

StefanKarpinski · 2014-01-13T18:34:59Z

Yikes. Maybe a test for this would be good so that it can't happen in the future?

mlubin · 2014-01-13T18:38:42Z

Sure, I'll add a test.

mlubin · 2014-01-13T18:47:26Z

Done.

JeffBezanson · 2014-01-13T19:07:49Z

That test passes for me without this patch. I'm not sure how to make a sparse matrix with stored zeros at this point.

JeffBezanson · 2014-01-13T19:26:56Z

Thanks!

mlubin · 2014-01-13T19:27:10Z

Oops, I had run the test on 0.2 where it failed. I fixed it now (rebased).

Note that ``make test`` now prints:

WARNING: sparse matrix contains explicit stored zeros

I'm not sure the warning is really needed.

ViralBShah · 2014-01-15T13:14:58Z

test/sparse.jl

@@ -182,3 +182,7 @@ mfe22 = eye(Float64, 2)

 # issue #5169
 @test nnz(sparse([1,1],[1,2],[0.0,-0.0])) == 0
+
+# issue #5386
+I,J,V = findnz(SparseMatrixCSC(2,1,[1,3],[1,2],[1.0,0.0]))


Although this constructor is available, one should really not be creating sparse matrices with stored zeros. A lot of routines assume no stored zeros. Maybe we should not export SparseMatrixCSC?

Creating a sparse matrix with stored zeros should obviously be discouraged, but I don't think it should be an error. One should just have to pay the cost of the extra operations. In JuMP, the sparse constraint matrix follows the structure of the model, and some coefficients can be zero in some instances.

> A lot of routines assume no stored zeros.

Do they run slower with stored zeros, or do they actually crash and burn? If the former, then I would agree with @mlubin; otherwise the routines for which stored zeros would be fatal should just trap any divide-by-zero errors and the like as they happen.

AFAICT the only way to make a sparse matrix with stored zeros is to call SparseMatrixCSC directly, so if nobody did that there would be no problem. @mlubin do you call it directly for performance reasons?

In JuMP we translate the constraint matrix into a SparseMatrixCSC in a single pass. Yes, we could filter out zeros here.

But I think the point is that sparse matrix formats are definitely not one-size-fits-all, so being strict about having no stored zeros is just going to make some applications more difficult. For example, a common trick is to leave "gaps" between columns if we may occasionally need to insert new elements. This could be implemented with a SparseMatrixCSC if stored zeros are accepted. If not, one would need a custom data structure and would instantly lose all of the functionality in Base that might be useful, such as sparse mat-vec products.
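As a rough illustration of the "pay for the extra operations" argument, here is a hand-written CSC mat-vec product over the raw arrays. This is a sketch and not the routine in Base; the name `csc_matvec` and the example data are assumptions made for illustration. For finite inputs, a stored zero contributes nothing to the result and only costs one extra multiply-add.

```julia
# Hypothetical sketch of y = A*x for an m-row matrix in CSC form.
function csc_matvec(m, colptr, rowval, nzval, x)
    y = zeros(Float64, m)
    for col in 1:(length(colptr) - 1)
        for k in colptr[col]:(colptr[col + 1] - 1)
            # A stored zero adds 0.0 * x[col] here: wasted work, same result.
            y[rowval[k]] += nzval[k] * x[col]
        end
    end
    return y
end

# 2x2 matrix [1 0; 0 3] with an explicitly stored zero at position (2,1).
colptr = [1, 3, 4]
rowval = [1, 2, 2]
nzval  = [1.0, 0.0, 3.0]
y = csc_matvec(2, colptr, rowval, nzval, [2.0, 1.0])
```

One caveat: if `x` contains `Inf` or `NaN`, a stored zero is no longer harmless, because `0.0 * Inf` is `NaN` in floating-point arithmetic.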

What's an example of a routine that crashes if explicit zeros are stored?

Once you start doing linear algebra, you always have to check for small values and cancellations. An algorithm that fails with zero entries is likely also going to fail with 1e-50 entries.

Would keeping track of the number of filled entries and how many of those are zero be an option? You'd have to check each inserted value to see if it is zero, but I feel like that might be a trivial cost in the face of the rest of the operation.

I'm not sure the extra complexity is worthwhile. When would one actually need the number of nonzero entries instead of the number of filled entries?

The dense nnz actually counts the number of nonzeros.

Keeping track of the number of filled nonzeros is going to be expensive and impractical. Our sparse operations already do not perform well, and it would be good not to slow them down further.

fix off-by-one in findn and findnz

5446521

mlubin added a commit to mlubin/julia that referenced this pull request Jan 13, 2014

add test for JuliaLang#5386

ae97589

add test for JuliaLang#5386

e9f0c66

JeffBezanson added a commit that referenced this pull request Jan 13, 2014

Merge pull request #5386 from mlubin/patch-2

698ef9d

JeffBezanson merged commit 698ef9d into JuliaLang:master Jan 13, 2014

mlubin deleted the patch-2 branch January 13, 2014 19:47

ViralBShah Jan 15, 2014

mlubin Jan 15, 2014

jiahao Jan 15, 2014

JeffBezanson Jan 15, 2014

mlubin Jan 15, 2014

mlubin Jan 16, 2014

StefanKarpinski Jan 16, 2014

mlubin Jan 16, 2014

ViralBShah Jan 17, 2014

ViralBShah Jan 17, 2014
