TST tests for non-canonical input to sparse matrix operations #3254

jnothman · 2014-01-29T00:54:39Z

A number of sparse matrix formats are designed to treat duplicate entries as their sum; they prefer indices in sorted order, but should still work when the indices are unsorted. The current test suite largely compares functionality against numpy arrays/matrices, and so constructs sparse matrices from those, producing only canonical sparse forms.

This introduces tests for non-canonical forms, which turn up many test failures. I don't intend to fix them all here, but we can perhaps add known-failure decorators.

pv · 2014-01-30T13:42:00Z

scipy/sparse/tests/test_base.py

+    if indptr is None:
+        return (data,) + inds
+    else:
+        return (data,) + inds + 2 * indptr


This probably should read 2 * (indptr,)

No, we're duplicating the data entries; 2*indptr fixes the indptr to point correctly.

And I'd created this helper function in part to work with LIL as well, until I realised (I suppose) that LIL's not meant to handle duplicates like COO and CSR.

Then it should probably be (data,) + inds + (2 * indptr,)?
indptr is an array and cannot be added to a tuple (check the Travis-CI output).

Yes, that ;)
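
The point of this exchange can be sketched as follows. This is a minimal illustration, not the helper from the PR: `duplicate_entries` is a hypothetical name, and splitting each value v into (v - 1, 1) is just one assumed way to create duplicates that sum to the original. Repeating every stored entry twice doubles the length of each row, so `2 * indptr` gives the new row boundaries, and wrapping it in a one-element tuple keeps the return value a plain tuple.

```python
import numpy as np
from scipy.sparse import csr_matrix


def duplicate_entries(data, indices, indptr):
    """Hypothetical helper: store each entry v as the pair (v - 1, 1)."""
    data = np.repeat(data, 2)        # every stored value now appears twice
    data[::2] -= 1                   # first copy becomes v - 1
    data[1::2] = 1                   # second copy becomes 1
    indices = np.repeat(indices, 2)  # both copies share the same column
    # Each row holds twice as many entries, so the row pointers double.
    return (data, indices) + (2 * indptr,)


M = csr_matrix(np.array([[1.0, 0.0, 2.0],
                         [0.0, 3.0, 0.0]]))
arg1 = duplicate_entries(M.data, M.indices, M.indptr)
N = csr_matrix(arg1, shape=M.shape)  # non-canonical: contains duplicates
```

Here `N` stores twice as many entries as `M`, but converting either to a dense array gives the same result, because duplicates are summed on conversion.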

pv · 2014-01-30T13:44:36Z

This test would indeed be useful to add.
For the compressed formats, one could also check if things work when the indices array is too big and contains crap beyond indptr[-1].

knownfailures can be added by overriding the corresponding functions in the Test*NonCanonical classes.
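
A rough sketch of that override pattern, using the standard library's `unittest.expectedFailure` as a stand-in for the known-failure decorator (an assumption made so the example is self-contained). The class names, `make_values`, and the test body are all hypothetical and only mimic the shape of the real test classes.

```python
import unittest


class CommonTests(unittest.TestCase):
    """Hypothetical shared test class, standing in for the common tests."""

    def make_values(self):
        return [1, 2]

    def test_total(self):
        self.assertEqual(sum(self.make_values()), 3)


class NonCanonicalTests(CommonTests):
    """Hypothetical variant whose input is known to break test_total."""

    def make_values(self):
        # Stand-in for non-canonical input that the tested code mishandles.
        return [1, 2, 1]

    @unittest.expectedFailure
    def test_total(self):
        # Overridden only to attach the marker; the inherited check still runs.
        super().test_total()


def run_noncanonical():
    """Run the variant class and return the collected TestResult."""
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(NonCanonicalTests)
    result = unittest.TestResult()
    suite.run(result)
    return result
```

Because only the failing methods are overridden, every other inherited test still runs against the non-canonical input, and the marker can be dropped method by method as fixes land.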

jnothman · 2014-01-30T19:58:48Z

> For the compressed formats, one could also check if things work when the indices array is too big and contains crap beyond indptr[-1].

Do we want this case to work? Or do you mean that we should test that every method throws an error (do we need to be validating that often?)?

pv · 2014-01-30T20:05:26Z

I was wondering whether len(indices) = len(data) = indptr[-1] should be taken as an invariant in the code or not. But maybe it's clearest to assume it's an invariant (doesn't need to be checked, except maybe in __init__ and in self.check_format).

jnothman · 2014-01-30T23:43:43Z

I think we can assume len(indices) == len(data) == indptr[-1] except where
there are functions for the user to set these (init). If the user
manually changes these things, it's their problem.

I'm pushing some known failures...
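
For reference, the invariant under discussion can be written as a small check. This is only a sketch: `satisfies_length_invariant` is a hypothetical name, not a scipy function, and it inspects the public `data`, `indices`, and `indptr` attributes of a compressed matrix.

```python
import numpy as np
from scipy.sparse import csr_matrix


def satisfies_length_invariant(A):
    """Hypothetical check: len(indices) == len(data) == indptr[-1]."""
    return len(A.indices) == len(A.data) == int(A.indptr[-1])


A = csr_matrix(np.array([[0, 1],
                         [2, 0]]))
ok = satisfies_length_invariant(A)
```

A matrix built through the constructor from a dense array satisfies the check; the open question in the thread is only about arrays the user modifies afterwards.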

jnothman · 2014-01-31T00:18:56Z

That's a whole lot of failures, and some are truly broken (abs, add_sub, bool, minmax, sparse_format_conversions, unary_ufunc_overrides; in CSR/CSC: sparse boolean indexing, broadcast element-wise multiply, inverse, solve and getnnz_axis).

I'll rebase on master and try the changes to min/max.

jnothman · 2014-01-31T03:48:23Z

It'll be nice to remove many of the known failures when #3233 is merged. Those cases largely throw an error at the moment, whereas other cases silently return wrong values and should, at a minimum, carry comments noting this fact.

coveralls · 2014-01-31T04:03:11Z

Coverage remained the same when pulling bd391ab on jnothman:test_noncanonical_sparse into 4844c63 on scipy:master.

jnothman · 2014-01-31T04:59:14Z

It turns out add_sub and mu were my fault for not handling uints.
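
As an illustration only (the thread does not show the failing code), this is one way unsigned dtypes can trip up a test helper that negates or decrements values: the arithmetic wraps around instead of going negative.

```python
import numpy as np

values = np.array([0, 3], dtype=np.uint8)
one = np.ones_like(values)   # same dtype, so no type promotion happens

negated = -values            # wraps modulo 256: [0, 253]
decremented = values - one   # 0 - 1 wraps to 255: [255, 2]
```

Neither operation raises an error, so a comparison against a signed or floating-point reference fails without any hint about the cause.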

pv · 2014-01-31T08:58:21Z

The remaining test_mu failures are due to use of assert_array_almost_equal. This function uses absolute tolerances, and it is best to never use it.
assert_allclose is a better alternative.
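
A small sketch of the difference, using default tolerances. The `passes` helper and the sample values are made up for the illustration; the two assertion functions are the ones from `numpy.testing`.

```python
import numpy as np
from numpy.testing import assert_allclose, assert_array_almost_equal


def passes(check, actual, desired):
    """Return True if the given assertion function accepts the pair."""
    try:
        check(actual, desired)
    except AssertionError:
        return False
    return True


big = np.array([1e10])
big_close = big * (1 + 1e-12)   # relative error 1e-12, absolute error ~0.01
small = np.array([1e-10])
small_far = np.array([2e-10])   # off by a factor of two, absolute error 1e-10

results = {
    "almost_equal, big": passes(assert_array_almost_equal, big_close, big),
    "allclose, big": passes(assert_allclose, big_close, big),
    "almost_equal, small": passes(assert_array_almost_equal, small_far, small),
    "allclose, small": passes(assert_allclose, small_far, small),
}
```

With an absolute tolerance, the nearly identical large values are rejected and the small values that differ by a factor of two are accepted; the relative tolerance of `assert_allclose` gets both cases right.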

coveralls · 2014-02-01T13:02:51Z

Coverage remained the same when pulling faedf01 on jnothman:test_noncanonical_sparse into 233ad82 on scipy:master.

TST: sparse: tests for non-canonical input to sparse matrix operations

A number of sparse matrix formats are designed to treat duplicate values as their sum; they prefer indices in sorted order, but should be capable when unsorted. The current test suite largely compares functionality to numpy arrays/matrices, and so constructs sparse matrices from those, producing only canonical sparse forms. These commits introduce tests for non-canonical forms.

pv · 2014-02-01T17:25:15Z

Thanks, merged.

pv reviewed Jan 30, 2014

jnothman added 3 commits January 31, 2014 11:24

TST tests for non-canonical input to sparse matrix operations

4eabdc2

TST correct logic for noncanonical sparse tests

3eaaca5

TST add known failures for non-canonical sparse matrices

bd391ab

TST fix tests to avoid negating unsigned datatypes

f37f094

TST use assert_allclose instead of _array_equal

faedf01

pv merged commit 2b1c323 into scipy:master Feb 1, 2014
