BUG: Fix fancy indexing on compressed sparse arrays with mixed `indices`/ `indptr` dtypes #20183

ivirshup · 2024-03-04T14:50:15Z

Reference issue

Fixes BUG: csr_row_index and csr_column_index error for mixed indices/indptr dtype when they should probably just convert #20182

What does this implement/fix?

This coerces the indices/ indptr arrays instead of erroring

Additional information

I'm not totally sure on the implementation. It could be worth bundling this behaviour into a method, and maybe warning.

ivirshup · 2024-03-05T13:01:15Z

I've modified my PR so that it no longer does inplace modification of the input array. This better fits existing code like: _matmul_sparse and _binopt.

This will have higher peak memory (when the dtypes need to be converted) and may happen more than once.

perimosocordiae

Thanks, @ivirshup. Ideally we wouldn't need to defend against these kinds of index dtype mismatches, but this is a reasonable improvement and it won't cause slowdowns for well-formed sparse arrays.

tylerjereddy · 2024-03-08T18:29:34Z

scipy/sparse/tests/test_csr.py

+
+    indices = [([2, 3, 4], slice(None)), (slice(None), [2, 3, 4])]
+    for idx, mtx in product(indices, [base_mtx, indptr_64bit, indices_64bit]):
+        np.testing.assert_array_equal(mtx[idx].toarray(), base_mtx[idx].toarray())


It looks like CJ already approved, but I did notice that when I revert your source change this regression test still passes.

Something like this, which borrows from your original issue reproducer, fails before and passes after the source patch:

--- a/scipy/sparse/tests/test_csr.py +++ b/scipy/sparse/tests/test_csr.py @@ -183,4 +183,6 @@ def test_mixed_index_dtype_int_indexing(cls): indices = [([2, 3, 4], slice(None)), (slice(None), [2, 3, 4])] for idx, mtx in product(indices, [base_mtx, indptr_64bit, indices_64bit]): - np.testing.assert_array_equal(mtx[idx].toarray(), base_mtx[idx].toarray()) \ No newline at end of file + np.testing.assert_array_equal(mtx[idx].toarray(), base_mtx[idx].toarray()) + base_mtx.indptr = base_mtx.indptr.astype(np.int64) + base_mtx[[1, 2], :]

Any chance we could do something like that? I'm not a sparse regular, so maybe it can be cleaner than that.

Thanks for the catch! I've corrected this, though am a little unsure in what exactly is different.

tylerjereddy · 2024-03-12T18:02:37Z

Ok, I checked locally that we have fail before/pass after with the test now.

The linter is complaining about a few line lengths, but also some other things you didn't touch (same file, but different lines).

The other CI failures were recently fixed in main by Matt H.

I'll go ahead and merge this, overriding the linter as we get ready to branch, etc. I don't think that'll make main fail the linter, but if it does we'll deal with it...

ivirshup added 3 commits March 4, 2024 14:46

Add test

856f174

Implement

0a98c85

reference issue

691bb6c

github-actions bot added the scipy.sparse label Mar 4, 2024

Don't modify input

d810971

ivirshup marked this pull request as ready for review March 5, 2024 13:01

ivirshup requested a review from perimosocordiae as a code owner March 5, 2024 13:01

perimosocordiae approved these changes Mar 6, 2024

View reviewed changes

tylerjereddy added this to the 1.13.0 milestone Mar 8, 2024

tylerjereddy changed the title ~~Fix fancy indexing on compressed sparse arrays with mixed indices/ indptr dtypes~~ BUG: Fix fancy indexing on compressed sparse arrays with mixed indices/ indptr dtypes Mar 8, 2024

tylerjereddy added the defect A clear bug or issue that prevents SciPy from being installed or used as expected label Mar 8, 2024

tylerjereddy reviewed Mar 8, 2024

View reviewed changes

fix test

4f858f7

tylerjereddy merged commit 33e93e0 into scipy:main Mar 12, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

BUG: Fix fancy indexing on compressed sparse arrays with mixed `indices`/ `indptr` dtypes #20183

BUG: Fix fancy indexing on compressed sparse arrays with mixed `indices`/ `indptr` dtypes #20183

Uh oh!

ivirshup commented Mar 4, 2024 •

edited

Loading

Uh oh!

ivirshup commented Mar 5, 2024

Uh oh!

perimosocordiae left a comment

Uh oh!

tylerjereddy Mar 8, 2024

Uh oh!

ivirshup Mar 11, 2024

Uh oh!

tylerjereddy commented Mar 12, 2024

Uh oh!

Uh oh!

Uh oh!

BUG: Fix fancy indexing on compressed sparse arrays with mixed indices/ indptr dtypes #20183

BUG: Fix fancy indexing on compressed sparse arrays with mixed indices/ indptr dtypes #20183

Uh oh!

Conversation

ivirshup commented Mar 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Reference issue

What does this implement/fix?

Additional information

Uh oh!

ivirshup commented Mar 5, 2024

Uh oh!

perimosocordiae left a comment

Choose a reason for hiding this comment

Uh oh!

tylerjereddy Mar 8, 2024

Choose a reason for hiding this comment

Uh oh!

ivirshup Mar 11, 2024

Choose a reason for hiding this comment

Uh oh!

tylerjereddy commented Mar 12, 2024

Uh oh!

Uh oh!

BUG: Fix fancy indexing on compressed sparse arrays with mixed `indices`/ `indptr` dtypes #20183

BUG: Fix fancy indexing on compressed sparse arrays with mixed `indices`/ `indptr` dtypes #20183

ivirshup commented Mar 4, 2024 •

edited

Loading