Remerge gh-9619: "FIX: Sparse matrix addition/subtraction eliminates explicit zeros" #9958

ananyashreyjain · 2019-03-19T12:29:37Z

Re-merge of #9619.
Binary operations on sparse matrices remove explicit zeros. These changes preserve explicit zeros in the output matrix.
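
To make the terminology concrete, here is a minimal pure-Python sketch of a single CSR row, assuming the usual layout of sorted column indices plus stored values. The helper names `stored_entries` and `drop_explicit_zeros` are hypothetical and exist only for illustration; they are not SciPy APIs.

```python
# One row of a 4-column CSR matrix: column indices and the values stored there.
# Column 0 holds an explicit zero (it is stored); column 2 is an implicit
# zero (nothing is stored for it).
indices = [0, 1, 3]
data = [0, 5, 7]


def stored_entries(indices, data):
    """Return the (column, value) pairs physically stored in the row."""
    return list(zip(indices, data))


def drop_explicit_zeros(indices, data):
    """Mimic the behaviour described above: stored zeros are discarded."""
    kept = [(j, x) for j, x in zip(indices, data) if x != 0]
    return [j for j, _ in kept], [x for _, x in kept]
```

After `drop_explicit_zeros`, column 0 is indistinguishable from column 2, which is the loss of structure this PR aims to avoid.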

ananyashreyjain · 2019-03-19T13:56:12Z

scipy/sparse/sparsetools/csr.h

+    bool addsub = false;
+
+    //checks the type of binary operation to be performed.
+    if((int)op(8, 4) == 12 || (int)op(8, 4) == 4)


Simple check to find if the binary operation is plus or minus.

I'd much prefer defining a separate function for addition/subtraction, rather than special casing logic here.

I had doubts about this too, but I wanted to keep the code small, which is why I chose the special-casing logic here. In any case, I have now made three separate functions: one handles relational operations and the other two handle the arithmetic operations.
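
As a rough Python analogue of the probe in the diff above (the function name `looks_like_addsub` is hypothetical), the sketch below evaluates the operation on (8, 4) and compares the result with 12 and 4. It also illustrates one reason a dedicated function may be preferable: any unrelated operation that happens to return 12 or 4 on these inputs would be misclassified.

```python
import operator


def looks_like_addsub(op):
    """Probe op on (8, 4), as the C++ check in the diff does."""
    probe = int(op(8, 4))
    return probe == 12 or probe == 4


is_add = looks_like_addsub(operator.add)  # 8 + 4 == 12
is_sub = looks_like_addsub(operator.sub)  # 8 - 4 == 4
is_mul = looks_like_addsub(operator.mul)  # 8 * 4 == 32
is_min = looks_like_addsub(min)           # min(8, 4) == 4, same as 8 - 4
```

Here `is_min` comes out true even though `min` is neither addition nor subtraction, so the probe cannot tell the two apart.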

ananyashreyjain · 2019-03-19T13:59:13Z

scipy/sparse/sparsetools/csr.h

                Cj[nnz] = Aj[A_pos];
-                Cx[nnz] = result;
+                Cx[nnz] = Ax[A_pos];


op(0, Bx[B_pos]) is used for Bx but plain Ax[A_pos] for Ax: if the binary operation is subtraction, Bx has to be negated, which is not the case for Ax.
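
The asymmetry can be checked with a small Python sketch. The helpers `a_only` and `b_only` are hypothetical names for the two cases where a column is stored in only one of the matrices, so the other operand is an implicit zero.

```python
import operator


def a_only(op, a):
    """Entry stored only in A: the implicit B value is 0."""
    return op(a, 0)


def b_only(op, b):
    """Entry stored only in B: the implicit A value is 0."""
    return op(0, b)


# For both + and -, op(a, 0) is just a, so A's value can be copied directly.
a_results = [a_only(op, 7) for op in (operator.add, operator.sub)]

# For B the sign matters: 0 + 7 is 7, but 0 - 7 is -7.
b_results = [b_only(op, 7) for op in (operator.add, operator.sub)]
```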

ananyashreyjain · 2019-03-19T14:29:37Z

Benchmark results:

before	after	ratio
1.84±0.01ms	2.02±0.01ms	1.10	sparse.Arithmetic.time_arithmetic('csr', 'AA', 'sub')
33.2±0.4 ms	35.3±0.07ms	1.06	sparse.Arithmetic.time_arithmetic('csr', 'BB', 'mul')

ananyashreyjain · 2019-03-19T15:33:53Z

@perimosocordiae I have separated the code that performs addition and subtraction from the rest, so that the other operations are unaffected by this change. The separation was necessary because adding an extra condition to check for explicit zeros in the bottleneck loop was hurting performance. For addition and subtraction, some checks become redundant and can be replaced with the checks for explicit zeros.

perimosocordiae

What's the performance hit for doing the more generic explicit zeros check, instead of special-casing the addition/subtraction case?

perimosocordiae · 2019-03-19T22:17:10Z

scipy/sparse/tests/test_base.py

+        data1 = np.array([0, 5, 7, 9])
+        data2 = np.array([0, 4, 6, 8])
+        m1 = coo_matrix((data1, (row, col)), shape=(4, 4))
+        m2 = coo_matrix((data2, (row, col)), shape=(4, 4))


This is the CSR matrix test class, so these should be CSR matrices.

You are right, I will change them to CSR matrices.

perimosocordiae · 2019-03-19T22:23:21Z

scipy/sparse/sparsetools/csr.h

+    bool addsub = false;
+
+    //checks the type of binary operation to be performed.
+    if((int)op(8, 4) == 12 || (int)op(8, 4) == 4)


I'd much prefer defining a separate function for addition/subtraction, rather than special casing logic here.

perimosocordiae · 2019-03-19T22:24:37Z

scipy/sparse/sparsetools/csr.h

@@ -776,7 +801,7 @@ void csr_binop_csr_general(const I n_row, const I n_col,
 * Note:
 *   Input:  A and B column indices are assumed to be in sorted order
 *   Output: C column indices will be in sorted order
- *           Cx will not contain any zero entries
+ *           Cx will not contain any implicit zero entries


I think you meant "explicit" here.

Cx initially contained only the non-zero values, but after these changes it can also contain explicit zero values. That means Cx will have explicit zeros but not implicit ones.

ananyashreyjain · 2019-03-19T23:15:31Z

@perimosocordiae For addition and subtraction, if the result of the operation is zero and the value in one of the matrices is zero, then the value in the other is definitely zero too, but this does not hold for multiplication and division. In the latter case the entries in both matrices have to be checked, which adds one more condition in the bottleneck loop. This may increase the run time by a factor of about 1.20.
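
The claim can be checked by brute force over a small range of integers. This is only a sketch: the function name `one_check_suffices` and the sample range are assumptions, and division is left out to avoid dividing by zero.

```python
import itertools
import operator

# Small sample of integer operands, including zero and negatives.
values = range(-3, 4)


def one_check_suffices(op):
    """True if, over the sample, result == 0 and a == 0 always implies b == 0."""
    return all(
        b == 0
        for a, b in itertools.product(values, repeat=2)
        if a == 0 and op(a, b) == 0
    )


add_ok = one_check_suffices(operator.add)
sub_ok = one_check_suffices(operator.sub)
mul_ok = one_check_suffices(operator.mul)  # fails: 0 * 3 == 0 with b != 0
```

For addition and subtraction a single comparison on one operand is enough to recognise a zero that came from stored zeros; for multiplication both operands have to be inspected.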

ananyashreyjain · 2019-03-29T22:11:18Z

Benchmark Results:

before	after	ratio
33.6±0.1ms	36.0±0.2ms	1.07	sparse.Arithmetic.time_arithmetic('csr', 'BB', 'mul')
4.86±0.06ms	4.59±0.05ms	0.94	sparse.Arithmetic.time_arithmetic('csr', 'AB', 'multiply')

ananyashreyjain · 2019-03-29T23:00:59Z

@perimosocordiae I have split csr_binop_csr_canonical() and csr_binop_csr_general() into three parts that handle relational operations (>, <, <=, >=, etc.), addition/subtraction, and multiplication/division separately. I have modified these functions slightly so that checking for explicit zeros does not cost much performance. After these changes the explicit zero check works for all the arithmetic operations.
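
For readers unfamiliar with the canonical-format merge, the following pure-Python sketch shows what a dedicated addition/subtraction path over one pair of sorted rows could look like. It is not the C++ code from this PR: the function name `addsub_rows`, the `sign` parameter, and the keep policy (zeros are kept only when A's stored value is itself zero, while zeros produced by cancellation are dropped) are assumptions made for illustration.

```python
def addsub_rows(a_idx, a_val, b_idx, b_val, sign=1):
    """Merge two sorted CSR rows, computing a + sign * b column by column."""
    out_idx, out_val = [], []
    i = j = 0
    while i < len(a_idx) and j < len(b_idx):
        if a_idx[i] == b_idx[j]:
            r = a_val[i] + sign * b_val[j]
            # Keep nonzero results, and zeros that come from stored zeros.
            # With r == 0 and a == 0, b must be 0 too, so one check suffices.
            if r != 0 or a_val[i] == 0:
                out_idx.append(a_idx[i])
                out_val.append(r)
            i += 1
            j += 1
        elif a_idx[i] < b_idx[j]:
            # Column stored only in A: copy A's value unchanged.
            out_idx.append(a_idx[i])
            out_val.append(a_val[i])
            i += 1
        else:
            # Column stored only in B: apply the sign of the operation.
            out_idx.append(b_idx[j])
            out_val.append(sign * b_val[j])
            j += 1
    # Copy whatever remains in either row.
    out_idx.extend(a_idx[i:])
    out_val.extend(a_val[i:])
    out_idx.extend(b_idx[j:])
    out_val.extend(sign * x for x in b_val[j:])
    return out_idx, out_val


row_sum = addsub_rows([0, 1, 3], [0, 5, 7], [0, 1, 2], [0, -5, 6])
row_diff = addsub_rows([0, 1, 3], [0, 5, 7], [0, 1, 2], [0, -5, 6], sign=-1)
```

In `row_sum` the explicit zero in column 0 survives, while the cancellation 5 + (-5) in column 1 is dropped under the assumed policy.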

ananyashreyjain · 2019-04-11T08:23:55Z

@perimosocordiae did you get time to go through the changes I made?

carldlaird · 2020-01-31T18:32:17Z

Is there an update on this PR?

ananyashreyjain changed the title ~~Remerge gh-9619: "FIX: Sparse matrix addition/subtraction eliminates explicit zeros"~~ [WIP] Remerge gh-9619: "FIX: Sparse matrix addition/subtraction eliminates explicit zeros" Mar 19, 2019

ananyashreyjain force-pushed the main_zeros branch from 94e8fa0 to a306d75 Compare March 19, 2019 13:52

ananyashreyjain changed the title ~~[WIP] Remerge gh-9619: "FIX: Sparse matrix addition/subtraction eliminates explicit zeros"~~ Remerge gh-9619: "FIX: Sparse matrix addition/subtraction eliminates explicit zeros" Mar 19, 2019

perimosocordiae requested changes Mar 19, 2019

ananyashreyjain and others added 5 commits March 30, 2019 03:20

exp_zeros

4018ed0

removal

4cae5d4

tests

5e85bd1

remove_extra_file

4a14c94

removal_of_extra_spaces

d535910

ananyashreyjain force-pushed the main_zeros branch 2 times, most recently from fa19098 to d535910 Compare March 29, 2019 22:01

ananyashreyjain added 2 commits March 30, 2019 03:33

Additional functions

ed18dc4

removal of spaces

e9384d4

pvanmulbregt added the scipy.sparse label May 9, 2019

lucascolley added the maintenance Items related to regular maintenance tasks label Dec 23, 2023
