ENH: sparse.csgraph: a faster DFS implementation: (i) linear (ii) releases gil #4166

atiasnir · 2014-11-18T19:18:16Z

I've re-implemented depth_first_order.

The new implementation offers the following advantages:

linear in the the size of the input graph
releases gil (does not use the interpreter at all)

As a result the new implementation is much faster.

In addition, this new implementation offers a deviation from the original functionality by representing the root of the tree using a self loop (which, to the best of my knowledge, is more standard). The original implementation uses "-9999" which introduces indices outside the domain of the nodes. Currently this behavior is only available through a deisgnated keyword.

…ptions in the original implementation (like using -9999 for tree root) are supported but can be optionally removed (a cleaner way of designating tree top is to add a self loop)

perimosocordiae · 2014-11-18T19:43:38Z

Looks like you should be using ITYPE and ITYPE_t instead of plain int types, because it looks like this will break on sparse inputs with 64-bit indices.

Also, more of a general question: the other functions in this module use the older Cython array types: np.ndarray[ITYPE_t, ndim=1, mode='c']. Should new code use that style, or the newer memoryview types (like this PR)?

atiasnir · 2014-11-18T19:48:02Z

Actually, I can't use the ITYPE and ITYPE_t because these are numpy types. int is a native type and is 64bit.

Note also that key to the implementation is the use of memory views which are more efficient than using the numpy arrays and allows the implementation to release the gil.

Indeed I think that this is prefered to using numpy arrays.

Just for the record, I am testing on a 64 bit system and the testsuite runs with no errors.

pv · 2014-11-18T19:59:42Z

New code needs to use the same approach to types as the rest of the
csgraph code. All of csgraph only supports 32-bit indices currently,
but the code base needs to be transitioned all at the same time.
.
int is 32-bit on Windows, also on win64. Note also that Scipy sparse
matrices use 32-bit indices on all platforms, also 64-bit, if the
matrices are small enough.

larsmans · 2014-11-18T20:19:34Z

scipy/sparse/csgraph/_traversal.pyx

+    cdef int order_end = 0
+
+    cdef int* status = <int*> stdlib.malloc(n*sizeof(int))
+    cdef DfsStackEntry* stack = <DfsStackEntry*>stdlib.malloc(n * sizeof(DfsStackEntry))


Unchecked mallocs! Why not use two NumPy arrays of the appropriate type and get automatic memory allocation for free?

I can use a numpy array for the status buffer. However the stack needs to save both node and neighbor index.
As a side note, the allocation of numpys array is way slower and could be avoided for internal buffers. I used it only for return values.
Finally, I've added the tests for the memory allocation in a separate commit.

larsmans · 2014-11-18T20:21:50Z

I must say I find this a lot more readable than the old code; csgraph algorithms often don't look like graph algorithms the way I know them.

atiasnir · 2014-11-18T20:35:34Z

I totally agree with pv. I've added a commit to respect both indexing schemes by using the fused type from parameters.px.

larsmans · 2014-11-18T21:04:48Z

scipy/sparse/csgraph/_traversal.pyx

+    cdef int* status = <int*> stdlib.malloc(n*sizeof(int))
+    if not status:
+        with gil:
+            raise MemoryError()


This doesn't actually work the way you think it does (as I've found out the hard way). This raise will abort the function, but because it's marked nogil, it has no way of propagating the exception. Instead it will log the exception to stderr and return.

In these situations, I usually return an error code, then check for that in a function that is not nogil.

In that case I'm guessing that the try-finally block is as useless as raising an exception. I've changed the code to reflect that.

…ng to support both 32bit and 64bit indexing schemes as fused types are not allowed in structs

atiasnir · 2014-11-18T21:16:53Z

Thank you for the valuable advice on raising exceptions from nogil functions.

larsmans · 2014-11-18T21:58:27Z

scipy/sparse/csgraph/_traversal.pyx

@@ -399,9 +399,76 @@ cdef unsigned int _breadth_first_undirected(

    return i_nl

+cdef struct DfsStackEntry:
+    long node
+    long index


long is not reliable. It's 32 bits wide on 64-bit Windows. Consider using Py_ssize_t.

argriffing · 2014-11-18T23:27:17Z

Looks like the return of symengine/symengine#308. Does scipy need a PR analogous to numpy/numpy#5271?

… across the module

atiasnir · 2014-11-19T18:40:44Z

Any advice on how to proceed? It does not seem to be related to the commit.

ev-br · 2014-11-19T19:07:25Z

Travis CI failure is unrelated. I've restarted the build, let's see how it fares this time

chebee7i · 2014-12-05T18:31:16Z

Naive question here, but does this actually release the gil? http://docs.cython.org/src/userguide/external_C_code.html#declaring-a-function-as-callable-without-the-gil says that nogil only declares it as not requiring the gil. You must still release it with nogil.

atiasnir · 2014-12-07T07:17:57Z

I've checked the resulting c code and it seems to be the case. I'll submit a fix for that.

rgommers · 2015-05-10T21:00:25Z

@atiasnir are you still planning to make the changes to release the GIL discussed above? This PR seems close to ready.

A faster DFS implementation: (i) linear (ii) releases gil. Some assum…

e8c2faf

…ptions in the original implementation (like using -9999 for tree root) are supported but can be optionally removed (a cleaner way of designating tree top is to add a self loop)

larsmans reviewed Nov 18, 2014
View reviewed changes

respect both 32bit and 64bit indexing for the DFS implementation

6760907

explicitly check memory allocations in the _depth_first_iterative

e12e1e7

larsmans reviewed Nov 18, 2014
View reviewed changes

fixed memory allocation checks. changed types for DfsStackEntry to lo…

f3010f2

…ng to support both 32bit and 64bit indexing schemes as fused types are not allowed in structs

larsmans reviewed Nov 18, 2014
View reviewed changes

Use Py_ssize_t instead of long to handle 64bit indices

ecafc4f

Changed Py_ssize_t to np.int64_t to be more consistent with the types…

617f24c

… across the module

rgommers added scipy.sparse.csgraph enhancement A new feature or improvement labels Dec 7, 2014

ev-br added the needs-work Items that are pending response from the author label Nov 9, 2015

lucascolley changed the title ~~A faster DFS implementation: (i) linear (ii) releases gil. Some assumpti...~~ ENH: sparse.csgraph: a faster DFS implementation: (i) linear (ii) releases gil Mar 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH: sparse.csgraph: a faster DFS implementation: (i) linear (ii) releases gil #4166

ENH: sparse.csgraph: a faster DFS implementation: (i) linear (ii) releases gil #4166

atiasnir commented Nov 18, 2014

perimosocordiae commented Nov 18, 2014

atiasnir commented Nov 18, 2014

pv commented Nov 18, 2014

larsmans Nov 18, 2014

atiasnir Nov 18, 2014

larsmans commented Nov 18, 2014

atiasnir commented Nov 18, 2014

larsmans Nov 18, 2014

atiasnir Nov 18, 2014

atiasnir commented Nov 18, 2014

larsmans Nov 18, 2014

argriffing commented Nov 18, 2014

atiasnir commented Nov 19, 2014

ev-br commented Nov 19, 2014

chebee7i commented Dec 5, 2014

atiasnir commented Dec 7, 2014

rgommers commented May 10, 2015

ENH: sparse.csgraph: a faster DFS implementation: (i) linear (ii) releases gil #4166

Are you sure you want to change the base?

ENH: sparse.csgraph: a faster DFS implementation: (i) linear (ii) releases gil #4166

Conversation

atiasnir commented Nov 18, 2014

perimosocordiae commented Nov 18, 2014

atiasnir commented Nov 18, 2014

pv commented Nov 18, 2014

larsmans Nov 18, 2014

Choose a reason for hiding this comment

atiasnir Nov 18, 2014

Choose a reason for hiding this comment

larsmans commented Nov 18, 2014

atiasnir commented Nov 18, 2014

larsmans Nov 18, 2014

Choose a reason for hiding this comment

atiasnir Nov 18, 2014

Choose a reason for hiding this comment

atiasnir commented Nov 18, 2014

larsmans Nov 18, 2014

Choose a reason for hiding this comment

argriffing commented Nov 18, 2014

atiasnir commented Nov 19, 2014

ev-br commented Nov 19, 2014

chebee7i commented Dec 5, 2014

atiasnir commented Dec 7, 2014

rgommers commented May 10, 2015