Graph theoretic algorithms for internal use (e.g. Tarjan's algorithm) #16174

oscarbenjamin · 2019-03-06T00:05:22Z

I have thought of a couple of patches that would benefit from having algorithms for e.g. finding connected components of a graph. Do any graphs theoretic algorithms exist anywhere for internal use in SymPy?

I see #8186 rejects the idea of adding a graph theory module for external use but this is about internal use.

I have two examples:

Given a system of ODEs we can define a directed graph where the variables are the vertices and an equation involving f_i(x).diff(x) and f_j(x) is an edge from i to j. The weakly connected components of this graph represent subsets of the system of equations that can be solved independently. The strongly connected components represent the coupled systems of ODEs that have to be solved as units.
Eigenvalues of block diagonal or block diagonal permuted matrices. Viewing a matrix as an adjacency matrix we can partition the row/col indices into ~~strongly~~ connected components. Those ~~strongly~~ connected components can be solved as reduced problems in a divide and conquer style approach. This would give a significant performance boost for many matrices.

For these two it would be useful to have e.g. Tarjan's algorithm:
https://en.wikipedia.org/wiki/Tarjan%27s_strongly_connected_components_algorithm

I'm opening this issue to ask:

Does this already exist in SymPy?
If not is there a good place to put it?
Are there other parts of SymPy that could benefit from something like this?

asmeurer · 2019-03-06T00:16:08Z

We have topological_sort, which is useful in several places. If a graph algorithm is needed for some library code we should implement it. Tarjan's algorithm, like the topological sort algorithm, doesn't seem that complicated, but it would be useful to implement it in one place.

Can the algorithm you describe for ODEs apply to solving regular systems of equations as well?

oscarbenjamin · 2019-03-06T00:28:35Z

It could be used for regular systems of equations but there we would have undirected graphs. So the idea would be that given

a + b = 2
a - b = 1
c + d = 1
c - d = 2

we could partition that into two subproblems for (a,b) and (c,d). I'm not sure how useful that wold be or if it already happens.

In the case of ODEs we can do the same but we can also go a step further since for ODEs we have a directed graph so given:

f(t).diff(t) = 1
g(t).diff(t) = f(t) + h(t)
h(t).diff(t) = g(t) + f(t)
z(t).diff(t) = h(t)

We can identify that (g,h) is a strongly connected component that must be solved as a unit. The variable f is not disconnected from the graph but is not in a strongly connected component with any other variables so that ODE can be solved independently. Once we've solved for f we can eliminate it from the equations for g,h. Then they can be solved as a separate system. Once we have the solution for h the equation for z becomes a single ODE that can be solved independently.

For the ODE module this could greatly expand the number of systems that are matched since we only need to match the strongly connected components of the system. Making use of it needs good support for non-homogeneous / non-autonomous systems though.

asmeurer · 2019-03-06T00:36:10Z

Ah. So this actually combines nicely with topological_sort, since the strongly connected components create a DAG. The topological_sort tells you what order to solve the sub-systems in. Although apparently Tarjan's algorithm already returns the components in (reverse) topological order). I suspect many places that use topological_sort could be using this instead, as a more generalized way to handle graphs with cycles.

asmeurer · 2019-03-06T00:47:46Z

For example you can topologically sort a substitution, to make it happen "in order". #6257

If there's a cycle, like expr.subs({x: y, y: z}), it fails. But maybe this can be generalized to finding the strongly connected components and substituting each strongly connected component with simultaneous=True, doing each component in topological order.

oscarbenjamin · 2019-03-06T01:04:30Z

I would describe expr.subs({x: y, y: z}) as a chain rather than a cycle.

We could certainly partition the substitutions into strongly connected components. Would those components be unambiguous though. What about something where one replacement expression appear in another e.g. expr.subs({x:y, exp(x):z})?

oscargus · 2019-03-06T13:29:14Z

I think something similar can be used when evaluating summations with symbols in the limits, see #15767 where one approach is to reorder the summation order. (Which will not solve the issue at hand, but still can be useful if deciding to do an explicit evaluation as it may split it into several independent summations/evaluations.)

Edit: and therefore also integrals with symbols in the limits, although I do not know if there is a similar issue there.

asmeurer · 2019-03-06T20:46:43Z

We could certainly partition the substitutions into strongly connected components. Would those components be unambiguous though. What about something where one replacement expression appear in another e.g. expr.subs({x:y, exp(x):z})?

I hadn't thought about that. Maybe you would also need an edge if one term's "old" also contains another's. I'm mostly thinking out loud with this idea. I don't know if it is a good one, just trying to think of ways this algorithm could be useful.

oscarbenjamin · 2019-03-07T00:01:08Z

Here's an implementation of Tarjan's algorithm:

def strongly_connected_components(vertices, edgefunc):
    '''Strongly connected components of a graph

    vertices is an iterable giving the vertices of the graph.
    edgefunc(vertex) gives an iterable of vertices that have edges from
    vertex.
    '''
    def follow(v1):
        # Add to the data structures
        index = len(stack)
        indices[v1] = lowlink[v1] = index
        stack.append(v1)

        # Recurse over descendants
        for v2 in edgefunc(v1):
            if v2 not in indices:
                follow(v2)
                lowlink[v1] = min(lowlink[v1], lowlink[v2])
            elif v2 in stack:
                lowlink[v1] = min(lowlink[v1], indices[v2])

        # Pop off complete connected components
        if lowlink[v1] == indices[v1]:
            component = [stack.pop()]
            while component[-1] is not v1:
                component.append(stack.pop())
            components.append(component[::-1])

    lowlink = {}
    indices = {}
    stack = []
    components = []
    for v in vertices:
        if v in indices:
            continue
        follow(v)

    return components

# This represents a directed graph which in dot notation would look like:
# digraph G {
#     A -> B
#     A -> C
#     A -> D
#     B -> C
#     C -> B
#     C -> D
# }
graph = {
    'A': ('B', 'C', 'D'),
    'B': ('C'),
    'C': ('B', 'D'),
    'D': (),
}

for c in strongly_connected_components(graph, graph.__getitem__):
    print(c)

Output:

$ python scc.py 
['D']
['B', 'C']
['A']

asmeurer · 2019-03-07T22:13:27Z

Can it use the same input format as topological_sort. Input format is one thing that a real graph library like networkx will provide more flexibility over, but we should just pick a format that is convenient and use it.

Is min called over integers or over graph elements? If it's graph elements we should allow to pass a key function to it.

friyaz · 2019-03-08T03:53:13Z

Cain it use the same input format as topological_sort.

Input format can be easily changed.

Is min called over integers or over graph elements? If it's graph elements we should allow to pass a key > function to it.

It is called over integers.

oscarbenjamin · 2019-03-08T17:41:54Z

Here's a version that takes the same format as topological_sort:

from sympy import *

def strongly_connected_components(G):
    '''Strongly connected components of a graph

    G is a tuple (V, E) with V the vertices and E the edges as (v1, v2) pairs.
    '''
    V, E = G
    Gmap = {vi: [] for vi in V}
    for v1, v2 in E:
        Gmap[v1].append(v2)

    def follow(v1):
        # Add to the data structures
        index = len(stack)
        indices[v1] = lowlink[v1] = index
        stack.append(v1)

        # Recurse over descendants
        for v2 in Gmap[v1]:
            if v2 not in indices:
                follow(v2)
                lowlink[v1] = min(lowlink[v1], lowlink[v2])
            elif v2 in stack:
                lowlink[v1] = min(lowlink[v1], indices[v2])

        # Pop off complete connected components
        if lowlink[v1] == indices[v1]:
            component = [stack.pop()]
            while component[-1] is not v1:
                component.append(stack.pop())
            components.append(component[::-1])

    lowlink = {}
    indices = {}
    stack = []
    components = []
    for vi in V:
        if vi in indices:
            continue
        follow(vi)

    return components

# This represents a directed graph which in dot notation would look like:
# digraph G {
#     A -> B
#     A -> C
#     A -> D
#     B -> C
#     C -> B
#     C -> D
# }
V = ['A', 'B', 'C', 'D']
E = [
    ('A', 'B'),
    ('A', 'C'),
    ('A', 'D'),
    ('B', 'C'),
    ('C', 'B'),
    ('C', 'D'),
]
G = (V, E)

for c in strongly_connected_components(G):
    print(c)

oscarbenjamin · 2019-03-08T17:52:16Z

Here's an example of how we could use this in solve to partition a system of equations (this is an undirected graph so we're just finding the connected components). The function solve_scc solves a system of equations by first partitioning the system into connected components and is faster than solve.

def solve_scc(eqs, syms):
    # Build graph
    V = syms
    E = []
    # Map back from syms to eqs
    eqmap = {s:set() for s in syms}
    for eq in eqs:
        eqsyms = eq.free_symbols & set(syms)
        for s1 in eqsyms:
            eqmap[s1].add(eq)
            for s2 in eqsyms:
                if s1 is s2:
                    break
                E.append((s1, s2))
                E.append((s2, s1))
    G = (V, E)
    # Find coupled subsystems:
    coupled_syms = strongly_connected_components(G)

    # Solve subsystems and combine results
    soldict = {}
    for csyms in coupled_syms:
        ceqs = set.union(*[eqmap[s] for s in csyms])
        csol = solve(ceqs, csyms, dict=True)
        assert len(csol) == 1
        soldict.update(csol[0])
    return [soldict]


# Build a big-ish linear system:
Nrep = 5 # Nrep*3 equations
syms = symbols('x:%d' % (3*Nrep,))
a, b, c = symbols('a b c')

B = Matrix([
    [a,  b,  c],
    [a, -b,  c],
    [a,  b, -c],
])
rhs = [1, 2, 3] * Nrep
M = BlockMatrix([[B if i == j else zeros(3, 3) for i in range(Nrep)] for j in range(Nrep)])
M = M.as_explicit()
#pprint(M)

b = Matrix([[bi] for bi in rhs])
x = Matrix([[si] for si in syms])

eqs = list(M*x - b)

#pprint(eqs)
import time

# Solve with scc_solve:
start = time.time()
sol_scc = solve_scc(eqs, syms)
print('solve_scc:', time.time() - start, 'seconds')

# Solve with solve:
start = time.time()
sol = solve(eqs, syms, dict=True)
print('solve:', time.time() - start, 'seconds')

Running this the timings are:

$ python scc.py 
solve_scc: 0.47367119789123535 seconds
solve: 35.56143808364868 seconds

The differences grow as a system like this gets larger (increase Nrep). Solving an NxN linear system is maybe O(N**3). Here we trade a single system of size N for k systems of size N/k which is then O(N**3/k**2). So the improvement is quadratic in the number of subsystems that we can break down to.

oscargus · 2019-03-09T13:11:55Z

I think it would make sense to add strongly_connected_components. We both have usecases for it, so go ahead, I would say!

sylee957 added Enhancement Uncategorised Doesn't fit other labels... labels Mar 6, 2019

oscarbenjamin mentioned this issue Mar 8, 2019

Faster eigenvalues using connected components #16207

Closed

oscarbenjamin mentioned this issue Mar 10, 2019

Add connected_components function in iterables #16225

Merged

smichr closed this as completed in #16225 Mar 16, 2019

asmeurer mentioned this issue Feb 23, 2020

Refactor the ODE module using classes for solvers #18348

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Graph theoretic algorithms for internal use (e.g. Tarjan's algorithm) #16174

Graph theoretic algorithms for internal use (e.g. Tarjan's algorithm) #16174

oscarbenjamin commented Mar 6, 2019 •

edited

asmeurer commented Mar 6, 2019

oscarbenjamin commented Mar 6, 2019

asmeurer commented Mar 6, 2019

asmeurer commented Mar 6, 2019

oscarbenjamin commented Mar 6, 2019

oscargus commented Mar 6, 2019 •

edited

asmeurer commented Mar 6, 2019

oscarbenjamin commented Mar 7, 2019

asmeurer commented Mar 7, 2019

friyaz commented Mar 8, 2019

oscarbenjamin commented Mar 8, 2019

oscarbenjamin commented Mar 8, 2019

oscargus commented Mar 9, 2019

Graph theoretic algorithms for internal use (e.g. Tarjan's algorithm) #16174

Graph theoretic algorithms for internal use (e.g. Tarjan's algorithm) #16174

Comments

oscarbenjamin commented Mar 6, 2019 • edited

asmeurer commented Mar 6, 2019

oscarbenjamin commented Mar 6, 2019

asmeurer commented Mar 6, 2019

asmeurer commented Mar 6, 2019

oscarbenjamin commented Mar 6, 2019

oscargus commented Mar 6, 2019 • edited

asmeurer commented Mar 6, 2019

oscarbenjamin commented Mar 7, 2019

asmeurer commented Mar 7, 2019

friyaz commented Mar 8, 2019

oscarbenjamin commented Mar 8, 2019

oscarbenjamin commented Mar 8, 2019

oscargus commented Mar 9, 2019

oscarbenjamin commented Mar 6, 2019 •

edited

oscargus commented Mar 6, 2019 •

edited