Type class vs data type Graph #5

snowleopard · 2017-03-01T14:25:05Z

At the moment we have a type class Graph and a data type called Basic that is its direct deep embedding (I also sometimes refer to it as Expr). However, it would be really nice to call the data type Graph too, so that it joins Tree, Forest and the rest of the family of nice algebraic data types.

So, I propose the following:

The data type Graph lives in Algebra.Graph.Data: ~~Of course we need to convince GHC to add it to containers, but it's really very lightweight.~~ See Type class vs data type Graph #5 (comment).
```
data Graph a = Empty | Vertex a | Overlay (Graph a) (Graph a) | Connect (Graph a) (Graph a) deriving Show
```
The type class Graph lives in Algebra.Graph. It also defines the instance Graph Data.Graph.
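To make the split concrete, here is a minimal single-file sketch of the data type, the type class, and the instance connecting them. The class is called `GraphClass` here only because one file cannot hold a class and a data type with the same name; that name and the helper `edge` are assumptions for illustration, not part of the proposal.

```haskell
{-# LANGUAGE TypeFamilies #-}

-- Deep embedding: one constructor per graph construction primitive.
data Graph a
    = Empty
    | Vertex a
    | Overlay (Graph a) (Graph a)
    | Connect (Graph a) (Graph a)
    deriving Show

-- The type class (named Graph in the proposal; renamed here only to
-- avoid the name clash inside a single file).
class GraphClass g where
    type Vertex g
    empty   :: g
    vertex  :: Vertex g -> g
    overlay :: g -> g -> g
    connect :: g -> g -> g

-- The data type is the most direct instance: each method is a constructor.
instance GraphClass (Graph a) where
    type Vertex (Graph a) = a
    empty   = Empty
    vertex  = Vertex
    overlay = Overlay
    connect = Connect

-- A polymorphic function written against the class only (hypothetical helper).
edge :: GraphClass g => Vertex g -> Vertex g -> g
edge x y = connect (vertex x) (vertex y)
```

With the two definitions in separate modules, as proposed, the rename would not be needed and a qualified import would disambiguate them.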

Presumably, if I'm implementing cool algorithms on the data type Graph, I don't care about Algebra.Graph and don't need to include it.

Alternatively, if I'm coding polymorphic functions, I don't need Algebra.Graph.Data and don't include it.

So, even though having the same name for the type class and data type sounds potentially confusing, they would probably rarely get mixed in the same file, and so can happily live in separate modules.

Having them in different modules also allows you to always be specific through qualified imports, like Data.Graph and Class.Graph, which seems nicer than inventing names like IGraph or GraphLike or what not for the type class.


snowleopard · 2017-03-01T14:32:08Z

Oh, I just realised that putting it into Data.Graph won't work, because of type Graph = Table [Vertex] :-(

So, maybe Algebra.Graph.Data then.


ndmitchell · 2017-03-04T14:56:01Z

I prototyped a potential design here: https://gist.github.com/ndmitchell/800fb7bf65f5875a22e72b1b868e1442

snowleopard · 2017-03-04T22:19:13Z

@ndmitchell Thanks! I've just played with your code a little and realised that toGraph limits the space of inhabitants of the type class to instances that are isomorphic to directed graphs (or whatever graphs we choose to have by default according to the type class laws), which is unfortunate.

For undirected, transitive, etc. graphs, the result is not immediately unique and one would have to come up with some canonical way to implement toGraph so that, e.g. connect a b and connect b a map to the same result, which is the only sensible behaviour, I think.

And for instances like ToList this method makes little sense, even though it's easy to obtain a good (from the canonicity point of view) implementation, e.g. toGraph = vertices . toList.

What about adding a class ToGraph with a single method? I'm not a fan of having a dozen single-method classes, but I think it's important to keep the core unlimited in terms of the number of potential inhabitants.

As a more vague comment: I find toGraph a bit fishy, because mathematically it's just an id function for instances that are isomorphic to directed graphs, and it seems to exist purely because of efficiency considerations. Efficiency is important, but I'd prefer if it didn't force us to extend the core.

snowleopard · 2017-03-04T22:50:44Z

Another argument for turning ToGraph into a separate class: it's not a subclass of Graph! Its only purpose is to turn your graph representation (an edge list [(a,a)], Data.Graph.Graph, Gr () () from fgl, or anything else) into the Graph data type, so that it's possible to reuse algorithms supported by the latter.

Note: making Data.Graph.Graph an instance of type class Graph would be a disaster: since it doesn't share common subgraphs, it would be ridiculously inefficient (imagine constructing it using the edges function). But making it an instance of ToGraph is perfectly sensible!
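As a rough sketch of what a separate single-method class could look like, here is a conversion from an edge list. The names `ToVertex` and `EdgeList` are assumptions made for this example (the newtype just avoids needing FlexibleInstances for `[(a,a)]`); only the idea of a standalone `ToGraph` with a `toGraph` method comes from the discussion above.

```haskell
{-# LANGUAGE TypeFamilies #-}

data Graph a
    = Empty
    | Vertex a
    | Overlay (Graph a) (Graph a)
    | Connect (Graph a) (Graph a)
    deriving (Eq, Show)

-- Hypothetical conversion class: not a subclass of the Graph type class.
class ToGraph t where
    type ToVertex t
    toGraph :: t -> Graph (ToVertex t)

-- Hypothetical wrapper around an edge list.
newtype EdgeList a = EdgeList [(a, a)]

-- Each pair becomes a Connect of two vertices; the edges are overlaid.
instance ToGraph (EdgeList a) where
    type ToVertex (EdgeList a) = a
    toGraph (EdgeList es) =
        foldr Overlay Empty [ Connect (Vertex x) (Vertex y) | (x, y) <- es ]
```

Note that `EdgeList` needs no `empty`, `overlay` or `connect` of its own here, which is the point of keeping the conversion out of the core class.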

As a side note, we still don't have any cool algorithms on the Graph data type, which makes toGraph a bit premature -- why would we want to convert to Graph now? -- but I do share your belief that at some point it will actually become useful.

ndmitchell · 2017-03-06T22:12:53Z

I didn't intend to imply that toGraph must be invertible. If you have a bidirectional graph, and call connect a b, I'd expect toGraph to return Overlay (Connect a b) (Connect b a). I see no reason for connect a b and connect b a to return the same result. You have laws about how to reason about graphs, and can use them to say two things are equivalent.

Can you give an example of a type that can implement Graph but not toGraph?

Most of the graph algorithms you have written produce a polymorphic g as a result. That is equivalent to producing a Basic graph combined with a toGraph implementation, except that the algorithm must do at least one complete recopy. With toGraph you get to choose where the copy/convert goes, giving you more efficient code, and you can actually start to compose steps (e.g. removeVertex) into algorithms. From what I see, you've already demonstrated the need for toGraph :) It does enable efficiency, but it seems far more principled than something that exists just for efficiency: it's for conversion/compositionality.

snowleopard · 2017-03-06T23:18:11Z

> I see no reason connect a b and connect b a return the same result. You have laws about how to reason about graphs, and can use them to say two things are equivalent.

I think they should return the same result -- otherwise, they will leak implementation details. For example, at the moment we implement the Eq instance of SymmetricRelation by comparing the symmetric closures of the underlying non-symmetric relations. A trivial implementation of toGraph will therefore return different results for connect a b and connect b a. Now imagine that at some point in future we decide to change the implementation and store symmetric relations in a canonical representation where all edges (a, b) are ordered, i.e. a < b. Then suddenly toGraph will return the same result for connect a b and connect b a and some code elsewhere may break as a result.

I therefore think that the following should be a law of toGraph: if x == y then toGraph x == toGraph y. This is the only way to guarantee that implementation details don't leak.
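Here is a small sketch of the leak and of the proposed law. `Sym` is a hypothetical, simplified stand-in for SymmetricRelation (edges only, stored as given), its Eq compares a normal form rather than full symmetric closures, and Graph equality is assumed to be structural; none of this is the library's actual code.

```haskell
import Data.List (nub, sort)

data Graph a
    = Empty
    | Vertex a
    | Overlay (Graph a) (Graph a)
    | Connect (Graph a) (Graph a)
    deriving (Eq, Show)

-- Hypothetical stand-in for SymmetricRelation: keeps edges as they were given.
newtype Sym a = Sym [(a, a)]

-- Normal form: orient every edge so that the smaller endpoint comes first,
-- then sort and remove duplicates.
canonical :: Ord a => Sym a -> [(a, a)]
canonical (Sym es) = nub (sort [ (min x y, max x y) | (x, y) <- es ])

-- Two symmetric relations are equal when their normal forms coincide.
instance Ord a => Eq (Sym a) where
    x == y = canonical x == canonical y

fromEdges :: [(a, a)] -> Graph a
fromEdges es = foldr Overlay Empty [ Connect (Vertex x) (Vertex y) | (x, y) <- es ]

-- Trivial conversion: exposes the stored orientation, so it can break the law.
naiveToGraph :: Sym a -> Graph a
naiveToGraph (Sym es) = fromEdges es

-- Conversion via the normal form: x == y implies equal results.
canonicalToGraph :: Ord a => Sym a -> Graph a
canonicalToGraph = fromEdges . canonical
```

For `Sym [(1,2)]` and `Sym [(2,1)]`, which compare equal, `naiveToGraph` produces two different expressions while `canonicalToGraph` produces the same one.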

> Can you give an example of a type that can implement Graph but not toGraph?

Consider the following classic instance:

```
newtype Size a = Size { getSize :: Int }

instance Graph (Size a) where
    type Vertex (Size a) = a
    empty       = Size 0
    vertex _    = Size 1
    overlay x y = Size $ getSize x + getSize y
    connect x y = Size $ getSize x + getSize y
```

It's not inconceivable that we define something like this to calculate the size of graph expressions. You will find it tricky to implement toGraph :-) I can even write a law-abiding Eq instance for it: _ == _ = True, which is practically useless, but still proves the point.
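For completeness, a self-contained sketch of how such an instance would be used. The class declaration is reconstructed from the instance above and the expression `example` is made up for illustration, so treat both as assumptions.

```haskell
{-# LANGUAGE TypeFamilies #-}

-- Class shape assumed from the Size instance.
class Graph g where
    type Vertex g
    empty   :: g
    vertex  :: Vertex g -> g
    overlay :: g -> g -> g
    connect :: g -> g -> g

-- The vertex type is phantom: only a counter is stored.
newtype Size a = Size { getSize :: Int }

instance Graph (Size a) where
    type Vertex (Size a) = a
    empty       = Size 0
    vertex _    = Size 1
    overlay x y = Size $ getSize x + getSize y
    connect x y = Size $ getSize x + getSize y

-- A polymorphic graph expression with three vertex leaves (hypothetical).
example :: (Graph g, Vertex g ~ Int) => g
example = overlay (connect (vertex 1) (vertex 2)) (vertex 3)

-- Interpreting the expression as a Size counts its vertex leaves.
exampleSize :: Int
exampleSize = getSize (example :: Size Int)  -- 3
```

Since `Size` discards the vertices themselves, there is nothing left from which a `toGraph` could rebuild the expression.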


snowleopard · 2017-03-24T15:54:07Z

The data type Graph is now defined in the top-level module Algebra.Graph:
http://hackage.haskell.org/package/algebraic-graphs/docs/Algebra-Graph.html

The type class Graph has been demoted to Algebra.Graph.Class:
http://hackage.haskell.org/package/algebraic-graphs/docs/Algebra-Graph-Class.html

snowleopard added the design label Mar 1, 2017

snowleopard added a commit that referenced this issue Mar 2, 2017

Rename Basic to Graph, move Relation-like instances to Relation

2d796cf

See #5

ndmitchell mentioned this issue Mar 6, 2017

Optional methods for Graph type class #4

Closed

snowleopard mentioned this issue Mar 7, 2017

Foldable instance for Data.Graph #8

Closed

snowleopard added a commit that referenced this issue Mar 19, 2017

Move data type Graph to top-level Algebra.Graph module, add documenta…

fb384e7

…tion and tests See #5.

snowleopard closed this as completed Mar 24, 2017
