Use a custom array type for SetM? #928

treeowl · 2023-02-02T02:48:20Z

I noticed that the marking array we use for DFS doesn't always get unboxed (at least, it didn't in one of my lazy preorder experiments). So I tried making the Functor, Applicative, and Monad instances for SetM explicitly strict in the array. This successfully unboxed it, but unfortunately the effects on performance were somewhat mixed. I suspect the problem may be something like register pressure.

We currently use STUArray s Int Bool for SetM. STUArray is defined

data STUArray s i e = STUArray !i !i !Int (MutableByteArray# s)

In our case, that means we're passing around:

A boxed Int for the lower bound.
A boxed Int for the upper bound.
An unboxed Int# for the array size.

If we use a custom array type, we can remove indirections without so many function arguments. The most extreme version would be

data Markers s = Markers (MutableByteArray# s)

where we store the lower bound in the first word of the byte array and get the array size using getSizeofMutableByteArray#. If I read the GHC heap object layout correctly, this will put the lower bound immediately after the array size, so we'll use at most one cache line for those.

The text was updated successfully, but these errors were encountered:

meooow25 · 2023-03-03T14:46:31Z

I think we would want to store the size explicitly too, for bounds checks, since the real size would be >= the requested size.

treeowl · 2023-03-03T16:03:57Z

GHC stores the size. It needs to, for garbage collection. There's a primop to get it.

meooow25 · 2023-03-03T17:50:20Z

Yes, but consider that we want to store just one bit, but the MutableByteArray# will be the size of a word. We should ideally error on access beyond that one bit.

treeowl · 2023-03-03T18:37:02Z

I don't think we need that, do we? We have the Graph, which is a Data.Array and therefore has its size stashed. We can do the bounds check against that, no?

meooow25 · 2023-03-04T15:18:29Z

We could do that at dfs yes. If so we might as well not store the lower bound, and check that at the same place. Then we need no checks at all at the MutableByteArray#. Seems a bit extreme though.

treeowl · 2023-03-04T15:30:47Z

Yes. I guess I wasn't thinking clearly.

meooow25 mentioned this issue Mar 4, 2023

A more efficient Graph representation #936

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use a custom array type for SetM? #928

Use a custom array type for SetM? #928

treeowl commented Feb 2, 2023

meooow25 commented Mar 3, 2023

treeowl commented Mar 3, 2023

meooow25 commented Mar 3, 2023

treeowl commented Mar 3, 2023

meooow25 commented Mar 4, 2023 •

edited

treeowl commented Mar 4, 2023

Use a custom array type for SetM? #928

Use a custom array type for SetM? #928

Comments

treeowl commented Feb 2, 2023

meooow25 commented Mar 3, 2023

treeowl commented Mar 3, 2023

meooow25 commented Mar 3, 2023

treeowl commented Mar 3, 2023

meooow25 commented Mar 4, 2023 • edited

treeowl commented Mar 4, 2023

meooow25 commented Mar 4, 2023 •

edited