Doc/dps arrays #110

Divesh-Otwani · 2020-05-15T14:43:32Z

I documented DPS arrays (and did some background reading on them).

(By the way, I don't see how this implementation reduces the number of allocations; it seems like it doesn't actually help -- though I'm probably missing something. The example I made is doable with exactly one allocation of a temporary array of size n in C. This is not the case here.)

aspiwack · 2020-05-19T06:47:09Z

I don't know what your question actually is. But if what you are trying to do is compare mutable arrays and destinations, my suggestion is to see how the interaction of split and freeze (the latter being part of the allocation primitive in destinations) is different.

aspiwack · 2020-08-06T09:17:16Z

@utdemir I'll leave it to you to review this PR, too.

utdemir

It looks like I am not the best person to review this, since I do not completely understand in which cases I would use this module, instead of Vector.fromList or Pull/Push arrays.

utdemir · 2020-08-17T04:57:01Z

src/Data/Array/Destination.hs

+-- (i.e., [deforesting](https://www.sciencedirect.com/science/article/pii/030439759090147A)),
+-- be allocated, filled, passed along and de-allocated. When the allocation
+-- of these arrays is controlled by the programmer and not
+-- done by Haskell's GC (garbage collector), programs are often more efficient.


I am probably mistaken, but I don't see how just using this module makes the programs more efficient. If I'm wrong, it'd be nice to explain why here :).

After reading Arnaud's explanation, I think we should reword this last sentence with something like:

Destination arrays, makes it possible to reduce the amount of allocations
without relying on compiler optimisations or fusion rules.

utdemir · 2020-08-17T05:06:33Z

src/Data/Array/Destination.hs

+-- >
+-- >     inputVector :: IO (Vector Int)
+-- >     inputVector =
+-- >       return (fromList (map (\x -> (7 * (x+3)) `div` 11) [1..100]))


Instead of describing an IO action, I think this would be a shorter example if we were to just describe a Vector Int -> Vector Int function. eg.

computeDiff :: Vector Int -> DPS.DArray Int #-> () computeDiff = undefined vectorDiff :: Vector Int -> Vector Int vectorDiff vec = let diffSize = (Vector.length vec) - 1 in DPS.alloc diffSize (computeDiff vec)

utdemir · 2020-08-17T05:08:04Z

src/Data/Array/Destination.hs

+--
+-- Since 'DArray' doesn't have a 'Consumable' instance, the only way to
+-- consume it is with the given API (e.g., with 'fill' or perhaps
+-- 'fromFunction') which fills the destiniation array completely


Typo here on destination.

utdemir · 2020-08-17T05:09:48Z

src/Data/Array/Destination.hs

+-- Linear types are used to ensure that the destination array
+-- is always written to. Why? Well:
+--
+-- Because of linear types, any function that uses a destination array


"because of linear types" sounds a bit hand-wavy. The explanation after this sentence is clear, so maybe we should just start with that ("The only way to create ...").

aspiwack · 2020-08-25T06:21:58Z

@utdemir I believe, on the contrary, that it makes you the best person to review: the documentation is supposed to make you understand why you want to use this.

I'm available for specific questions though.

utdemir · 2020-08-25T09:12:49Z

@aspiwack I think I am missing the main point, because I still don't understand how this can be more efficient than just calling something like Vector.fromListN inside the function.

The only thing I can think of is that creating a DArray once, splitting them and filling using two different functions would be faster than combining two vectors after the fact. But the module gives the impression that it is not the only point.

aspiwack · 2020-09-03T14:19:08Z

Sorry, sorry, I forgot to reply to this one. I promise that I'm catching up with my email and stuff.

“More efficient” is a bit vague. The general way that I think about destinations is that taking a destination instead of returning an array lets you ask a new question, namely: whose responsibility is it to allocate the array. When you return an array, you have no choice: it is you who will allocate the array. Where as when you take a destination as an argument, it can be someone else.

At this point, exercise: show that from a dps function f :: a -> DArray b #-> (), you can define an array-returning function f :: a -> Array b. So DPS is indeed the more general form.

There can be many reasons for doing such a thing: destinations can be split. So I can give one part to a thread, and another part to another thread. If I used array-returning functions, both of these thread would allocate an array, and I would have to copy them into a new array on my way out. Vector depends on array-fusion to avoid such extra-allocation, but it's really not reliable (if these are really threads (e.g. async), vector fusion will not trigger). Another case where fusion will most certainly not work, is when you want to fill in a memory-mapped buffers, since it is not a normal vector thing.

Further reading:

Using DPS for memory-efficient code generation: https://www.microsoft.com/en-us/research/publication/using-destination-passing-style-compile-functional-language-efficient-low-level-code/
My Haskell Exchange 2017 talk speaks about destination quite a bit: https://skillsmatter.com/skillscasts/10637-distributed-programming-with-linear-types

Divesh-Otwani · 2020-09-29T22:04:50Z

Closing in favor of #238.

Divesh-Otwani added 2 commits May 15, 2020 14:44

DPS arrays documentation

5a6d46b

fixup! DPS arrays documentation

c552bba

Divesh-Otwani requested a review from aspiwack May 15, 2020 14:43

fixup! DPS arrays documentation

86680f7

Divesh-Otwani added 2 commits May 29, 2020 12:18

fixup! DPS arrays documentation

e7938e9

fixup! DPS arrays documentation

f4d994d

Divesh-Otwani force-pushed the doc/dps-arrays branch from 13aead2 to f4d994d Compare June 9, 2020 07:35

silky mentioned this pull request Jul 7, 2020

problems using linear-base in a new project #120

Closed

utdemir reviewed Aug 17, 2020

View reviewed changes

Divesh-Otwani mentioned this pull request Sep 4, 2020

Have a first draft of documentation covering all modules #163

Closed

Divesh-Otwani mentioned this pull request Sep 29, 2020

Documenting destination arrays #238

Merged

Divesh-Otwani closed this Sep 29, 2020

Divesh-Otwani deleted the doc/dps-arrays branch October 13, 2020 17:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Doc/dps arrays #110

Doc/dps arrays #110

Divesh-Otwani commented May 15, 2020

aspiwack commented May 19, 2020

aspiwack commented Aug 6, 2020

utdemir left a comment

utdemir Aug 17, 2020

utdemir Sep 23, 2020

utdemir Aug 17, 2020

utdemir Aug 17, 2020

utdemir Aug 17, 2020

aspiwack commented Aug 25, 2020

utdemir commented Aug 25, 2020

aspiwack commented Sep 3, 2020

Divesh-Otwani commented Sep 29, 2020

Doc/dps arrays #110

Doc/dps arrays #110

Conversation

Divesh-Otwani commented May 15, 2020

aspiwack commented May 19, 2020

aspiwack commented Aug 6, 2020

utdemir left a comment

Choose a reason for hiding this comment

utdemir Aug 17, 2020

Choose a reason for hiding this comment

utdemir Sep 23, 2020

Choose a reason for hiding this comment

utdemir Aug 17, 2020

Choose a reason for hiding this comment

utdemir Aug 17, 2020

Choose a reason for hiding this comment

utdemir Aug 17, 2020

Choose a reason for hiding this comment

aspiwack commented Aug 25, 2020

utdemir commented Aug 25, 2020

aspiwack commented Sep 3, 2020

Divesh-Otwani commented Sep 29, 2020