polyml-forkjoin

Nested fork-join parallelism in Poly/ML. Implements a standard work-stealing scheduler with a simple interface for expressing nested parallelism.

structure ForkJoin:
sig
  (** Initialize the scheduler with the given number of threads. This should
    * be called exactly once at program startup.
    *)
  val initialize: int -> unit
  val isInitialized: unit -> bool

  (** Run two functions in parallel and get their results. *)
  val par: (unit -> 'a) * (unit -> 'b) -> 'a * 'b

  (** A parallel for-loop. `parfor g (i, j) f` executes f(k) in parallel
    * for every k between i and j (inclusive i, exclusive j). The parameter
    * `g` is for granularity control: the loop is split up into approximately
    * (j-i)/g subranges each of size at most g, and each subrange is processed
    * sequentially.
    *)
  val parfor: int -> (int * int) -> (int -> unit) -> unit

  (** UNSAFE!! Allocate a new array of the given length. Intended for use
    * in high-performance libraries.
    *)
  val alloc: int -> 'a array
end

To use it in your project:

PolyML.make "ForkJoin";

Example

For example, parallel Fibonacci with granularity control (switches to sequential implementation at input size 20 and smaller).

fun sfib n =
  if n <= 1 then n else sfib (n-1) + sfib (n-2)

fun fib n =
  if n <= 20 then sfib n else
  let
    val (a, b) = ForkJoin.par (fn _ => fib (n-1), fn _ => fib (n-2))
  in
    a + b
  end

See examples/ for more.

Notes

Poly/ML doesn't appear to have first-class continuations, which can make it tricky to implement a good scheduler. This is a common problem in many languages.

In this library, I used a trick I like to call "burying the join" which I first saw in ParlayLib (a vanilla C++ library with a high-performance parallel scheduler). The idea is to let processors greedily perform other useful work while waiting to synchronize at a join point. This "buries" the join point in the processor's stack, delaying it until the processor completes one or more full tasks elsewhere. For well-parallelized programs, where the number of processors is much less than the available parallelism, this doesn't introduce any noticeable slowdown.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
ForkJoin		ForkJoin
examples		examples
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

polyml-forkjoin

Example

Notes

About

Releases

Packages

Languages

License

shwestrick/polyml-forkjoin

Folders and files

Latest commit

History

Repository files navigation

polyml-forkjoin

Example

Notes

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages