picoservice

A parallel, persistent and distributed execution model for some functional language constructs based on asynchronous buffered message passing

A single inherently parallel model of computation for local and distributed algorithms
Persistent, recoverable execution state using event logging
To be embedded in a host language, avoiding new language constructs

pico? Well, micro (obviously) and nano already have been occupied.

ingredients

Constants, immutable values and Functions. They may be

recursive (optionally tail call optimized)
partially applied
of higher order, accepting functions as arguments
a conditional
primitive, this might be on the level of +, -, > or <=, but it could be as well a call of a connector to an external data provider, expensive both in terms of time and of money charged.
a side effect

why?

It started with the quest for the smallest possible microservice, like + or ! as a service (I rather won't tell how that came about). Really, really micro. Which was also a means to understand actor systems, event sourcing and, as it turned out, different models of computation.
A single approach promised to cover 4 issues at once (parallel distributed and local computation, persistent execution state).
At the end it looked like it may be suitable for long running processes like you have them in workflow systems, just that you write your workflows in a plain programming language (with quite some annotations).

concurrent message passing

A function receives named values as messages and propagates them to subsequent functions as needed. Each function is an actor, so all off them operate concurrently.

Receiving of messages is insensitive against duplicates (idempotent), which allows to re-do operations in case of a recovery, avoiding the need of global consistency or snapshotting. This happens with a granularity of the processing of single values. Say operation A actually has processed value V already and sent a subsequent value V' to operation B in the process, but A(V) wasn't yet marked as completed before failure. In this case after recovery A will process V again and send V' to B for a 2nd time.

Message passing undermines usual function call semantics. You could have function A calling function B and B already computing a result while A hasn't received all its parameters yet (but enough for B). On the other hand primitives, usual conditionals, tail call optimized recusive functions and functions with functional parameters may need to stash parameters until the computation can proceed (until all parameters have been received to computa a primitive, until the functional value has been provided and can be applied, the condition has been computed and the chosen branch can execute, the next recursive call can execute).

If a function is a primitive and got all values necessary, it computes a result value and propagates it to the designated recipient. Any non primitive function that receives a result value propagates it to its own designated recipient of a result.

A conditional is a function with signature: if(condition, value_true_branch, value_false_branch) and comes in two flavors. One is the usual semantics where either the true- or false-branch get evaluated on the condition. The second flavour calculates all 3 elements concurrently and returns the result of one branch as soon as the condition and all necessary parameters are available.

Let's say a function C that takes 2 values (a, b) and subtracts the smaller one from the bigger one. It consists of 3 primitive functions (> and 2 times -, say -t, -f) and one if.

That's a possible sequence of events, le's say X->a->Y means X sends a to Y and let's assume a caller Γ.

1  Γ -> a -> C
2  C -> a -> if 
3  Γ -> b -> C
4  C -> b -> if
5  if -> a -> >
6  if -> b -> >
7  > -> condition -> if
8  if -> a -> -t
9  if -> a -> -f
10 if -> b -> -t
11 if -> b -> -f
12 -t -> value_true_branch -> if
13 -t -> value_false_branch -> if  
14 if -> result -> C
15 C -> result -> Γ

Note, that it could as well happen that way.

1  Γ -> b -> C
2  Γ -> a -> C
3  C -> a -> if 
4  if -> a -> >
5  if -> a -> -t
6  if -> a -> -f
7  C -> b -> if
8  if -> b -> >
9  if -> b -> -t
10 > -> condition -> if
11 -t -> value_true_branch -> if
12 if -> b -> -f
13 if -> result -> C
14 C -> result -> Γ 
15 -t -> value_false_branch -> if

algorithms

...expressed in the elements of the execution model (its byte code, kind of)

max

max(left, right) = if(left > right) left else right

A functional version of quicksort together with a higher order filter-function. Multiple quicksorts could be executed in parallel. The execution can be recovered and resumed from every point of its event log. The filter and quicksort pseudocode follow closely the actual structures, which try to keep things straight forward in terms of the chosen model.

    given predicate: arg -> boolean

    function filter(list, predicate) = {
        iff (list == []){
            []
        } else {
            let head = head(list);
            let tail = tail(list);
            let tail_filtered = filter(tail, predicate)
            iff (predicate(head))
                head :: tail_filtered
            else
                tail_filtered
        }
    }

    given lteq: (u,v) -> boolean

    function quicksort(list) = {
        iff (list == []){
            []
        } else {
            let head = head(list);
            let tail = tail(list);
            let testGt = lteq(_, head) // partially applied function
            let testLteq = lteq(_, head)  // partially applied function
            let filteredGt = filter(tail, testGt)
            let filteredLteq = filter(tail, testLteq)
            let sortedGt = quicksort(filteredGt)
            let sortedLteq = quicksort(filteredLteq)
            sortedLteq ::  head :: sortedGt
        }
    }

The trace of sorting the list (3,1,2).

Recursive calculation of simple geometrical series. geo(n) = 1 + 2 + ... + n-1 + n

geo(a) = if (a = 0)
           0
         else
         {
           let next_a = a - 1
           a + geo(next_a);
         }

A trace of the execution of geo(3) without tail call optimization.

Another tail recursive version thereof, with a less appealing execution trace.

         geo(a, cumulated) =
            if (a = 0)
              cumulated
            else
            {
              let next_a = a - 1
              let c = cumulated + next_a
              geo(next_a, c);
            }

Nested functions forming an arithmetic expression

f(a,b,c) = (a*b*c)+(a+b+c)

They're right now only run (then step-wise recovered and re-run) by the project's unit tests.

event logging/recovery

A log of events causing state change is used to restore the computation state and resume the compuation from the last operations logged. The implementation turned out to be a magnitude more messy with event sourcing built in. Perhaps just for the sake of clarity there should be a non-event-sourced version provided.

TODO

A all kind of stuff is missing, among that

a model of the implemantation that allows to derive some charasteristics and guarantees, e.g. monotonicity vs coordination where needed. As far as it is stated for the bloom language it should be spot-on.
perhaps restrict message passing to match conventional function call semantics
location transparency for function calls
remove restriction to a single functional parameter
primitives for sets and maps
mutability maybe
value de-duplication in event logs
add scatter/gather semantics at least, see this for consistency requirements
a compiler and integration to a source language. There shouldn't be a 2nd form needed for programs to get a picoservice-executed function
find a field of application, perhaps long running processes with big chunks as primitives like in a workflow system, just that you write your workflows in plan Java or whatever
travel backwards in time to the 1980ies to find hardware that is actually supportive of this kind of parallelism.

Name		Name	Last commit message	Last commit date
Latest commit History 157 Commits
.idea		.idea
img		img
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
if.png		if.png
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.idea

.idea

img

img

src

src

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

if.png

if.png

pom.xml

pom.xml

Repository files navigation

picoservice

ingredients

why?

concurrent message passing

algorithms

event logging/recovery

TODO

somewhat related

About

Releases

Packages

Languages

License

curiosag/picoservice

Folders and files

Latest commit

History

Repository files navigation

picoservice

ingredients

why?

concurrent message passing

algorithms

event logging/recovery

TODO

somewhat related

About

Resources

License

Stars

Watchers

Forks

Languages