Providing input data to Actors using a shared context #109

joachimvh · 2018-03-12T10:14:15Z

Some context: Miel wants to make a Memento actor. All HTTP calls that get executed should be sent to that actor, which then forwards the request to an actual HTTP actor after making some changes to the request. This itself is actually quite easy in Comunica: there just needs to be a Bus with this Memento actor in between the actors that request an HTTP query and those that resolve one.

The problem is that this Memento actor needs some metadata (a date) that is provided at the entry level (e.g., through the command line input). The question now is how to best make sure that date arrives at the Memento actor, preferably in such a way that it arrives together with the actual HTTP requests, so that the date is not used for requests that do not need it.

Some random ideas:

1. The one that was discussed the most: create a general (read-only) "context" object that gets passed to all actors in the call chain. The init actor that parses the command line arguments would then fill in this object. This would require a change to the input interface of the actors to also take the context as input (which is easy) and a change to how all actors currently call their mediator to make sure they pass the context object down the chain (which would be more work). A potential problem is that the init actor needs to use the same keys when writing the context object as the Memento actor will use to read out the metadata, so these need to be synced somehow.
2. (The other ideas were discussed less, or only mentioned.) Have some sort of shared memory that all actors can access. In this case the context object would be stored there instead of being passed to all the actors. One problem here is that the metadata doesn't arrive at the same time as the query. It might also decrease the independence of all actors?
3. Have some sort of Bus on which this context gets sent separately. That way all actors that are interested can subscribe to this Bus. This also has the disadvantage of not having the metadata arrive at the same time as the query.
4. ... (I'm sure there are still many potential solutions, like overloading the run function of the HTTP actors and other weird stuff.)
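To make the first idea more concrete, here is a minimal sketch of a read-only context that is filled in once at the entry point and handed down the call chain. Every name in it (`Context`, `buildContext`, `dereference`, the `--datetime` flag and the `memento:datetime` key) is an assumption for illustration, not existing Comunica API.

```typescript
// Hypothetical names throughout; this is not Comunica's actual API.
type Context = Readonly<Record<string, unknown>>;

interface Action {
  context?: Context;
}

interface HttpAction extends Action {
  url: string;
  headers: Record<string, string>;
}

// Assumed key; the init actor and the Memento actor must agree on it.
const KEY_DATETIME = 'memento:datetime';

// Stand-in for the init actor: turns CLI-style arguments into a frozen context.
function buildContext(args: string[]): Context {
  const index = args.indexOf('--datetime');
  const entries: Record<string, unknown> = {};
  if (index >= 0 && index + 1 < args.length) {
    entries[KEY_DATETIME] = args[index + 1];
  }
  return Object.freeze(entries);
}

// Stand-in for an intermediate actor: it does not read the context,
// but it still has to forward it to the next action.
function dereference(url: string, context?: Context): HttpAction {
  return { url, headers: {}, context };
}

const context = buildContext(['--datetime', '2018-03-12T10:14:15Z']);
const httpAction = dereference('http://example.org/data', context);
console.log(httpAction.context?.[KEY_DATETIME]);
```

The sketch also shows the cost mentioned above: `dereference` has no use for the context itself, yet it must accept and forward it so that the date arrives together with the HTTP request.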

rubensworks · 2018-03-12T10:17:32Z

Option 2 reminds me of the blackboard pattern, which is also often used in cases like these, so we could probably learn some things from that.

rubensworks · 2018-03-12T10:20:40Z

It may also be worth mentioning the other option of using the query operation context for the most part, and creating/adapting the deeper actor action interfaces that exist between the QPF actor and the HTTP actor. I'll look into this option a bit more in any case; it may be simpler than it sounds.

RubenVerborgh · 2018-03-12T11:11:27Z

The context might conceptually solve it, but what obligation is there for actors to look at the context? For instance, it could be cheaper to ignore the context, so actors would explicitly need to acknowledge that they reckon with it.

I'm thinking of a kind of decorator pattern, where you have a Memento actor as follows:

1. receives a job X with constraint "datetime = t"
2. puts the job X back on the bus without the constraint, asking for a solution
3. some actor Y takes on job X, resulting in n subtasks
4. the Memento actor sticks the constraint "datetime = t" on all of the subtasks

In any case, I think this mechanism is crucial and deserves a brainstorm meeting.

joachimvh · 2018-03-12T12:11:48Z

The part where Memento has access to the subtasks would be hard with the current framework: there is no way for it to know what is going on deeper down the pipeline until it gets a result back. For Memento it is not required though; the idea here would be:

1. receives a job X with constraint "datetime = t"
2. puts the job X back on the bus without the constraint, asking for a solution
3. some actor Y takes on job X, and returns the solution (being the HTTP result specifically here)
4. the Memento actor does whatever it needs to do with that result (which could be another call to actor Y).

But the main problem of the issue is how to do step 1, that is: how to get that constraint to the Memento actor.

RubenVerborgh · 2018-03-12T19:19:22Z

I would expect step 1 to be really simple, actually. Something like ConstrainedProblem { origProblem, constraints: [ DateTimeConstraint {} ] }.
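As a rough illustration, that shape could be typed as follows. Only `ConstrainedProblem`, `origProblem`, `constraints` and `DateTimeConstraint` come from the comment above; the `kind` discriminator and the `takeConstraints` helper are assumptions added to show how an actor could split off its own constraint before putting the job back on the bus.

```typescript
// Illustrative sketch; field names beyond those in the comment are assumptions.
interface Constraint {
  kind: string;
  value: unknown;
}

interface DateTimeConstraint extends Constraint {
  kind: 'datetime';
  value: Date;
}

interface ConstrainedProblem<P> {
  origProblem: P;
  constraints: Constraint[];
}

// Splits off all constraints of one kind and returns them together with
// the problem that remains, which can then be put back on the bus.
function takeConstraints<P>(
  problem: ConstrainedProblem<P>,
  kind: string,
): { taken: Constraint[]; rest: ConstrainedProblem<P> } {
  const taken = problem.constraints.filter(c => c.kind === kind);
  const remaining = problem.constraints.filter(c => c.kind !== kind);
  return { taken, rest: { origProblem: problem.origProblem, constraints: remaining } };
}

const datetime: DateTimeConstraint = {
  kind: 'datetime',
  value: new Date('2018-03-12T10:14:15Z'),
};
const job: ConstrainedProblem<{ url: string }> = {
  origProblem: { url: 'http://example.org/data' },
  constraints: [datetime],
};

const { taken, rest } = takeConstraints(job, 'datetime');
console.log(taken.length, rest.constraints.length);
```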

rubensworks · 2018-03-12T23:30:57Z

@RubenVerborgh The constraint approach sounds interesting, but it would probably require refactoring all actors and interfaces to make the actions constrainable actions. So we have to make sure about this before we do it. Also, it would at least require some additional checking mechanism in all actors. (but as @joachimvh has mentioned, it does not fully solve the problem of passing down information to deeper actors in the tree)

For reference, the only interfaces that need to be changed if we just want to pass down the current context deeper are IActionRdfDereferencePaged and IActionRdfDereference.
Instead of both taking a single URL, they would then also take an optional context, just like the query operation actors.
I think this would probably be the easiest and most consistent way of solving this.

joachimvh · 2018-03-13T08:03:18Z

The problem might arise in the future if there are new actors that also need some sort of input from the root level. Only changing the actors that need to change sort of decreases the independence, since IActionRdfDereferencePaged now needs to contain a field just because it might be needed further down the road. So I do think that if we do it by passing a context object, it should be changed at the level of the abstract class so all actors have to support context.

RubenVerborgh · 2018-03-13T12:40:31Z

Indeed; I think we should solve the general problem, not just the Memento case.

mielvds · 2018-03-13T14:17:31Z

Don't let the (big) effort of changing the code hold you back, it's just going to become more work in the long run :)

rubensworks · 2018-03-13T23:29:29Z

Alright, here are three possible architectures for the constraints. (Note that the query operation context could also be removed in favor of these)

Constrainable Actions

This changes the parameter of run(IAction) and test(IAction) to run(IConstrainableAction) and test(IConstrainableAction).

interface IConstrainableAction {
  // Throws UnsatisfiedConstraintException if the checker rejects a constraint.
  getAction(checker: (constraint: IConstraint) => boolean): IAction;
}
type IConstraint = { type: string; value: any };

The callback would be used to iterate over all constraints (possibly none) in this IConstrainableAction, and throw an exception if one of them is not satisfied. Otherwise, the action is returned.

This will make it so that all actors now will be forced to check the constraints before they can read the actual action.
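A possible implementation of this wrapper, as a minimal sketch: the names `IConstraint`, `IAction` and `UnsatisfiedConstraintException` mirror the proposal, while the class `ConstrainableAction`, its constructor and the example values are assumptions.

```typescript
// Sketch only: names mirror the proposal, the bodies are assumptions.
type IConstraint = { type: string; value: any };
interface IAction {}

class UnsatisfiedConstraintException extends Error {
  constructor(public readonly constraint: IConstraint) {
    super(`Unsatisfied constraint: ${constraint.type}`);
  }
}

class ConstrainableAction<A extends IAction> {
  constructor(
    private readonly action: A,
    private readonly constraints: IConstraint[] = [],
  ) {}

  // Returns the wrapped action only if the checker accepts every constraint.
  getAction(checker: (constraint: IConstraint) => boolean): A {
    for (const constraint of this.constraints) {
      if (!checker(constraint)) {
        throw new UnsatisfiedConstraintException(constraint);
      }
    }
    return this.action;
  }
}

const wrapped = new ConstrainableAction(
  { url: 'http://example.org/data' },
  [{ type: 'memento:datetime', value: '2018-03-12T10:14:15Z' }],
);

// An actor that understands the constraint gets the action.
const accepted = wrapped.getAction(c => c.type === 'memento:datetime');

// An actor that accepts no constraints at all is refused the action.
let rejected = false;
try {
  wrapped.getAction(() => false);
} catch (error) {
  rejected = error instanceof Error;
}
console.log(accepted.url, rejected);
```

Because the action is private, the only way for an actor to reach it is through `getAction`, which is what makes the check unavoidable in this design.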

Pros

Constraint checking is forced.

Cons

Will require a lot of refactoring.
No way of doing partial constraint checking and passing down the other constraints.

Partial Constrainable Actions

This also changes the parameter of run(IAction) and test(IAction) to run(IPartialConstrainableAction) and test(IPartialConstrainableAction).

interface IPartialConstrainableAction {
  getPartial(partialChecker: (constraint: IConstraint) => boolean): IPartialConstrainableAction;
  // Throws UnsatisfiedConstraintException if the checker rejects a constraint.
  getAction(checker: (constraint: IConstraint) => boolean): IAction;
}
type IConstraint = { type: string; value: any };

The partialChecker callback would be used to iterate over all constraints (possibly none). All constraints that were not satisfied will be included in the returned action as well.
The other method is the same as in IConstrainableAction, and should be used in the final actors that will not pass on the action further down anymore.

This will make it so that all actors now will be forced to check the constraints before they can read the actual action, and they can do partial constraint checking.
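The following sketch shows how the two methods could work together. The class `PartialConstrainableAction`, the `example:language` constraint and the use of a plain `Error` instead of `UnsatisfiedConstraintException` are simplifying assumptions.

```typescript
// Sketch only: names mirror the proposal, the bodies are assumptions.
type IConstraint = { type: string; value: any };
interface IAction {}

class PartialConstrainableAction<A extends IAction> {
  constructor(
    private readonly action: A,
    private readonly constraints: IConstraint[] = [],
  ) {}

  // Keeps only the constraints that the partial checker did not satisfy.
  getPartial(partialChecker: (constraint: IConstraint) => boolean): PartialConstrainableAction<A> {
    const remaining = this.constraints.filter(c => !partialChecker(c));
    return new PartialConstrainableAction(this.action, remaining);
  }

  // Final step: every remaining constraint must be satisfied.
  getAction(checker: (constraint: IConstraint) => boolean): A {
    const failed = this.constraints.filter(c => !checker(c));
    if (failed.length > 0) {
      // Plain Error used here for brevity.
      throw new Error(`Unsatisfied constraint: ${failed[0].type}`);
    }
    return this.action;
  }
}

const incoming = new PartialConstrainableAction(
  { url: 'http://example.org/data' },
  [
    { type: 'memento:datetime', value: '2018-03-12T10:14:15Z' },
    { type: 'example:language', value: 'en' },
  ],
);

// A Memento-like actor handles the datetime and passes the rest down.
const passedDown = incoming.getPartial(c => c.type === 'memento:datetime');

// A final actor that understands the language constraint gets the action.
const finalAction = passedDown.getAction(c => c.type === 'example:language');
console.log(finalAction.url);
```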

Pros

Constraint checking is forced.
Compatible with partial constraint checking, as the remaining constraints can be passed down.

Cons

Will require a lot of refactoring.

Constraints Field

This requires no changes to the run and test methods. It only requires an additional field in the IAction interface (which is currently empty).

interface IAction {
  constraints: IConstraint[];
}

In this case, the actors are responsible for checking the constraints themselves.
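With this option, a check could look like the sketch below. The helper `supportsAll` and the `IActionHttp` interface are assumptions; the idea is that an actor's test step calls such a helper with the constraint types it knows how to handle.

```typescript
// Sketch only: supportsAll and IActionHttp are hypothetical.
type IConstraint = { type: string; value: any };

interface IAction {
  constraints: IConstraint[];
}

interface IActionHttp extends IAction {
  url: string;
}

// An actor passes only if it knows how to handle every constraint type.
function supportsAll(action: IAction, supportedTypes: string[]): boolean {
  return action.constraints.every(c => supportedTypes.indexOf(c.type) >= 0);
}

const request: IActionHttp = {
  url: 'http://example.org/data',
  constraints: [{ type: 'memento:datetime', value: '2018-03-12T10:14:15Z' }],
};

// A plain HTTP actor knows no constraints, so it should decline this action.
const plainHttpCanRun = supportsAll(request, []);
// A Memento actor knows the datetime constraint, so it can accept it.
const mementoCanRun = supportsAll(request, ['memento:datetime']);
console.log(plainHttpCanRun, mementoCanRun);
```

Nothing forces an actor to call the helper, which is exactly the weakness of this option.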

Pros

Simpler and less invasive changes
Compatible with partial constraint checking, as the remaining constraints can be passed down.

Cons

As the responsibility now lies in the actor itself, developers could forget to check the constraints, which would lead to bugs.

Conclusion

I think Partial Constrainable Actions is the best long-term solution of these 3.

rubensworks · 2018-03-14T06:22:54Z

I think we'll still need a plain context object as well (possibly added to IAction as well) for things that are not a constraint, but merely optional data.

A logger (#114) is one example of something that needs to be passed down, but is not a constraint.

joachimvh · 2018-03-14T09:30:24Z

I'm slightly confused now. Is this now to solve the problem of getting the input context to the Memento actor, or to do the restraint thingy Ruben V mentioned? And where would these constraints come from and be defined?

Would it be possible to show in a small example how this would be applied to the Memento problem for example?

rubensworks · 2018-03-14T23:50:07Z

Is this now to solve the problem of getting the input context to the Memento actor, or to do the restraint thingy Ruben V mentioned?

Both :-) As I see it, the Memento use case could be solved using constraints. But constraints are not sufficient for all cases (such as logging), for which a context would still be required. These constraints would be defined by the engine caller (as the root call will also be a constrainable context).

For the Memento use case, this could look like this:

actorInitSparql.run(new ConstrainableAction({ query: 'SELECT ...' }, [ { type: 'memento:datetime', value: '...' } ]));

The Memento actor would then be subscribed to the HTTP bus, where it accepts actions with the memento:datetime constraint and transforms each of them into an action on the bus in which this datetime has become an additional HTTP header.
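The transformation described here could be sketched as follows. The function `applyMemento` and the `IActionHttp` shape are assumptions; the `Accept-Datetime` header is the request header that the Memento protocol (RFC 7089) uses for datetime negotiation.

```typescript
// Sketch only: applyMemento and IActionHttp are hypothetical.
type IConstraint = { type: string; value: any };

interface IActionHttp {
  url: string;
  headers: Record<string, string>;
  constraints: IConstraint[];
}

const MEMENTO_DATETIME = 'memento:datetime';

// Consumes the datetime constraint and returns a plain HTTP action
// that carries the datetime as an Accept-Datetime header instead.
function applyMemento(action: IActionHttp): IActionHttp {
  const datetime = action.constraints.filter(c => c.type === MEMENTO_DATETIME)[0];
  if (!datetime) {
    return action; // Nothing to do for unconstrained requests.
  }
  return {
    url: action.url,
    headers: {
      ...action.headers,
      'Accept-Datetime': new Date(datetime.value).toUTCString(),
    },
    constraints: action.constraints.filter(c => c.type !== MEMENTO_DATETIME),
  };
}

const constrained: IActionHttp = {
  url: 'http://example.org/data',
  headers: { accept: 'text/turtle' },
  constraints: [{ type: MEMENTO_DATETIME, value: '2018-03-12T10:14:15Z' }],
};

const plain = applyMemento(constrained);
console.log(plain.headers['Accept-Datetime'], plain.constraints.length);
```

The returned action no longer carries the constraint, so it can be put back on the HTTP bus and picked up by a regular HTTP actor.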

joachimvh · 2018-03-15T09:59:39Z

It's the getAction function that is throwing me off. So a ConstrainableAction contains an Action and a list of constraints (key/value pairs), right? But what should an Actor provide as input for the getAction function? I assume most of them would just provide null as an input since they have no restrictions on the input? And Memento would then call input.getAction((constraint) => constraint['memento:datetime'])? I just don't see how this is different from passing a context (or list of constraints) and checking if the required key in the context is present when it is required (i.e., Memento checks if its key is in there, and if it is, it adds the new header), which is what the Constraints Field option is.

So I guess I just don't really understand yet how the getAction function is supposed to work and who is supposed to call it with what parameters, which might be clear from all my sentences ending with a question mark. :P

rubensworks · 2018-03-16T00:17:45Z

Sure, the actor could still implement the checker in an incorrect/unsafe way, which would make it similar to the context.

The alternative would be to move this checking behaviour to the Actor class, but even then, stuff might be implemented wrongly, there's probably no way around that.

Perhaps it's best to not complicate things too much (as people can break it anyways if they really want to), and just add a context and constraints field to the IAction interface, and explain in jsdoc how they should be used.

mielvds · 2018-03-27T08:46:02Z

@joachimvh @rubensworks any resolution for this? Not having Memento would break compatibility with the archives, which is not problematic, but a pity.

joachimvh · 2018-03-27T08:48:23Z

I discussed this with @rubensworks in our last meeting. We'll probably look into some way to pass a context-like thing down the pipeline (which is doable since all the query actors already do this). But that will be for after the release.

rubensworks · 2018-06-13T13:27:56Z

Note to self: for this context we'll need a proper namespacing strategy to avoid entry name conflicts for different usages. For instance, we could require the bus/actor package name as a prefix.
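One way such a prefix convention could look, as a sketch: the helper `contextKey` and both package names are made up for illustration.

```typescript
// Sketch only: contextKey and the package names are hypothetical.
function contextKey(packageName: string, entry: string): string {
  return `${packageName}:${entry}`;
}

// Two packages that both want an entry called "datetime".
const KEY_MEMENTO_DATETIME = contextKey('@example/actor-http-memento', 'datetime');
const KEY_LOGGER_DATETIME = contextKey('@example/actor-logger', 'datetime');

const namespacedContext: Readonly<Record<string, unknown>> = Object.freeze({
  [KEY_MEMENTO_DATETIME]: new Date('2018-03-12T10:14:15Z'),
  [KEY_LOGGER_DATETIME]: 'iso',
});

// Thanks to the prefixes, the two entries do not overwrite each other.
console.log(Object.keys(namespacedContext).length);
```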

mielvds · 2018-07-11T08:17:47Z

@rubensworks what's the status on this?

rubensworks · 2018-07-11T15:07:45Z

This is the third issue on the dev 1.2.0 project board, so within the next couple of weeks I guess. Unless other major maintenance issues would arise.

joachimvh added the feature ➕ label Mar 12, 2018

joachimvh assigned rubensworks and RubenVerborgh Mar 12, 2018

joachimvh added this to To Do in Development 1.0.0 via automation Mar 12, 2018

joachimvh assigned mielvds and joachimvh Mar 12, 2018

rubensworks mentioned this issue Mar 14, 2018

Logging #114

Closed

rubensworks removed this from To Do in Development 1.0.0 Mar 14, 2018

rubensworks added this to To Do in Development Major Mar 14, 2018

rubensworks added this to To do in Development 1.2.0 via automation Jun 13, 2018

rubensworks removed this from To Do in Development Major Jun 13, 2018

rubensworks mentioned this issue Jun 13, 2018

Memento support #79

Closed

rubensworks added the difficulty:high label Jun 13, 2018

rubensworks changed the title ~~Providing input data to Actors~~ Providing input data to Actors using a shared context Jun 13, 2018

rubensworks assigned rubensworks and unassigned rubensworks, RubenVerborgh, mielvds and joachimvh Jun 13, 2018

rubensworks added a commit that referenced this issue Jul 24, 2018

Add ActionContext in IAction

59349a8

This is an abstraction of the query operation context, and allows data to be passed to any actor. Closes #109

rubensworks mentioned this issue Jul 24, 2018

Feature/action context #183

Merged

rubensworks added a commit that referenced this issue Jul 26, 2018

Add ActionContext in IAction

a9225b6

This is an abstraction of the query operation context, and allows data to be passed to any actor. Closes #109

rubensworks added a commit that referenced this issue Jul 27, 2018

Add ActionContext in IAction

427ed01

This is an abstraction of the query operation context, and allows data to be passed to any actor. Closes #109

rubensworks closed this as completed in f2183f9 Jul 27, 2018

Development 1.2.0 automation moved this from To do to Done Jul 27, 2018
