Decide on terminology #44

fintelia · 2017-11-11T19:15:39Z

As we move to distributed fault tolerant Soup, the terms we're using have become increasingly ambiguous and overloaded. I don't love all these names, but we should have terminology for all of the following concepts:

domain: the collection of all shards and replicas that do computation for the same nodes in the graph.
replica set: the part of a domain that manages a single shard.
replica: single running instantiation of the nodes in a domain.

And for these higher level components...

controller: component that coordinates control plane operations. Runs in the same address space as one worker, but typically interacts with more over the network.
controller stub: component that participates in leader election. Becomes a controller if it wins.
worker: component manages data plane operations
worker thread pool: the collection of threads on a worker that do actual processing
worker thread: single thread in the worker thread pool.

We should also probably have distinct DomainIndex, ReplicaSetIndex, and ReplicaIndex types as well as WorkerIndexs to identify individual processes. Depending on how they're implemented, worker indexes may have to be randomly generated to avoid collisions.

The text was updated successfully, but these errors were encountered:

ms705 · 2017-11-11T23:08:25Z

Good idea! In this taxonomy, a worker thread processes exactly one replica at a time, before returning to the pool, correct?

I think the conventional solution for WorkerIndex-like structures is to use UUIDs/GUIDs (to be able to tell apart workers returning from transient failures and newly started ones).

jonhoo · 2017-11-12T17:38:38Z

replica: single running instantiation of the nodes in a domain.

I assume this should say "in one shard of a domain"?

worker: component manages data plane operations

"A process that executes one or more replicas"?

jonhoo · 2017-11-13T19:36:14Z

I kind of want to call worker thread something else, because the simplification "thread" has so many meanings. Also, I can't use mod thread and also use std::thread :( "Processor"? "Executor"? " Other names?

ms705 · 2017-11-13T19:41:04Z

"Processor" is a term already widely used in the data-flow literature for this purpose (albeit typically statically bound to a vertex). It doesn't seem too wild to say "a processor from the processor pool executes pending work at a replica".

jonhoo · 2017-11-13T20:03:41Z

Yeah.. The downside of using "processor" is that CPUs are processors. The alternative @fintelia and I just discussed was instead using "worker" here, and then using a different word for what we now call "worker". "Host" maybe?

ms705 · 2017-11-13T20:50:17Z

"Host" seems suboptimal since you might run >1 Soup daemon/process per host; "process" or "instance" is perhaps more fitting. Or we just go back to the Soup theme and call the daemon process a "bowl" ;-)

Probably none of this matters very much, terms can change ;-)

fintelia · 2017-11-15T00:11:15Z

I like "instance" since it doesn't have an existing well defined meaning and isn't tied to the Soup theme (and thus unsuitable for use in papers, etc)

jonhoo · 2017-11-15T00:26:16Z

So each instance has a worker pool of workers that execute the replicas assigned to that instance? And a replica is one copy (well, replica) of one shard of a domain?

jonhoo · 2017-11-29T18:32:34Z

a6a14cf uses the word "Souplet" for instance, just until we settle on this. Instance still feels too generic. The commit only somewhat commits (heh) to this, in the sense that it doesn't also use that name to refer to the WorkerPool that is spawned during local operation, but I think Souplet applies equally there.

ms705 closed this as completed Jun 29, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Decide on terminology #44

Decide on terminology #44

fintelia commented Nov 11, 2017

ms705 commented Nov 11, 2017 •

edited

jonhoo commented Nov 12, 2017

jonhoo commented Nov 13, 2017

ms705 commented Nov 13, 2017

jonhoo commented Nov 13, 2017

ms705 commented Nov 13, 2017

fintelia commented Nov 15, 2017 •

edited

jonhoo commented Nov 15, 2017

jonhoo commented Nov 29, 2017

Decide on terminology #44

Decide on terminology #44

Comments

fintelia commented Nov 11, 2017

ms705 commented Nov 11, 2017 • edited

jonhoo commented Nov 12, 2017

jonhoo commented Nov 13, 2017

ms705 commented Nov 13, 2017

jonhoo commented Nov 13, 2017

ms705 commented Nov 13, 2017

fintelia commented Nov 15, 2017 • edited

jonhoo commented Nov 15, 2017

jonhoo commented Nov 29, 2017

ms705 commented Nov 11, 2017 •

edited

fintelia commented Nov 15, 2017 •

edited