This repository has been archived by the owner on Nov 15, 2023. It is now read-only.

Runtime worker threads #1459

Closed
gavofyork opened this issue Jan 16, 2019 · 17 comments · Fixed by #7089

@gavofyork
Member

gavofyork commented Jan 16, 2019

The runtime is currently single threaded, which means all transactions/extrinsics must be executed in series. This is rather suboptimal. Instead, worker "threads" should be allowed in the runtime environment.

Since this is a consensus computation and the need for determinism is absolute, there should be a number of restrictions on worker threads:

  • They should be pure functions. This implies that as long as there is at least one worker thread, storage may not be altered (though it may be read). Furthermore, there should be no other kind of data-sharing between threads, even behind Sync safeguards like Mutex or Atomic. Separate memory slabs should be used if needed.

  • Worker threads may only be created from the main thread and have a specific identifier that is deterministic based upon the order in which they were created by the main thread.

If storage is written to while there is at least one worker thread, then there are two options, one of which we should adopt:

  • panic; or
  • have set_storage buffer the alteration in a (thread-local) hash map, which ultimately takes effect once all worker threads are finished and in the order that worker threads were created (and thus deterministic).
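The second option could be sketched roughly as follows. This is only an illustration, not Substrate's actual externalities API; the `WorkerBuffer` type and function names are hypothetical:

```rust
use std::collections::BTreeMap;

type Key = Vec<u8>;
type Value = Vec<u8>;

// Hypothetical per-worker write buffer: a worker records its writes here
// while reading only the state as it was when the workers were spawned.
#[derive(Default)]
struct WorkerBuffer {
    writes: BTreeMap<Key, Value>,
}

impl WorkerBuffer {
    // Stands in for set_storage inside a worker: buffer, don't apply.
    fn set_storage(&mut self, key: Key, value: Value) {
        self.writes.insert(key, value);
    }
}

// Once all workers have finished, apply the buffers in worker-creation
// order (the vector index), so the final state is deterministic: on a
// key collision the later-created worker wins.
fn merge_in_creation_order(state: &mut BTreeMap<Key, Value>, buffers: Vec<WorkerBuffer>) {
    for buf in buffers {
        for (k, v) in buf.writes {
            state.insert(k, v);
        }
    }
}
```

Since the merge order depends only on creation order, every node re-executing the block arrives at the same state, regardless of how the OS actually scheduled the workers.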

Once this exists, a new issue should be created to introduce extrinsic parallelism whereby the block author prefixes the extrinsic set with an extrinsic to specify a dependency graph stating which extrinsics are mutually independent and can be executed in parallel. The runtime executive can then use this information together with a number of worker threads in order to safely parallelise extrinsic queue execution.

@gavofyork gavofyork added this to the 1.x series milestone Jan 16, 2019
@gavofyork gavofyork added J0-enhancement An additional feature request. Z5-epic Can only be fixed by John Skeet. labels Jan 16, 2019
@Demi-Marie
Contributor

Can we simply not expose the storage-writing functions to the worker, and require that the worker have a separate wasm blob à la WebWorkers? If we did that, attempts to write to storage from a worker would be caught at load time.

@kianenigma
Contributor

gavofyork assigned kianenigma 11 days ago

Sweet; as soon as I am back (Jan): finish offchain phragmen and get started with this. Looking forward 👍

@kianenigma
Contributor

Some additional notes from the latest company retreat, in which we briefly discussed this:

[image: IMG_9348]

  • A serialization approach should be adopted from academia or other resources, or developed by us, to determine which transactions within a block can be executed with no write conflicts. This will most likely need to be a function of the write-list of all transactions. I think this is a good start: https://link.springer.com/chapter/10.1007/978-3-319-96893-3_32

The memory-model guarantee that we can promise in the image above is the same as the one described in https://www.cs.utexas.edu/~bornholt/post/memory-models.html. B and C can only assume that they are operating on a state built on top of A, not anything about which of them will be executed first. Note that in the image, B and C are both marked in group 2, meaning that they are supposedly non-conflicting.

  • A validator is not to be trusted with a serialization that they claim to be non-conflicting. Everyone should be able to verify this.

@burdges

burdges commented Dec 16, 2019

I wrote these notes from our discussion at the coding retreat:

We discussed 2.5 possible memory models for parallel transaction evaluation at Gavin's session on parallelization today.

Gavin's original suggestion:

  • All transaction execution threads have their own state write cache,
  • All parallel transaction execution threads read state freely from a (more) global state,
  • We rejoin the threads together by merely ensuring no conflicts occur among their state write caches. Any conflict makes the block invalid.

We believe this approach corresponds to one of the weaker eventual consistency memory models used in multi-threaded execution. https://www.cs.utexas.edu/~bornholt/post/memory-models.html

In particular, you could easily make smart contracts whose sequential semantics becomes insecure under this memory model.
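The rejoin step of this weaker model could be sketched as follows (hypothetical types and names, not actual runtime code): each thread returns its write cache, and the block is valid only if all write caches are pairwise disjoint.

```rust
use std::collections::BTreeMap;

type Key = Vec<u8>;
type Value = Vec<u8>;

// Join the per-thread write caches under the weaker model: any key
// written by two threads is a conflict, which makes the block invalid.
fn join_write_caches(
    state: &mut BTreeMap<Key, Value>,
    caches: Vec<BTreeMap<Key, Value>>,
) -> Result<(), Key> {
    let mut merged: BTreeMap<Key, Value> = BTreeMap::new();
    for cache in caches {
        for (k, v) in cache {
            if merged.insert(k.clone(), v).is_some() {
                return Err(k); // conflict => invalid block
            }
        }
    }
    // No conflicts: all writes take effect. The application order is
    // irrelevant because the write sets are disjoint.
    state.extend(merged);
    Ok(())
}
```

Note that nothing here constrains what the threads *read*, which is exactly why this model is weak: a thread may have read a value that another thread wrote, without any conflict being detected.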

Jeff proposed we maintain full sequential consistency:

  • All transaction execution threads have both their own state write cache along with a state read subcache, but cannot access the global state without updating their read cache.
  • Any writes then replace entries from the read cache, so maybe you'd implement the read cache with flags on the write cache.
  • We rejoin finished threads by first merging all the read caches, second merging the write caches while testing them for disjointness among themselves, and third testing for disjointness between the merged read and write caches. Any non-disjointness results in an invalid block; for a valid block we simply discard the read cache.

We think this should be equivalent to the more "rust-like" model in which the spawn thread commands reserve portions of the state tree as mutable or immutable. At first blush, I'd think this reservation model runs faster but increases block size unacceptably.
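Under the same hypothetical-types caveat, this sequentially consistent join might look like the following sketch; a thread's own writes shadow its earlier reads of the same key, matching the "writes replace entries from the read cache" step above:

```rust
use std::collections::{BTreeMap, BTreeSet};

type Key = Vec<u8>;
type Value = Vec<u8>;

// Per-thread caches: keys read from the global state, plus buffered writes.
struct ThreadCaches {
    reads: BTreeSet<Key>,
    writes: BTreeMap<Key, Value>,
}

// Rejoin: (1) merge the read caches (a thread's own writes shadow its
// reads of the same key), (2) merge the write caches, testing them for
// disjointness, (3) test the merged reads against the merged writes.
// Any overlap makes the block invalid; on success the read caches are
// simply discarded.
fn join_threads(
    state: &mut BTreeMap<Key, Value>,
    threads: Vec<ThreadCaches>,
) -> Result<(), Key> {
    let mut merged_reads = BTreeSet::new();
    let mut merged_writes: BTreeMap<Key, Value> = BTreeMap::new();
    for t in threads {
        for k in t.reads {
            if !t.writes.contains_key(&k) {
                merged_reads.insert(k);
            }
        }
        for (k, v) in t.writes {
            if merged_writes.insert(k.clone(), v).is_some() {
                return Err(k); // two threads wrote the same key
            }
        }
    }
    if let Some(k) = merged_writes.keys().find(|k| merged_reads.contains(*k)) {
        return Err(k.clone()); // one thread read what another wrote
    }
    state.extend(merged_writes);
    Ok(())
}
```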

We should support both these memory models in the code, and make smart contract platforms like ink use the full sequential consistency model, but attempt to prove much of the relay chain itself can run in the weaker model.

  • Set code conflicts with everything, of course.
  • Alistair thinks ICMP works fine in the weaker model, but maybe the ICMP writeup should address this.
  • Individual SPREE modules might or might not work in the weaker model, but we could likely show that some simple one for DOT accounting works in the weaker model.
  • We might address other relay chain transaction types as part of some future discussion about modularization of the relay chain.

After we have some designs like ICMP worked out for the weaker model, we should use good auditors, like maybe Ralf Jung from the RustBelt project.

@Demi-Marie
Contributor

@burdges what does ICMP stand for? I only know of Internet Control Message Protocol, but that makes no sense here.

@kianenigma
Contributor

kianenigma commented Dec 28, 2019

@burdges what does ICMP stand for? I only know of Internet Control Message Protocol, but that makes no sense here.

inter chain message passing, afaik

@burdges

burdges commented Dec 28, 2019

We've renamed it to XCMP for precisely that reason.

@kianenigma
Contributor

have set_storage buffer the alteration in a (thread-local) hash map, which ultimately takes effect once all worker threads are finished and in the order that worker threads were created (and thus deterministic).

As discussed, this changes the memory model. And more importantly, wouldn't it be a huge problem for Kusama? Previous blocks that were executed with sequential consistency might lead to different outcomes once re-executed with runtime workers.

@bkchr
Member

bkchr commented Mar 1, 2020

How is set_storage related to the memory model?

How should it affect Kusama? Even if this were to work out of the box (without changing the runtime, which I don't expect), why should the storage root be different? If the storage root were different, it would mean that multiple transactions modified the same storage location. That would be a violation anyway.

I have some further questions:
Will you start with block production? Block production will inherently always be slower than import, so optimizing import probably isn't the first priority?

@kianenigma
Contributor

kianenigma commented Mar 1, 2020

How is set_storage related to the memory model?

If you buffer, you basically go from the sequential consistency (SC) model to a TSO-like model. After a write, your own thread has access to the updated version whilst others will only ever see the initial state of the block, regardless of any schedule. The link above is a good read about it.

How should it affect kusama? Even if this would be work out of the box (without changing the runtime, what I don't expect), why should the storage root be different? If the storage root would be different, it would mean that there were multiple transactions modifying the same memory position.

You can get a different root even if there is just a read-write conflict.

In a previous block, key k is initially 0 and k' is 2:
t1 writes 1 to k.
t2 reads from k and writes it to k'.
in that particular block, the block author chose to do t1 -> t2. k' is 1

if you re-execute the same block with a runtime worker, k' is 0 since the writes are buffered and then applied in the order of runtime threads.
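The example above can be made concrete with a toy sketch (`k2` stands in for k'; the two functions simulate the two execution strategies, nothing here is runtime code):

```rust
use std::collections::BTreeMap;

// Sequential schedule chosen by the author: t1 before t2, and t2
// observes t1's write.
fn sequential() -> i32 {
    let mut state = BTreeMap::from([("k", 0), ("k2", 2)]);
    state.insert("k", 1);              // t1: write 1 to k
    let v = *state.get("k").unwrap();  // t2: read k (sees 1) ...
    state.insert("k2", v);             // ... and write it to k'
    *state.get("k2").unwrap()
}

// Buffered runtime workers: both workers read the initial state; their
// writes are applied afterwards in worker-creation order.
fn buffered() -> i32 {
    let initial = BTreeMap::from([("k", 0), ("k2", 2)]);
    let t1_writes = BTreeMap::from([("k", 1)]);
    let v = *initial.get("k").unwrap(); // t2: read k (still sees 0)
    let t2_writes = BTreeMap::from([("k2", v)]);
    let mut state = initial;
    state.extend(t1_writes);
    state.extend(t2_writes);
    *state.get("k2").unwrap()
}
```

The two schedules produce different values for k' and hence different storage roots for the same block.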

This would be a violation anyway.

What do we consider a violation currently in our purely sequential model? I infer from your comment that two writes to the same key within a block are a violation? I don't see why they should be, given that everyone does the same thing deterministically.

Will you start with block production? Block production will inherently always be slower than import. So, optimizing import isn't probably the first priority?

Yeah indeed first block production. Import would come next and would be mostly about the follow up of this issue:

_Once this exists, a new issue should be created to introduce extrinsic parallelism whereby the block author prefixes the extrinsic set with an extrinsic to specify a dependency graph stating which extrinsics are mutually independent and can be executed in parallel. The runtime executive can then use this information together with a number of worker threads in order to safely parallelise extrinsic queue execution._

Once an author can distribute a schedule, the role of the import would be simply to use it and import in parallel. That's in the future though.

@kianenigma
Contributor

Another fishy example that came to mind: if we are to allow multiple transactions from the same origin in the same block, then the fee updates need to be precisely NOT buffered; otherwise someone who submits n transactions within one block pays only for max(fee(n)) of those n.

Maybe we should also provide some memory fence to ensure such operations are actually visible to all other threads before we can move on.

@burdges

burdges commented Mar 10, 2020

Agreed, we cannot process multiple transactions spending from the same source in separate threads. All memory models discussed above should handle that particular case, but these memory models differ when some threads read the account and only one spends from it. In the short term, we might simply forbid a block from containing multiple transactions that require signatures by the same account.

@NikVolf
Contributor

NikVolf commented Mar 16, 2020

I'll add a sort-of fork implementation prototype soonish that we can iterate on :)

@bkchr
Member

bkchr commented Mar 16, 2020

How is set_storage related to the memory model?

If you buffer, you basically go from the sequential consistency (SC) model to a TSO-like model. After a write, your own thread has access to the updated version whilst others will only ever see the initial state of the block, regardless of any schedule. The link above is a good read about it.

How should it affect kusama? Even if this would be work out of the box (without changing the runtime, what I don't expect), why should the storage root be different? If the storage root would be different, it would mean that there were multiple transactions modifying the same memory position.

You can get a different root even if there is just a read-write conflict.

In a previous block, key k is initially 0 and k' is 2:
t1 writes 1 to k.
t2 reads from k and writes it to k'.
in that particular block, the block author chose to do t1 -> t2. k' is 1

if you re-execute the same block with a runtime worker, k' is 0 since the writes are buffered and
then applied in the order of runtime threads.

I don't understand this example :D

This would be a violation anyway.

What do we consider a violation currently in our purely sequential model? I infer from your comment that two writes to the same key within a block are a violation? I don't see why they should be, given that everyone does the same thing deterministically.

Okay, if we expect pure writes, like someone who for whatever reason writes to a certain location some value with no other connection, then your view is correct. However, I think that this doesn't exist in a real-world scenario. Take, for example, the current weight of the block: that is a field that is read and written by transactions. We need to synchronize these reads and writes, as otherwise the value will just be garbage.

Or think of a transaction where people want to put some money in a pot. Two different people want to do this in one block. Both transactions would need to happen in the same thread, as otherwise you would have a violation.

Will you start with block production? Block production will inherently always be slower than import. So, optimizing import isn't probably the first priority?

Yeah indeed first block production. Import would come next and would be mostly about the follow up of this issue:

I thought about this again and think that maybe doing import first is easier and not that hard. You probably just need some sort of heuristic to distribute the transactions between threads and start applying them. The good thing: if the state root is different, you can just fall back to single-threaded import. For block production you probably don't have enough time to switch back to the single-threaded strategy.

@Demi-Marie
Contributor

Agreed, we cannot process multiple transactions spending from the same source in separate threads. All memory models discussed above should handle that particular case, but these memory models differ when some threads read the account and only one spends from it. In the short term, we might simply forbid a block from containing multiple transactions that require signatures by the same account.

Speaking as a potential user of a blockchain, I don’t consider that acceptable. The problem is that I might submit a transaction, and then decide I want to submit yet another transaction. Artificially delaying the second one doesn’t seem like a good idea.

@Demi-Marie
Contributor

Also, sequential consistency is not enough. What we need is determinism.

The best solution I can come up with is software transactional memory, or STM. If we have a conflict, one of the transactions is rolled back and retried later. Obviously, the order in which transactions appear to have been executed needs to be in the block.
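A minimal single-threaded sketch of the STM idea (versioned values, optimistic validation at commit; all type and method names here are hypothetical):

```rust
use std::collections::BTreeMap;

type Key = &'static str;

// Each value carries a version; a transaction records the versions it
// read and buffers its writes until commit.
#[derive(Default)]
struct Store {
    data: BTreeMap<Key, (u64, i32)>, // key -> (version, value)
}

#[derive(Default)]
struct Tx {
    reads: BTreeMap<Key, u64>, // key -> version observed
    writes: BTreeMap<Key, i32>,
}

impl Tx {
    fn write(&mut self, key: Key, val: i32) {
        self.writes.insert(key, val);
    }
}

impl Store {
    fn read(&self, tx: &mut Tx, key: Key) -> i32 {
        if let Some(&v) = tx.writes.get(key) {
            return v; // read-your-own-writes
        }
        let (ver, val) = *self.data.get(key).unwrap_or(&(0, 0));
        tx.reads.insert(key, ver);
        val
    }

    // Commit succeeds only if no key read by the transaction has been
    // re-versioned since it was read; otherwise the caller rolls back
    // and retries the transaction later.
    fn commit(&mut self, tx: Tx) -> bool {
        for (key, ver) in &tx.reads {
            let current = self.data.get(key).map(|&(v, _)| v).unwrap_or(0);
            if current != *ver {
                return false;
            }
        }
        for (key, val) in tx.writes {
            let ver = self.data.get(key).map(|&(v, _)| v).unwrap_or(0);
            self.data.insert(key, (ver + 1, val));
        }
        true
    }
}
```

Determinism would then come from recording the apparent commit order in the block, so that importing nodes replay the same serialization rather than rediscovering one.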

@kianenigma
Contributor

Updating: due to the workload on the staking side and my slightly limited availability in the next few months, I see it as very unlikely that I can work on this.

This issue was closed.
7 participants