An initial explanation of the issue and idea, based on notes from @supaiku0. This will most likely need refinement as implementation progresses.
A few months ago, @supaiku0 worked on a proof of concept:
https://github.com/ArkEcosystem/core/tree/wip/core-transaction-pool/worker
It is nowhere near production-ready, but works in principle. Currently, the `/transactions` POST endpoint can easily be abused to cause high load on nodes, because it validates all transactions on the main thread, which causes CPU spikes. Since the signature check is quite heavy, this can even be triggered by broadcasting invalid transactions targeted at specific nodes. This is why node operators are advised to secure their `core-api` access, or to disable it completely if it runs in front of a forger.

The p2p endpoint already received a workaround by giving the main thread room to breathe: https://github.com/ArkEcosystem/core/pull/2848/files
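The "room to breathe" idea can be sketched as follows (a minimal illustration, not the actual code from the PR; the function and chunk size are hypothetical): validation is split into chunks and the event loop is yielded between chunks, so heavy signature checks do not starve other work on the main thread.

```typescript
// Hypothetical sketch: yield to the event loop between validation chunks.
type Transaction = { id: string };

const CHUNK_SIZE = 25; // assumed batch size

async function validateWithBreathingRoom(
    transactions: Transaction[],
    validate: (tx: Transaction) => boolean,
): Promise<Transaction[]> {
    const valid: Transaction[] = [];
    for (let i = 0; i < transactions.length; i += CHUNK_SIZE) {
        for (const tx of transactions.slice(i, i + CHUNK_SIZE)) {
            if (validate(tx)) {
                valid.push(tx);
            }
        }
        // let pending I/O and timers run before starting the next chunk
        await new Promise((resolve) => setImmediate(resolve));
    }
    return valid;
}
```

This only interleaves the work with other requests; the CPU cost itself stays on the main thread, which is why it remains a workaround rather than a fix.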
So the problem only manifests when using the `core-api` endpoint. Ideally, however, this workaround is replaced with a more generic solution that also covers `core-api`. This is where the pool worker comes in.

The flow right now is:
POST /transactions -> create new `Processor` instance -> validate -> addTransactionsToPool -> return response (accepted, ignored, excess, error)
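The flow above could be summarized like this (a simplified sketch; `Processor` exists in core, but the method shapes and result fields here are assumptions):

```typescript
// Simplified sketch of the current synchronous flow (method shapes assumed).
type PoolResult = {
    accepted: string[];
    ignored: string[];
    excess: string[];
    errors: Record<string, string>;
};

class Processor {
    public validate(transactions: any[]): any[] {
        // heavy signature checks happen here, on the main thread
        return transactions.filter((tx) => tx.signatureValid === true);
    }
}

function postTransactions(transactions: any[], addToPool: (txs: any[]) => void): PoolResult {
    const processor = new Processor();
    const valid = processor.validate(transactions);
    addToPool(valid);
    const validIds = new Set(valid.map((tx) => tx.id));
    return {
        accepted: valid.map((tx) => tx.id),
        ignored: [],
        excess: [],
        errors: Object.fromEntries(
            transactions
                .filter((tx) => !validIds.has(tx.id))
                .map((tx) => [tx.id, "invalid signature"]),
        ),
    };
}
```

The response is only produced once every transaction in the request has been fully validated, which is exactly what ties up the main thread.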
With the pool worker it would change to something like this:
POST /transactions -> enqueue transactions, which creates a job -> return response (`ticketId`, e.g. a sequentially increasing number)

A `ticketId` represents a job that is either in the queue (about to be sent to the worker), being processed (somewhere in the worker), or done (returned from the worker).

End users/clients can query the status by hitting an endpoint of a peer they broadcasted to, using the ticket id, which would return a response resembling what they currently get from the endpoint.
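A possible shape for such a ticket queue (all names here are hypothetical, not a committed API):

```typescript
// Hypothetical ticket queue: POST /transactions enqueues a job and returns
// a sequentially increasing ticketId; clients poll a status endpoint later.
type JobStatus = "queued" | "processing" | "done";

interface Job {
    ticketId: number;
    status: JobStatus;
    transactions: any[];
    result?: { accepted: string[]; invalid: string[] };
}

class TicketQueue {
    private nextTicketId = 1;
    private jobs = new Map<number, Job>();

    // called by the POST /transactions handler
    public enqueue(transactions: any[]): number {
        const ticketId = this.nextTicketId++;
        this.jobs.set(ticketId, { ticketId, status: "queued", transactions });
        return ticketId;
    }

    // called when the worker picks a job up
    public markProcessing(ticketId: number): void {
        const job = this.jobs.get(ticketId);
        if (job) job.status = "processing";
    }

    // called when the worker reports a finished job
    public complete(ticketId: number, result: Job["result"]): void {
        const job = this.jobs.get(ticketId);
        if (job) {
            job.status = "done";
            job.result = result;
        }
    }

    // called by the status endpoint, using the ticketId from the POST response
    public status(ticketId: number): Job | undefined {
        return this.jobs.get(ticketId);
    }
}
```

A real implementation would also need to evict finished jobs after some retention period, otherwise the map grows without bound.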
^ This is a significant change API-wise and breaks all kinds of client software, which is why the worker has been postponed to 3.0.
Queued jobs are then pushed to the worker, which does all the heavy lifting and, once done, reports back to the main thread, which in turn adds the valid transactions to the transaction pool.
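In Node.js this hand-off could be built on `worker_threads`, roughly like this (a sketch under that assumption; the inline worker source and its stub validation stand in for a real worker module):

```typescript
// Hypothetical use of node:worker_threads to move validation off the
// main thread. The eval'd source below is a stand-in worker module.
import { Worker } from "worker_threads";

const workerSource = `
const { parentPort } = require("worker_threads");
parentPort.on("message", (job) => {
    // heavy signature checks would happen here; this stub accepts
    // every transaction with a non-empty id
    const accepted = job.transactions.filter((tx) => tx.id !== "").map((tx) => tx.id);
    parentPort.postMessage({ ticketId: job.ticketId, accepted });
});
`;

function startPoolWorker(
    onResult: (result: { ticketId: number; accepted: string[] }) => void,
): Worker {
    const worker = new Worker(workerSource, { eval: true });
    worker.on("message", onResult); // worker reports back to the main thread
    return worker;
}
```

The main thread would call `worker.postMessage({ ticketId, transactions })` for each queued job and move the accepted transactions into the pool when the result message arrives.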
A nice benefit of this approach is that it also makes the frequency at which a node rebroadcasts transactions to other peers more deterministic. Right now, nodes rebroadcast whenever they are done validating the current batch of transactions (i.e. 1 request -> 1 broadcast). A worker, on the other hand, could report its finished jobs only every 100 ms, say, so a well-behaved node would rebroadcast at most 10 times per second. This greatly reduces the snowball effect that can currently be observed when the network is flooded with many transactions. Also, the rate limit on the endpoint can then be properly calibrated.
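The throttled rebroadcast could be sketched like this (names and the 100 ms interval are assumptions from the description above, not settled values):

```typescript
// Hypothetical throttled rebroadcast: finished transactions accumulate
// and are flushed on a fixed interval, capping rebroadcasts at
// 1000 / FLUSH_INTERVAL_MS per second instead of one per request.
const FLUSH_INTERVAL_MS = 100; // assumed interval -> at most 10 broadcasts/s

class RebroadcastBuffer {
    private pending: string[] = [];
    private timer?: ReturnType<typeof setInterval>;

    constructor(private readonly broadcast: (txIds: string[]) => void) {}

    public start(): void {
        this.timer = setInterval(() => this.flush(), FLUSH_INTERVAL_MS);
    }

    public stop(): void {
        if (this.timer) clearInterval(this.timer);
        this.flush(); // drain anything still pending
    }

    // called whenever the worker reports a finished job
    public add(txIds: string[]): void {
        this.pending.push(...txIds);
    }

    private flush(): void {
        if (this.pending.length === 0) return;
        this.broadcast(this.pending);
        this.pending = [];
    }
}
```

Because empty flushes are skipped, an idle node broadcasts nothing, while a flooded node is capped at the interval rate, which is what makes the rate limit calibratable.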