
CRDT-Batching support #1008

Closed
hsanjuan opened this issue Feb 20, 2020 · 5 comments · Fixed by #1346
Labels: exp/intermediate (Prior experience is likely helpful) · help wanted (Seeking public contribution on this issue) · kind/enhancement (A net-new feature or improvement to an existing feature) · status/ready (Ready to be worked)

Comments

@hsanjuan
Collaborator

Describe the feature you are proposing

go-ds-crdt supports batching. Multiple updates can be included in the same DAG node. With batching, we do not need to issue one block per update.

Clusters with heavy pinning/unpinning intake can take advantage of batching to greatly reduce the DAG size (I think around ~20,000 pins can go in a single batch).

Batching can work in two ways:

  • Time based: a batch is published every X seconds, if any updates have been made, grouping all updates from that window into a single batch.
  • Size based: a batch is published as soon as X updates have been made.

Both approaches can be combined (whichever condition is hit first triggers the batch), e.g. a batch is issued every 100 updates, but it is sent in any case once 10 minutes have passed since the previous batch.

Note that, in principle, batch support is in-memory: uncommitted batches will be lost.
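A minimal sketch of this "whichever condition hits first" trigger logic. The names here (`batcher`, `commitFn`) are illustrative, not the actual go-ds-crdt or ipfs-cluster API:

```go
// Sketch of a combined size/age batch trigger, under assumed names.
package batching

import (
	"sync"
	"time"
)

type batcher struct {
	mu       sync.Mutex
	updates  []string      // queued updates (e.g. pin/unpin CIDs)
	maxSize  int           // max_batch_size
	maxAge   time.Duration // max_batch_age
	timer    *time.Timer
	commitFn func([]string) // publishes one DAG node with all updates
}

// Add queues an update and commits when either limit is hit.
func (b *batcher) Add(u string) {
	b.mu.Lock()
	defer b.mu.Unlock()

	b.updates = append(b.updates, u)
	if len(b.updates) == 1 {
		// First update of a new batch: start the age timer.
		b.timer = time.AfterFunc(b.maxAge, b.commitAsync)
	}
	if len(b.updates) >= b.maxSize {
		b.timer.Stop()
		b.commitLocked()
	}
}

func (b *batcher) commitAsync() {
	b.mu.Lock()
	defer b.mu.Unlock()
	b.commitLocked()
}

func (b *batcher) commitLocked() {
	if len(b.updates) == 0 {
		return // timer fired after a size-triggered commit
	}
	b.commitFn(b.updates)
	b.updates = nil
}
```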

Batching should be configurable in the crdt config section:

"batching" : {
  "max_batch_size": number_of_updates,
  "max_batch_age": "duration"
}

0 values disable batching.
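For example, a configuration that commits a batch every 2000 updates, or after 30 seconds at the latest (the values here are purely illustrative, not recommendations):

```json
"batching": {
  "max_batch_size": 2000,
  "max_batch_age": "30s"
}
```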

hsanjuan added the kind/enhancement, help wanted, exp/intermediate, and status/ready labels on Feb 20, 2020
@RubenKelevra
Collaborator

Sounds nice, should reduce the amount of height I collect per day significantly ;)

```
INFO crdt: crdt Datastore created. Number of heads: 1. Current max-height: 10990 crdt.go:262
```


How about the ability to open a batch and either commit it or discard it?

This way we can commit all changes to an open batch and, when everything is done, commit it as an atomic operation to the cluster.

If it fails to get the cluster head, the commit operation should fail and the whole batch gets discarded by the cluster.

This helps when multiple servers write to the same cluster, since you can add a retry on the application level, just running the same operation again on the new head/height.

So we're basically talking about 3 different modes:

  • set number_of_updates/duration -> just commit to the cluster from time to time (might result in data loss on concurrent writes). A commit/sync operation would be a nice addition, so a not-yet-expired duration can be considered expired and the batch is immediately pushed to the cluster.

  • no batching at all -> commit every operation alone (one operation might fail on concurrent writes)

  • open batch/close batch -> a logical write operation with many changes coalesces into one big batch. The batch close will wait until the majority of the cluster has returned a status, thus agreeing that this is the new head/height.

Edit: Rewrote the comment

@hsanjuan
Collaborator Author

> How about the ability to open a batch and either commit it or discard it?
> ...

Sorry, but this issue is very well scoped already. Batches are local. There is no synchronization between batches started in different peers. Concurrent write operations in a cluster don't fail by design.

> The close batch will wait until the majority of the cluster has returned a status

You underestimate the pitfalls and effort needed to set up coordination like this on a distributed system. It is way more complicated than it seems.

The ability to manually commit an open local batch, while useful, would also need to be extended to Raft, which has no batching support at all. It is therefore out of scope for the moment.

@RubenKelevra
Collaborator

> > How about the ability to open a batch and either commit it or discard it?
> > ...
>
> Sorry, but this issue is very well scoped already. Batches are local. There is no synchronization between batches started in different peers. Concurrent write operations in a cluster don't fail by design.

I think you misunderstood what I meant.

Your approach sounds like MySQL's auto-commit. Every now and then, changes are flushed to the cluster.

To cut my idea down, can we get non-automatic batching?

  • one command to open a new batch
  • one command to discard the current batch (as if IPFS-Cluster had crashed)
  • one command to close the batch and flush it to the cluster.

The close/flush command could get a --wait flag similar to the one the pin operation already has, so it returns when the cluster has completely answered.

The idea of a reduced 'wait quorum' would be to get the current number of peers (like peers ls does) and return as soon as half of the acknowledgements have come back instead of 'all', which might wait indefinitely when a cluster member goes offline.
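A rough sketch of the API shape this proposal implies. None of these names exist in ipfs-cluster; this only illustrates the open/discard/commit semantics described above:

```go
// Hypothetical manual-batching interface, not the actual cluster API.
package batching

import "context"

// Batch collects pin/unpin operations until explicitly closed.
type Batch interface {
	// Pin and Unpin queue operations into the open batch.
	Pin(ctx context.Context, cid string) error
	Unpin(ctx context.Context, cid string) error

	// Discard drops all queued operations (e.g. after a failure).
	Discard()

	// Commit flushes the batch to the cluster as one atomic delta.
	// With wait=true it would return only once enough peers have
	// acknowledged the new head (the proposed --wait behavior).
	Commit(ctx context.Context, wait bool) error
}
```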

If that's too much for one change I fully understand! :)

@hsanjuan
Collaborator Author

hsanjuan commented Mar 9, 2020

> can we get non-automatic batching

I'd say yes, after automatic batching support as scoped here exists (this is smaller in scope and does not require API changes).

I see it would be super useful to you; I am just trying to define well-scoped work units. I will give it some thought and open a follow-up issue on non-automatic batching.

@RubenKelevra
Collaborator

Sounds great!

hsanjuan added a commit that referenced this issue Apr 28, 2021
This adds batching support to crdt-consensus per #1008. The crdt component can now take
advantage of the BatchingState, which uses the batching-crdt datastore. In
batching mode, the crdt datastore groups any Add and Delete operations
in a single delta (instead of just 1, as it does by default).

Batching is enabled in the crdt configuration section by setting MaxBatchSize
**and** MaxBatchAge. These two settings control when a batch is committed,
either by reaching a maximum number of pin/unpin operations, or by reaching a
maximum age.

Batching unlocks large pin-ingestion scalability for clusters, but should be
set according to expected workloads. An additional, hidden MaxQueueSize
parameter provides the ability to perform backpressure on Pin/Unpin
requests. When more than MaxQueueSize pin/unpins are waiting to be included in
a batch, the LogPin/LogUnpin operations will fail. If this happens, it
means the cluster cannot commit batches as fast as pins are arriving. Thus,
MaxQueueSize should be increased (to accommodate bursts), or the batch size
increased (to perform fewer commits and hopefully handle the requests faster).

Note that the underlying CRDT library will auto-commit when batch deltas reach
1MB of size.
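A sketch of how the MaxQueueSize backpressure described in the commit message could look, using a bounded channel as the pending-operations queue. The names are illustrative, not the actual ipfs-cluster code:

```go
// Bounded queue that fails fast once MaxQueueSize is exceeded.
package batching

import "errors"

// errQueueFull mirrors the failure LogPin/LogUnpin would return.
var errQueueFull = errors.New("batching queue full: batches are not committing fast enough")

type pinOp struct {
	unpin bool
	cid   string
}

type opQueue struct {
	ops chan pinOp // capacity = MaxQueueSize
}

func newOpQueue(maxQueueSize int) *opQueue {
	return &opQueue{ops: make(chan pinOp, maxQueueSize)}
}

// enqueue fails fast instead of blocking when more than
// MaxQueueSize operations are already waiting for a batch.
func (q *opQueue) enqueue(op pinOp) error {
	select {
	case q.ops <- op:
		return nil
	default:
		// Caller should raise MaxQueueSize (to absorb bursts) or
		// the batch size (fewer, larger commits).
		return errQueueFull
	}
}
```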
hsanjuan added this to the Release v0.13.3 milestone on Apr 30, 2021