Add random sharding for the DynamoDb outbox by jtsalva · Pull Request #2813 · BrighterCommand/Brighter

jtsalva · 2023-09-07T16:07:11Z

Addresses #2810

Add configuration for number of shards (default 3, max 20)
Add optional configuration for TTL (default null - so forever)
Applies random sharding when writing items into the dynamo outbox for the 'Outstanding' index PK
Query all shards when getting all outstanding messages

CLAassistant · 2023-09-07T16:07:16Z

All committers have signed the CLA.

jtsalva · 2023-09-07T16:18:16Z

@iancooper I'm not sure how to trigger CI

jtsalva · 2023-09-07T16:21:11Z

+
+        private async Task<IEnumerable<MessageItem>> QueryAllOutstandingShardsAsync(string topic, DateTime minimumAge, CancellationToken cancellationToken = default)
+        {
+            using var semaphore = new SemaphoreSlim(20);


No reasoning behind this being 20, maybe should be configurable

What is the reason for the semaphore?

I now realise maybe doing a Parallel.ForEachAsync is more suitable? I was thinking we need to set MaxDegreeOfParallelism as max number of shards is unbounded

I wonder if we could sensibly limit the number of shards. At some point we only want so many threads, so many open sockets etc. If someone exceeds that number in order for their outbox to work, I wonder if an outbox is the right solution to transactional messaging for them.

It is a good question though

i think we should set an upper limit when you configure the number of partitions. Let's say 20 for now and throw an error if you use more than that.

I wonder if you need more than 20 partitions whether you should first think about using DAX over increasing the number of partitions further.

That makes sense, will simplify to remove semaphores and just set max shards to 20. Can always be another PR if we find 20 isn't enough.

Speaking of DAX, I'm not sure how that would solve the hot partitioning issue, I think it would alleviate maybe some reads and reduce latency but underlying issue would still be there? (from someone who's only just read about DAX on the surface)

Looks like we have reached agreement to limit at config, which is more explicit.

In principle a write through-cache will limit a hotspot as our partition would tend to be held in memory not on disk so access would not hit an RCU or WCU limit. For "hot spots" DAX is often a good solution, particularly in this case where you write and then read shortly after what you just read, and can evict older elements (which would be dispatched messages for us). I would guess that a write through cache would resolve this issue. So if our partitioning strategy gets too many partitions and thus requires a lot of threads on your Sweeper process it may be more efficient to move to an in-memory cache.

Thanks for explaining Ian

jtsalva · 2023-09-07T16:28:14Z

+            _dynamoOverwriteTableConfig = new DynamoDBOperationConfig
+            {
+                OverrideTableName = _configuration.TableName,
+                ConsistentRead = true


Setting ConsistentRead = true as default for non-GSI lookups to mitigate against NullReferenceExceptions we've been seeing when calling PostAsync

iancooper · 2023-09-07T17:16:16Z

@iancooper I'm not sure how to trigger CI

GitHub feature, one of us has to approve you when you have not triggered one before. It's to stop you uploading something that will start bitcoin mining or the like

* Start to add random sharding for the DynamoDb outbox * Limit to 20 shards and remove semaphores --------- Co-authored-by: Ian Cooper <ian_hammond_cooper@yahoo.co.uk>

Start to add random sharding for the DynamoDb outbox

7ced4f2

jtsalva commented Sep 7, 2023

View reviewed changes

jtsalva and others added 2 commits September 8, 2023 13:21

Limit to 20 shards and remove semaphores

815c2e4

Merge branch 'master' into ShardDynamoDbOutbox

6b33195

jtsalva marked this pull request as ready for review September 8, 2023 12:23

jtsalva requested review from DevJonny, holytshirt and preardon as code owners September 8, 2023 12:23

jtsalva requested a review from iancooper September 8, 2023 12:24

iancooper added 2 - In Progress grabbed by community feature request labels Sep 11, 2023

iancooper added 3 - Done and removed 2 - In Progress labels Sep 25, 2023

iancooper self-assigned this Sep 25, 2023

Merge branch 'master' into ShardDynamoDbOutbox

4f5f185

iancooper merged commit bb9623b into BrighterCommand:master Sep 25, 2023

jtsalva deleted the ShardDynamoDbOutbox branch September 26, 2023 08:41

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add random sharding for the DynamoDb outbox#2813

Add random sharding for the DynamoDb outbox#2813
iancooper merged 4 commits into
BrighterCommand:masterfrom
jtsalva:ShardDynamoDbOutbox

jtsalva commented Sep 7, 2023 •

edited

Loading

Uh oh!

CLAassistant commented Sep 7, 2023 •

edited

Loading

Uh oh!

jtsalva commented Sep 7, 2023

Uh oh!

jtsalva Sep 7, 2023

Uh oh!

iancooper Sep 7, 2023

Uh oh!

jtsalva Sep 7, 2023

Uh oh!

iancooper Sep 7, 2023

Uh oh!

iancooper Sep 7, 2023

Uh oh!

jtsalva Sep 8, 2023

Uh oh!

iancooper Sep 10, 2023

Uh oh!

jtsalva Sep 11, 2023

Uh oh!

jtsalva Sep 7, 2023

Uh oh!

iancooper commented Sep 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

jtsalva commented Sep 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

CLAassistant commented Sep 7, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jtsalva commented Sep 7, 2023

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

iancooper commented Sep 7, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

jtsalva commented Sep 7, 2023 •

edited

Loading

CLAassistant commented Sep 7, 2023 •

edited

Loading