Initial commit of DynamoDB batch writer #118

jamesls · 2015-06-01T22:22:47Z

This is similar to what boto2 does in terms of
the interface with a few internal details changed:

* Keep a single buffer of puts and deletes. This simplifies
  the logic of when to send requests. It also sends the
  requests in the order they were called. In boto2, puts
  were always sent before deletes. This shouldn't affect
  the semantics, though, because you can't put and delete the same
  object in a batch request.
* Immediately handle unprocessed items in the next batch. boto2
  would keep these and flush them only at __exit__. This meant
  you could have unbounded growth of unprocessed items.
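
To make the two points above concrete, here is a minimal sketch of the idea. It is not the code from this PR: `SketchBatchWriter`, `FakeClient`, and the table name are hypothetical, and the fake client stands in for a real DynamoDB client so the example runs without AWS access. It assumes the client exposes `batch_write_item(RequestItems=...)` and returns an `UnprocessedItems` map.

```python
class SketchBatchWriter:
    """Hypothetical, simplified batch writer (not the code in this PR)."""

    def __init__(self, table_name, client, flush_amount=25):
        self._table_name = table_name
        self._client = client
        self._flush_amount = flush_amount
        # Single buffer: puts and deletes are kept together, in call order.
        self._items_buffer = []

    def put_item(self, Item):
        self._add({'PutRequest': {'Item': Item}})

    def delete_item(self, Key):
        self._add({'DeleteRequest': {'Key': Key}})

    def _add(self, request):
        self._items_buffer.append(request)
        if len(self._items_buffer) >= self._flush_amount:
            self._flush()

    def _flush(self):
        batch = self._items_buffer[:self._flush_amount]
        self._items_buffer = self._items_buffer[self._flush_amount:]
        response = self._client.batch_write_item(
            RequestItems={self._table_name: batch})
        unprocessed = response.get('UnprocessedItems') or {}
        # Re-queue leftovers right away so they go out with the next batch.
        self._items_buffer.extend(unprocessed.get(self._table_name, []))

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc_value, tb):
        # Drain whatever is left; a real implementation would want a
        # retry limit or backoff here.
        while self._items_buffer:
            self._flush()


class FakeClient:
    """Stand-in client: records each call and rejects one item, once."""

    def __init__(self):
        self.calls = []
        self._rejected_once = False

    def batch_write_item(self, RequestItems):
        (table, requests), = RequestItems.items()
        self.calls.append(list(requests))
        if not self._rejected_once:
            self._rejected_once = True
            return {'UnprocessedItems': {table: requests[:1]}}
        return {'UnprocessedItems': {}}


client = FakeClient()
with SketchBatchWriter('mytable', client, flush_amount=2) as batch:
    batch.put_item(Item={'id': '1'})
    batch.delete_item(Key={'id': '2'})
    batch.put_item(Item={'id': '3'})

for number, call in enumerate(client.calls, 1):
    print(number, call)
```

In this run the first request carries the put and the delete in the order they were issued, and the item the fake client rejects is resent in the second request instead of waiting for `__exit__`.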

Perf is about the same as boto2.

coveralls · 2015-06-01T22:24:24Z

Coverage decreased (-0.0%) to 97.66% when pulling 7f518e7 on jamesls:ddb-batch-write into 95c377b on boto:develop.

kyleknap · 2015-06-02T17:04:22Z

boto3/dynamodb/table.py

+            RequestItems={self._table_name: self._items_buffer})
+        unprocessed_items = response['UnprocessedItems']
+
+        if unprocessed_items and unprocessed_items[self._table_name]:


Is it guaranteed that the table_name is in the unprocessed_items? I would be wary of accessing it directly. Probably would use a get().

jamesls · 2015-06-02

Yes. From the API docs, UnprocessedItems is a map of table name to unprocessed items. Since we control the request and can guarantee that we only ever add items from a single table, we know that if UnprocessedItems is not empty, its contents have to come from the table we specified in the originating request.
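
The exchange above can be illustrated with plain dictionaries. This is a sketch only: the helper names, the table name, and the sample responses are hypothetical, and the response shape (an `UnprocessedItems` map that is empty when everything was written) is the one described in the reply.

```python
TABLE = 'mytable'  # hypothetical table name


def leftover_direct(response, table_name):
    # Mirrors the reviewed lines: the truthiness check short-circuits on an
    # empty map, so the direct index only runs when the map is non-empty.
    unprocessed_items = response['UnprocessedItems']
    if unprocessed_items and unprocessed_items[table_name]:
        return unprocessed_items[table_name]
    return []


def leftover_defensive(response, table_name):
    # The get()-based alternative suggested in review.
    return response.get('UnprocessedItems', {}).get(table_name, [])


all_done = {'UnprocessedItems': {}}
some_left = {
    'UnprocessedItems': {
        TABLE: [{'PutRequest': {'Item': {'id': {'S': '1'}}}}],
    }
}

# For single-table requests the two styles agree.
for response in (all_done, some_left):
    assert leftover_direct(response, TABLE) == leftover_defensive(response, TABLE)

# They differ only if the map names a table we never sent, which the
# single-table guarantee rules out.
other_table = {'UnprocessedItems': {'other': [{'PutRequest': {'Item': {}}}]}}
try:
    leftover_direct(other_table, TABLE)
    direct_raised = False
except KeyError:
    direct_raised = True
```

The direct index is safe exactly as long as every request targets one table; `get()` trades that assumption for silently ignoring an unexpected table name.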

kyleknap · 2015-06-02T17:29:58Z

Looks good. Code is pretty clean. I had a couple comments. Otherwise, 🚢

coveralls · 2015-06-02T20:03:36Z

Coverage decreased (-0.01%) to 97.66% when pulling 6535a93 on jamesls:ddb-batch-write into 95c377b on boto:develop.

Reduce log messages in batch to a single message

f9aadd9

kyleknap reviewed Jun 2, 2015
jamesls added 2 commits June 2, 2015 11:36

Rename unit test file to follow naming conventions

b4d3587

Fix log message

6535a93

jamesls force-pushed the ddb-batch-write branch from 92fb597 to 6535a93 on June 2, 2015, 19:59

jamesls merged commit 6535a93 into boto:develop Jun 2, 2015
