[BEAM-3516] Spanner BatchFn does not respect mutation limits #4860

NathanHowell · 2018-03-13T16:17:34Z

Estimate the number of mutations in in a group by counting the affected columns and associated indexes.

The code currently assumes each row contains a single mutation and has flushes batches after a (hardcoded) 10k row threshold has been exeeded. Spanner rejects commits that exceed 20k mutations, including indexes. This disconnect between the estimated mutations and the actual count causes commit failures.

This change estimates the actual mutations by counting the number of indexes covering each column, and summing up the counts of columns and indexes contained within a MutationGroup. The group is flushed prior to the limit being exceeded.

NathanHowell · 2018-03-14T19:56:02Z

Hi @mairbek and @dhalperi, could you take a look at this change? It's a bit light on tests..

mairbek

Thank you, Nathan! This looks good to me, I've left some minor comments. @chamikaramj could you please also take a look?

mairbek · 2018-03-20T16:45:13Z

...e-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/MutationCellEstimator.java

+ * Estimate the number of cells modified in a {@link MutationGroup}.
+ */
+final class MutationCellEstimator implements ToLongFunction<MutationGroup> {
+  private final LoadingCache<String, ImmutableMap<String, Long>> tables;


Let's move this to SpannerSchema.

mairbek · 2018-03-20T18:43:59Z

...e-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/MutationCellEstimator.java

+        // ranges should already be broken up into individual batches
+        // but just in case, make a worst-case estimate about the size
+        // of the key range so they will get their own transaction
+        final long ranges = Iterables.size(keySet.getRanges());


We only batch single key deletes. I think we can return zero or -1 here and avoid passing maxNumMutations

mairbek · 2018-03-20T18:45:46Z

...oogle-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/ReadSpannerSchema.java

+            + "     FROM information_schema.columns as c"
+            + "     WHERE c.table_catalog = '' AND c.table_schema = '') AS c"
+            + "  LEFT OUTER JOIN ("
+            + "    SELECT t.table_name, t.column_name, COUNT(1) AS indices"


Could we use COUNT(*) here? I think it is more common.

Estimate the number of mutations in in a group by counting the columns and associated indexes

dhalperi · 2018-03-21T15:18:37Z

Given current job responsiibilities, will not be able to review in a reasonable amount of time -- sorry!

Reviewed 6 of 6 files at r1.
Review status: all files reviewed at latest revision, 3 unresolved discussions.

Comments from Reviewable

NathanHowell · 2018-03-23T20:21:05Z

@mairbek I've made the requested changes, can you take another look? (an unrelated test is failing atm)

mairbek · 2018-04-05T20:16:08Z

lgtm, @chamikaramj can you take a look?

mairbek · 2018-04-05T20:16:46Z

retest this please

mairbek · 2018-05-07T21:18:00Z

I'm looking to make it part of Beam 2.5.0

NathanHowell · 2018-05-07T21:51:19Z

Excellent. We're no longer using Spanner (AWS won the contract) so I can't do much testing... but I can probably get it rebased against master later this week.

jkff · 2018-05-07T21:55:07Z

This PR needs to be rebased. Please also take a look at precommit failures and fix them.

mairbek · 2018-05-07T22:20:22Z

@NathanHowell sad to hear that AWS won 🥇

I can fork and rebase the PR, 2.5.0 cut is planned for tomorrow.

NathanHowell · 2018-05-07T22:23:13Z

@mairbek I know... we're still using Beam though 👍 😃 thanks for picking it up!

NathanHowell · 2018-05-11T14:40:01Z

Fixed in #5297, thanks for the help! 👍

NathanHowell force-pushed the BEAM-3516 branch from 10f9485 to d61ef64 Compare March 14, 2018 01:59

mairbek reviewed Mar 20, 2018

View reviewed changes

Nathan Howell added 5 commits March 20, 2018 18:56

[BEAM-3516] Spanner BatchFn does not respect mutation limits

7812ac5

Estimate the number of mutations in in a group by counting the columns and associated indexes

Add license header to MutationCellEstimator

43c2a85

Revert back to Guava 20.0 for Hadoop compatibility

a372455

Use COUNT(*) instead of COUNT(1)

fc71a1d

Remove special handling for batching of range deletes

695d9c0

Precompute mutation counts for each column and table

ae0404e

NathanHowell force-pushed the BEAM-3516 branch from d61ef64 to ae0404e Compare March 22, 2018 21:42

Mark batch field as transient to satisfy FindBugs

acdf12c

mairbek mentioned this pull request May 7, 2018

[BEAM-3516] Spanner BatchFn does not respect mutation limits #5297

Merged

NathanHowell closed this May 11, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[BEAM-3516] Spanner BatchFn does not respect mutation limits #4860

[BEAM-3516] Spanner BatchFn does not respect mutation limits #4860

NathanHowell commented Mar 13, 2018

NathanHowell commented Mar 14, 2018

mairbek left a comment

mairbek Mar 20, 2018

mairbek Mar 20, 2018

mairbek Mar 20, 2018

dhalperi commented Mar 21, 2018

NathanHowell commented Mar 23, 2018

mairbek commented Apr 5, 2018

mairbek commented Apr 5, 2018

mairbek commented May 7, 2018

NathanHowell commented May 7, 2018

jkff commented May 7, 2018

mairbek commented May 7, 2018

NathanHowell commented May 7, 2018

NathanHowell commented May 11, 2018

[BEAM-3516] Spanner BatchFn does not respect mutation limits #4860

[BEAM-3516] Spanner BatchFn does not respect mutation limits #4860

Conversation

NathanHowell commented Mar 13, 2018

NathanHowell commented Mar 14, 2018

mairbek left a comment

Choose a reason for hiding this comment

mairbek Mar 20, 2018

Choose a reason for hiding this comment

mairbek Mar 20, 2018

Choose a reason for hiding this comment

mairbek Mar 20, 2018

Choose a reason for hiding this comment

dhalperi commented Mar 21, 2018

NathanHowell commented Mar 23, 2018

mairbek commented Apr 5, 2018

mairbek commented Apr 5, 2018

mairbek commented May 7, 2018

NathanHowell commented May 7, 2018

jkff commented May 7, 2018

mairbek commented May 7, 2018

NathanHowell commented May 7, 2018

NathanHowell commented May 11, 2018