[BEAM-6604] Check for null values when encoding Key columns. by nielm · Pull Request #7747 · apache/beam

nielm · 2019-02-06T09:39:40Z

When encoding Key Column values, if the column value is unspecified, assume that the value is null.

This fixes a NullPointerException (NPE) thrown when encoding the key of a mutation that does not set a value for a key column.
@chamikaramj

chamikaramj · 2019-02-06T10:15:04Z

...ogle-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/MutationKeyEncoder.java

    for (SpannerSchema.KeyPart part : schema.getKeyParts(m.getTable())) {
      Value val = mutationMap.get(part.getField());
-      if (val.isNull()) {
+      if (val == null || val.isNull()) {


So basically we treat all omitted fields as NULL? Isn't it better to ask users to set these values to NULL explicitly? I'm worried that this will result in NULL values being inserted because of bugs in user code.

Another option might be to make this more explicit, for example by adding a transform builder method withNullAsDefault().

Also, it looks like Spanner supports NOT NULL columns, which could lead to confusing behavior in combination with a default, non-explicit NULL from Beam.

WDYT?

nielm · Feb 6, 2019 (edited)

Consider the behavior of a normal insert statement that omits a column value:

    CREATE TABLE test (key1 STRING(MAX), key2 STRING(MAX)) PRIMARY KEY (key1, key2);
    INSERT INTO test (key1) VALUES ("a");

This results in a row with values ("a", <NULL>).
So treating an omitted value as NULL matches the behavior of the database.

Note also that this code is only concerned with Primary Key columns, and the encoded value is only used for sorting mutations in a bundle by table and primary key -- the value encoded here is never written to the database.

NOT NULL constraints can be set on any column. If the Mutation is an insert and does not specify a column which is NOT NULL (or sets it to NULL), then the insert will fail, and the Mutation is appended to a PCollection of failed MutationGroups, which is output by the SpannerIO.Write transform.
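
To make the two points above concrete (an omitted key column is read as null, and the resulting key is used only to order mutations), here is a minimal sketch. It is not the actual MutationKeyEncoder: the class KeySortSketch, its methods, the use of plain string maps in place of Spanner Mutation and Value objects, and the choice to order nulls first are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;
import java.util.Map;

/** Hypothetical stand-in for key encoding; rows are plain maps, not Spanner Mutations. */
final class KeySortSketch {

  /** Returns the key column values in key order; an omitted column yields null. */
  static List<String> keyParts(List<String> keyColumns, Map<String, String> row) {
    List<String> parts = new ArrayList<>();
    for (String column : keyColumns) {
      // Map.get returns null both for an absent column and for an explicit null,
      // and no method is called on the value, so a missing column cannot cause an NPE.
      parts.add(row.get(column));
    }
    return parts;
  }

  /** Compares key parts position by position. Ordering nulls first is an assumption. */
  static final Comparator<List<String>> KEY_ORDER =
      (a, b) -> {
        Comparator<String> part = Comparator.nullsFirst(Comparator.naturalOrder());
        for (int i = 0; i < Math.min(a.size(), b.size()); i++) {
          int c = part.compare(a.get(i), b.get(i));
          if (c != 0) {
            return c;
          }
        }
        return Integer.compare(a.size(), b.size());
      };

  /** Returns a copy of the rows sorted by key; the rows themselves are never modified. */
  static List<Map<String, String>> sortByKey(
      List<String> keyColumns, List<Map<String, String>> rows) {
    List<Map<String, String>> sorted = new ArrayList<>(rows);
    sorted.sort((r1, r2) -> KEY_ORDER.compare(keyParts(keyColumns, r1), keyParts(keyColumns, r2)));
    return sorted;
  }
}
```

In this sketch a row that sets only key1 sorts next to other rows with the same key1 value, and the row passed downstream is unchanged: the null exists only in the derived sort key, never in the data to be written.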

chamikaramj · Feb 6, 2019

OK. Thanks for clarifying.

chamikaramj · 2019-02-06T17:59:51Z

Run Beam PostCommit

chamikaramj · 2019-02-06T17:59:58Z

Run Java PostCommit

chamikaramj · 2019-02-06T18:00:25Z

LGTM. Will merge after post-commit tests pass.

chamikaramj reviewed Feb 6, 2019


Check for null values when encoding Key columns.

59a71fd

When encoding Key Column values, if the column value is unspecified, assume that the value is null.

nielm force-pushed the nielmbeam branch from 258e6db to 59a71fd on February 6, 2019 12:56

chamikaramj merged commit 1512551 into apache:master Feb 8, 2019

nielm changed the title ~~[BEAM-4359] Check for null values when encoding Key columns.~~ [BEAM-6604] Check for null values when encoding Key columns. Feb 19, 2019

chamikaramj merged 1 commit into apache:master from nielm:nielmbeam
