Prefer DEFAULT to NULL when inserting into, actually validate append #607

benc-db · 2024-03-06T17:44:02Z

Description

A github user pointed out that technically using NULL could lead to different behavior than *, where DEFAULT would be honored for missing columns, so I have adjusted insert into to use DEFAULT instead. Also, noticed that at some point I lost my config that made the append tests actually test append, so added that back. The second half of the 'append columns' test for the append strategy validates that DEFAULT is used and does not break.

Checklist

I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have updated the CHANGELOG.md and added information about my change to the "dbt-databricks next" section.

nimeshpatni · 2024-03-21T06:43:18Z

dbt/include/databricks/macros/materializations/incremental/strategies.sql

@@ -60,7 +60,7 @@
      {%- if dest_col in source_columns -%}
        {%- do common_columns.append(dest_col) -%}
      {%- else -%}
-        {%- do common_columns.append('NULL') -%}
+        {%- do common_columns.append('DEFAULT') -%}


Hi,
This seems to be a breaking change, "NULL" (String value) or "DEFAULT" (String value) won't solve the purpose. For all the unmatched columns, we will end up having multiple "DEFAULT" (String) values as columns that can raise conflict/ambiguous column error.

Default value works when we don't pass the column in the argument of insert statement. Could you please try this simple resolution:

{%- set common_columns_csv = set(source_columns).intersection(dest_columns) | sort | join(', ') -%}

They're not inserted as strings, but as literals.

Example from test run:

insert into table `peco`.`test17110372962014884605_test_incremental_on_schema_change`.`incremental_append_new_columns_remove_one` (`id`, `field1`, `field2`, `field3`, `field4`) select `id`, `field1`, DEFAULT, `field3`, `field4` from `incremental_append_new_columns_remove_one__dbt_tmp`

prefer DEFAULT to NULL, and fix test to actually validate append

8027efc

benc-db requested review from andrefurlan-db and rcypher-databricks as code owners March 6, 2024 17:44

update unit test

55edd3a

benc-db had a problem deploying to azure-prod March 6, 2024 17:53 — with GitHub Actions Error

changelog

bee2619

benc-db merged commit 3f2e84b into main Mar 6, 2024
18 checks passed

nimeshpatni reviewed Mar 21, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prefer DEFAULT to NULL when inserting into, actually validate append #607

Prefer DEFAULT to NULL when inserting into, actually validate append #607

benc-db commented Mar 6, 2024 •

edited

nimeshpatni Mar 21, 2024

benc-db Mar 21, 2024

benc-db Mar 21, 2024

Prefer DEFAULT to NULL when inserting into, actually validate append #607

Prefer DEFAULT to NULL when inserting into, actually validate append #607

Conversation

benc-db commented Mar 6, 2024 • edited

Description

Checklist

nimeshpatni Mar 21, 2024

Choose a reason for hiding this comment

benc-db Mar 21, 2024

Choose a reason for hiding this comment

benc-db Mar 21, 2024

Choose a reason for hiding this comment

benc-db commented Mar 6, 2024 •

edited