implement type_boolean macro #5875

jpmmcneill · 2022-09-18T21:31:40Z

resolves (partially) #5739

Description

dbt-core component required for #5739.

Other PRs

dbt-redshift: add type boolean dbt-redshift#190
dbt-bigquery: add type boolean dbt-bigquery#313
dbt-snowflake: Jpmmcneill/snowflake type boolean dbt-snowflake#268
dbt-spark: Jpmmcneill/spark type boolean dbt-spark#471

Checklist

I have read the contributing guide and understand what's expected of me
I have signed the CLA
I have run this code in development and it appears to resolve the stated issue
This PR includes tests, or tests are not required/relevant for this PR
I have opened an issue to add/update docs, or docs changes are not required/relevant for this PR
- Update list of cross-database macros docs.getdbt.com#2049
I have run changie new to create a changelog entry

jpmmcneill · 2022-09-18T21:35:01Z

core/dbt/include/global_project/macros/utils/data_types.sql

@@ -59,7 +59,7 @@ The TIMESTAMP_* variation associated with TIMESTAMP is specified by the TIMESTAM
    {{ return(api.Column.translate_type("float")) }}
 {% endmacro %}

-{# numeric  ------------------------------------------------     #}
+{# numeric  -------------------------------------------------     #}


every other comment had this many dashes 🙃

dbeatty10 · 2022-09-19T14:08:45Z

@jpmmcneill this is looking good to me! 🤩

Here's the next steps for this PR:

Run changie new per these instructions and commit and push the resulting file.
Open up pull requests for dbt-redshift, dbt-snowflake, and dbt-spark that add your new test to their test suites.
- Unfortunately, new tests aren't inherited and executed automatically -- they need to be added explicitly.

I'll add some further instructions on your PR for dbt-bigquery on how to point at this PR branch for the purposes of automated CI tests in GitHub. (We'll want the adapters to be using your PR branch rather than the main branch.)

Once all of the PRs are passing the tests in GitHub Actions, we will merge dbt-core first. Then we will run CI once more for each adapter (pointing back at main) before merging them.

dbeatty10 · 2022-09-20T02:17:13Z

tests/adapter/dbt/tests/adapter/utils/data_types/test_type_boolean.py

+seeds__expected_csv = """boolean_col
+True
+""".lstrip()
+
+models__actual_sql = """
+select cast('True' as {{ type_boolean() }}) as boolean_col
+"""


Probably want to cover both boolean options at the very least.

Suggested change

seeds__expected_csv = """boolean_col

True

""".lstrip()

models__actual_sql = """

select cast('True' as {{ type_boolean() }}) as boolean_col

"""

seeds__expected_csv = """boolean_col

True

False

""".lstrip()

models__actual_sql = """

select cast('True' as {{ type_boolean() }}) as boolean_col

union all

select cast('False' as {{ type_boolean() }}) as boolean_col

"""

Even better

But test cases covering the truth tables for conjunction, disjunction, and negation would be even better. That way, we are verifying that everything is acting like booleans.

Something like the following untested code:

seeds__boolean_permutations_csv = """ x,y False,False True,False False,True True,True """.lstrip() seeds__expected_csv = """ x,y,conjunction,disjunction,negation_x False,False,False,False,True True,False,False,True,False False,True,False,True,True True,True,True,True,False """.lstrip() models__actual_sql = """ select x, y, x and y as conjunction, x or y as disjunction, not x as negation_x from {{ ref("boolean_permutations" }}

Best? 🤷

Even though BOOLEAN is a data type described in the SQL standard, some databases don't have it (looking at you, SQL Server!).

If we are feeling extra magnanimous, we could change all the True/False values in the seeds to be 1/0 instead.

I'm hoping it wouldn't be necessary, but we could update the models__actual_sql definition so that x/y values are replaced with the following instead:

cast(x as {{ type_boolean() }})

cast(y as {{ type_boolean() }})

Update:

The "Even better" example I gave above didn't actually test the type_boolean() macro at all! 😅

Something like this should fix that situation:

models__actual_sql = """ select cast(x as {{ type_boolean() }}) as x_bool, cast(y as {{ type_boolean() }}) as y_bool, cast(x as {{ type_boolean() }}) and cast(y as {{ type_boolean() }}) as conjunction, cast(x as {{ type_boolean() }}) or cast(y as {{ type_boolean() }}) as disjunction, not cast(x as {{ type_boolean() }}) as negation_x from {{ ref("boolean_permutations" }}

Cheers @dbeatty10. I'll take a second pass at this later today :)

dbeatty10

@jpmmcneill I just realized that your simple pytest for type_boolean() was exactly in line with the rest of the type_x macros.

So I'm approving as-is.

After this PR is merged, then the next steps will be to restore the original dev-requirements.txt files in each of your adapter PRs.

jpmmcneill · 2022-09-22T12:25:59Z

@jpmmcneill I just realized that your simple pytest for type_boolean() was exactly in line with the rest of the type_x macros.

So I'm approving as-is.

After this PR is merged, then the next steps will be to restore the original dev-requirements.txt files in each of your adapter PRs.

Hey @dbeatty10 - thanks. I personally completely agree with the sentiment around the current level of testing :).

Do you agree with me that an issue that basically scopes "improve the current test coverage for data types" would be welcome?

dbeatty10 · 2022-09-22T13:10:23Z

Do you agree with me that an issue that basically scopes "improve the current test coverage for data types" would be welcome?

Formally logging where expectations differs from reality is an important mechanism for us. A new issue describing the current level of testing and how that compares to what your expectations were as a contributor (here) and dbt package maintainer (here) would be great!

From there, someone from dbt Labs (maybe me!) will triage the issue submission and give feedback, determine urgency, compare to current roadmap and capacity, etc.

jpmmcneill · 2022-09-22T14:15:04Z

Brill, will do. Thanks @dbeatty10 🐐

implement type_boolean macro

5df41fd

jpmmcneill requested a review from a team as a code owner September 18, 2022 21:31

jpmmcneill requested a review from McKnight-42 September 18, 2022 21:31

cla-bot bot added the cla:yes label Sep 18, 2022

jpmmcneill mentioned this pull request Sep 18, 2022

add type boolean dbt-labs/dbt-bigquery#313

Merged

6 tasks

jpmmcneill commented Sep 18, 2022

View reviewed changes

jpmmcneill marked this pull request as draft September 19, 2022 09:08

jpmmcneill mentioned this pull request Sep 19, 2022

[CT-1110] [Feature] Cross-database macro for type_boolean() #5739

Closed

3 tasks

jtcohen6 added Team:Adapters Issues designated for the adapter area of the code ready_for_review Externally contributed PR has functional approval, ready for code review from Core engineering labels Sep 19, 2022

changie result

2bb86b5

jpmmcneill mentioned this pull request Sep 19, 2022

Jpmmcneill/snowflake type boolean dbt-labs/dbt-snowflake#268

Merged

6 tasks

jpmmcneill marked this pull request as ready for review September 19, 2022 23:09

jpmmcneill requested a review from a team as a code owner September 19, 2022 23:09

jpmmcneill requested a review from iknox-fa September 19, 2022 23:09

dbeatty10 reviewed Sep 20, 2022

View reviewed changes

dbeatty10 approved these changes Sep 21, 2022

View reviewed changes

dbeatty10 merged commit b089a47 into dbt-labs:main Sep 21, 2022

This was referenced Sep 21, 2022

Update list of cross-database macros dbt-labs/docs.getdbt.com#2049

Closed

add type boolean dbt-labs/dbt-redshift#190

Merged

Jpmmcneill/spark type boolean dbt-labs/dbt-spark#471

Merged

jpmmcneill deleted the jpmmcneill/type-boolean branch September 22, 2022 12:26

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

implement type_boolean macro #5875

implement type_boolean macro #5875

jpmmcneill commented Sep 18, 2022 •

edited

Loading

jpmmcneill Sep 18, 2022

dbeatty10 commented Sep 19, 2022

dbeatty10 Sep 20, 2022

dbeatty10 Sep 20, 2022 •

edited

Loading

jpmmcneill Sep 20, 2022

dbeatty10 left a comment

jpmmcneill commented Sep 22, 2022

dbeatty10 commented Sep 22, 2022

jpmmcneill commented Sep 22, 2022

implement type_boolean macro #5875

implement type_boolean macro #5875

Conversation

jpmmcneill commented Sep 18, 2022 • edited Loading

Description

Checklist

jpmmcneill Sep 18, 2022

Choose a reason for hiding this comment

dbeatty10 commented Sep 19, 2022

dbeatty10 Sep 20, 2022

Choose a reason for hiding this comment

Even better

Best? 🤷

dbeatty10 Sep 20, 2022 • edited Loading

Choose a reason for hiding this comment

jpmmcneill Sep 20, 2022

Choose a reason for hiding this comment

dbeatty10 left a comment

Choose a reason for hiding this comment

jpmmcneill commented Sep 22, 2022

dbeatty10 commented Sep 22, 2022

jpmmcneill commented Sep 22, 2022

jpmmcneill commented Sep 18, 2022 •

edited

Loading

dbeatty10 Sep 20, 2022 •

edited

Loading