[CT-112] Better UX for macro dispatch #4646

jtcohen6 · 2022-01-31T10:33:47Z

Big idea

We're moving, slowly but surely, in the direction of sensible defaults for macro dispatch. Now that we've established conventions, let's find ways to make it easier/quicker to write.

Take dbt_utils.hash for example. This has to include a lot of supporting code to do something that's ultimately pretty simple:

{% macro hash(field) -%}
  {{ return(adapter.dispatch('hash', 'dbt_utils') (field)) }}
{%- endmacro %}

{% macro default__hash(field) -%}
    md5(cast({{field}} as {{dbt_utils.type_string()}}))
{%- endmacro %}

{% macro bigquery__hash(field) -%}
    to_hex({{dbt_utils.default__hash(field)}})
{%- endmacro %}

What are the problems here?

The first macro, which dispatches a macro by its own name, is redundant / boilerplate. Could we support dispatch for all macros, and make this the default behavior?
The need to specify the default__ adapter decorator is unintuitive for folks who are looking to take advantage of dispatch's package override capabilities, and who might be unfamiliar with its history as (primarily) a way to support cross-database macros

Imagine the same macro, like:

{% macro hash(field) -%}
    md5(cast({{field}} as {{dbt_utils.type_string()}}))
{%- endmacro %}

{% macro bigquery__hash(field) -%}
    to_hex({{dbt_utils.default__hash(field)}})
{%- endmacro %}

The implied (default) behavior is that a macro named hash, in the package named dbt_utils:

Is going to search for macros named hash in the dbt_utils namespace
Is itself the default__ implementation of hash

The prime beneficiaries of this change would be package maintainers (+ us, as global_project maintainers).

There's two additional considerations we need to make, for "special kinds of macros": materializations and generic tests.

Materializations

Adapter-specific materializations work differently from adapter-specific macros. Namely:

{% materialization table, adapter = 'snowflake' %} creates a macro named materialization_table_snowflake, rather than (as one would expect) snowflake__materialization_table
"Parent" adapter inheritance is not supported for materializations, as it is for all other macros. This causes confusion for external adapter maintainers: re-implement sqlserver test materialization microsoft/dbt-synapse#74 (comment), Cleanup unnecessary codes by the inheritance of dbt-spark any more. databricks/dbt-databricks#40 (comment)

Could we make materialization macro generation / discovery work more like dispatch? We'd need to offer backwards compatibility / avoiding breaking changes for folks who currently rely on the materialization_<name>_<adapter> construction.

One other thing: I actually like the syntax for defining adapter-specific materializations. (It's important to have the adapter__-prefixed version available, in order to call from other macros.) As part of that reconciliation, could we support this for other macros, too? Same example as above:

{% macro hash(field) -%}
    md5(cast({{field}} as {{dbt_utils.type_string()}}))
{%- endmacro %}

{% macro hash(field), adapter = 'bigquery' -%}
    to_hex({{dbt_utils.default__hash(field)}})
{%- endmacro %}

Better, no?

And then even:

macros:
    # this is what we'd actually want
  - name: hash 
    description: "Hash macro"

  - name: bigquery__hash   # don't love this
    description: "Special note about the BigQuery implementation"

  - name: hash
    adapter: bigquery   # maybe this?? do we need it?
    description: "Special note about the BigQuery implementation"

Generic tests

Tests aren't dispatch-able at all today, so it's necessary to do something like:

{% test at_least_one(model, column_name) %}
  {{ return(adapter.dispatch('test_at_least_one', 'dbt_utils')(model, column_name)) }}
{% endtest %}

{% macro default__test_at_least_one(model, column_name) %}

If we pursue either/both of the ideas presented above, I think we'd be in good shape for tests, too:

If we made the changes proposed above—dispatch was the implied/default behavior for all macros, and it was consistent for "special macros"—then creating a macro named bigquery__test_at_least_one should "just work"
Now imagine that, but with the nicer syntax of {% test at_least_one(model, column_name), adapter = 'bigquery' %}

The text was updated successfully, but these errors were encountered:

github-actions · 2022-07-31T02:14:40Z

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please remove the stale label or comment on the issue, or it will be closed in 7 days.

github-actions · 2022-08-08T02:14:02Z

Although we are closing this issue as stale, it's not gone forever. Issues can be reopened if there is renewed community interest; add a comment to notify the maintainers.

jtcohen6 · 2022-09-13T09:21:38Z

I'm going to un-stale this, I'd still like to do it someday :)

jeremyyeo · 2022-09-13T21:21:16Z

Following this thread - some global_project macros are not dispatched today¹ which could be confusing to some end users when they try and follow our docs on "Overriding global macros" (https://docs.getdbt.com/reference/dbt-jinja-functions/dispatch#overriding-global-macros).

Like should_store_failures() ↩

dataders · 2022-09-16T14:21:58Z

Like should_store_failures() ↩

You raise a great point. Virtually all of my PRs to dbt-core are to do exactly this. However, we don't normally see a need for this macro to be overridden, as there's not actually SQL code in there.

Can you explain the user's usecase for wanting to override it?
Does the user know they can just put a macro with the same name as the one they want to override within a .sql file in their macros/ dir?

jeremyyeo · 2022-09-20T08:43:46Z

TL;DR: All macros should be dispatch for the sake of consistency - I think this makes it "better UX".

@dataders - My 2c is that it's irrelevant what the use-case is for wanting to override a global_project macro (perhaps for argument sake we can say the end user wants to add some var('STORE_OR_NOT') or something to this here macro in order to control it's behaviour).

I shared this internally:

Just checking on the behaviour of dispatching in dbt-core. Going by docs - all (?) built-in macro behaviour should be overridable by importing them from packages yes? I did find some instances where this isn't true, for example should_store_failures() does not seem to have adapter.dispatch in core - this means that to import the behaviour, you need to add a dispatch macro into your main dbt project (or simply return shared_lib.should_store_failures() I suppose). For generate_x_name you don't need to do "extra work" in your main dbt project because dbt-core has those as dispatched.

Does the user know they can just put a macro with the same name as the one they want to override within a .sql file in their macros/ dir?

Yea 100%. In this scenario, a team leader wants to write a "shared_lib" with overriden "core macros" like generate_x_name(), ref(), should_store_failures(), etc so that all other projects follow some certain pattern. This shared_lib dbt package is imported into many other dbt projects via the packages.yml file.

Now as the end user of the "shared_lib" package, one has to remember that... "oh for should_store_failures, I need to remember to add this dispatch thing into my own dbt project but I don't need to do that for generate_x_name overrides" - which is probably unnecessary cognitive burden.

So at the end of it all, I think all core macros should have the same pattern wrt overriding them in root projects without needing to think of when you'd need to also add some dispatching code in your root project.

Although Jerco has mentioned "special kinds of macros" - so perhaps differentiating between macros that are dispatched and not dispatched by default is appropriate? I'm not sure what category to put this should_store_failures macro but just for simplicity I'm voting for making all macros the same way wrt overriding :P

dbt-labs/dbt-core#4646

github-actions · 2023-03-21T01:53:02Z

This issue has been marked as Stale because it has been open for 180 days with no activity. If you would like the issue to remain open, please comment on the issue or else it will be closed in 7 days.

github-actions · 2023-03-28T01:56:24Z

Although we are closing this issue as stale, it's not gone forever. Issues can be reopened if there is renewed community interest. Just add a comment to notify the maintainers.

jtcohen6 added enhancement New feature or request Team:Language Team:Adapters Issues designated for the adapter area of the code labels Jan 31, 2022

github-actions bot changed the title ~~Better UX for macro dispatch~~ [CT-112] Better UX for macro dispatch Jan 31, 2022

jtcohen6 mentioned this issue Apr 5, 2022

Make internal macros use macro dispatch to be overridable in child adapters dbt-labs/dbt-spark#320

Merged

4 tasks

volkangurel mentioned this issue Jun 8, 2022

Add inheritance to materialization macro resolution #5348

Merged

6 tasks

jtcohen6 mentioned this issue Jul 7, 2022

CT-808 grant adapter tests #5447

Merged

6 tasks

github-actions bot added the stale Issues that have gone stale label Jul 31, 2022

github-actions bot closed this as completed Aug 8, 2022

jtcohen6 reopened this Sep 13, 2022

jtcohen6 removed the stale Issues that have gone stale label Sep 13, 2022

sdebruyn added a commit to microsoft/dbt-synapse that referenced this issue Sep 24, 2022

remove extra test materialization

fe20c57

dbt-labs/dbt-core#4646

github-actions bot added the stale Issues that have gone stale label Mar 21, 2023

github-actions bot closed this as not planned Won't fix, can't repro, duplicate, stale Mar 28, 2023

jtcohen6 mentioned this issue Apr 2, 2023

New command: dbt clone #7258

Closed

9 tasks

stu-k mentioned this issue May 8, 2023

[CT-2543] Execution <> Adapters: Cloning from production #7549

Closed

jtcohen6 mentioned this issue Jun 6, 2023

[SPIKE] [CT-2650] Materialization macros should support dispatch #7799

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[CT-112] Better UX for macro dispatch #4646

[CT-112] Better UX for macro dispatch #4646

jtcohen6 commented Jan 31, 2022 •

edited

Loading

github-actions bot commented Jul 31, 2022

github-actions bot commented Aug 8, 2022

jtcohen6 commented Sep 13, 2022

jeremyyeo commented Sep 13, 2022

dataders commented Sep 16, 2022

jeremyyeo commented Sep 20, 2022

github-actions bot commented Mar 21, 2023

github-actions bot commented Mar 28, 2023

[CT-112] Better UX for macro dispatch #4646

[CT-112] Better UX for macro dispatch #4646

Comments

jtcohen6 commented Jan 31, 2022 • edited Loading

Big idea

Materializations

Generic tests

github-actions bot commented Jul 31, 2022

github-actions bot commented Aug 8, 2022

jtcohen6 commented Sep 13, 2022

jeremyyeo commented Sep 13, 2022

Footnotes

dataders commented Sep 16, 2022

jeremyyeo commented Sep 20, 2022

github-actions bot commented Mar 21, 2023

github-actions bot commented Mar 28, 2023

jtcohen6 commented Jan 31, 2022 •

edited

Loading