Relation name '*__dbt_tmp' is longer than 63 characters #2869

moltar · 2020-11-06T05:24:29Z

Describe the bug

My view name is pretty long, but it is less than 63 characters.

When dbt adds the __dbt_tmp suffix, it goes over the limit of 63 chars.

The following error is thrown by PostgreSQL:

Relation name 'foo__dbt_tmp' is longer than 63 characters

Steps To Reproduce

Use a view name that is 63 characters long.
dbt run

Expected behavior

To work anyways.

Screenshots and log output

See above.

System information

Which database are you using dbt with?

The output of dbt --version:

Using Docker image fishtownanalytics/dbt:0.18.1.

The operating system you're using:

Using Docker image fishtownanalytics/dbt:0.18.1.

The output of python --version:

Using Docker image fishtownanalytics/dbt:0.18.1.

Additional context

Perhaps the temporary name can be truncated by the length of the suffix to make sure it fits within the limits?

The text was updated successfully, but these errors were encountered:

jtcohen6 · 2020-11-08T19:08:02Z

Thanks for opening, @moltar! I agree, this is peskier than it could be.

Following the discussion about this over in #2850, the real max character length of a model name on Postgres is currently 51, because of the appended suffix __dbt_backup in the table and view materializations.

I think we could do as you recommend, and truncate the model name (if >51 characters) to accommodate the suffix. I view that as a reasonable next step on top of the work in #2850, which handles the truncation of uniquely suffixed identifiers in the incremental materialization.

moltar · 2020-11-08T23:34:29Z

Thank you.

I see this tagged as redshift/pg.

I'd like to note that Redshift names seem to allow names of 127 bytes long.

https://docs.aws.amazon.com/redshift/latest/dg/r_CREATE_TABLE_NEW.html

ghost · 2020-12-07T14:30:01Z

With snapshots the suffix is even longer, e.g. dbt_tmp20201206072656412865. If you add _snapshot to the relation name, not much characters are left to use for the relation name.
Please also consider snapshots when addressing this issue. Thanks:)

jtcohen6 · 2020-12-07T14:34:06Z

Heard @Marinto. Like the incremental materialization, the snapshot materialization already uses make_temp_relation, which was addressed in #2850. The question in this issue is how to best cross-apply a similar improvement to the view + table materializations, which do not use make_temp_relation.

ghost · 2020-12-07T20:23:33Z

@jtcohen6 ah I see, I wasn't aware of #2850. This is exactly what I was looking for 👍

danielefrigo · 2021-06-28T15:52:36Z

Thanks for opening, @moltar! I agree, this is peskier than it could be.

Following the discussion about this over in #2850, the real max character length of a model name on Postgres is currently 51, because of the appended suffix __dbt_backup in the table and view materializations.

I think we could do as you recommend, and truncate the model name (if >51 characters) to accommodate the suffix. I view that as a reasonable next step on top of the work in #2850, which handles the truncation of uniquely suffixed identifiers in the incremental materialization.

Regarding the proposal of truncating the model name, I see a potential overlapping issue: you could have 2 models running in parallel, with a long common prefix in the name. If we truncate the model name, the tmp tables created by dbt could end up having exactly the same name.
An alternative could be to completely loose the name link between the model name and the temp table name, using some kind of unique short identifier (e.g. an hash of the model name).

jtcohen6 · 2021-06-28T23:38:29Z

@danielefrigo A hash of the model name makes a lot of sense to me! I'd still like to keep the conventional suffixes (__dbt_backup, __dbt_tmp) as well, for clarity of ownership.

epapineau · 2022-02-28T22:50:42Z

I'm interested in tackling this as a first issue. After digging into the code some, it looks like it could potentially be handled in a couple of places:

plugins/postgres/dbt/adapters/postgres/relation.py
Add truncating before known suffixes in the PostgresRelation __post_init__ method
core/dbt/include/global_project/macros/materializations/models/view/view.sql
Conditional truncating if using the postgres adapter

Do either of these seem like an appropriate approach?

jtcohen6 · 2022-03-02T17:58:19Z

@epapineau I'd be excited to have you contribute this one!

I think having the truncation/hashing logic live inside the Postgres adapter makes the most sense. Of the two approaches you've recommended, I'd hesitate to add it to the PostgresRelation class itself, since that's used all over, even with the check for known suffixes.

My instinct would be to follow the same approach we took with make_temp_relation:

The global macro default just adds the __dbt_tmp suffix, very simple
The Postgres version includes all hashing + truncation logic

So I think the move is:

Create a make_backup_relation macro that follows the same pattern
Update the global view + table materializations to use make_temp_relation + make_backup_relation, instead of hard-coding the suffix logic themselves

moltar added bug Something isn't working triage labels Nov 6, 2020

jtcohen6 added good_first_issue Straightforward + self-contained changes, good for new contributors! redshift and removed triage labels Nov 8, 2020

samwedge mentioned this issue Nov 10, 2020

Include a unique identifier as part of the temporary relation name #2881

Closed

ChristopheDuong mentioned this issue Jan 21, 2021

Postgres hits the max table name length airbytehq/airbyte#1750

Closed

jtcohen6 added postgres and removed redshift labels Sep 28, 2021

jtcohen6 added the Team:Adapters Issues designated for the adapter area of the code label Mar 2, 2022

epapineau mentioned this issue Mar 22, 2022

Truncate relation names when appending a suffix #4921

Merged

4 tasks

jtcohen6 closed this as completed in #4921 May 19, 2022

jtcohen6 added this to the v1.2 milestone May 31, 2022

This was referenced Apr 4, 2023

dbt-oracle 1.3.1 problem "oracle adapter: Oracle error: ORA-00972: identifier is too long" on dbt snapshot oracle/dbt-oracle#62

Closed

[Bug] "Oracle error: ORA-00972: identifier is too long" with view materialization oracle/dbt-oracle#82

Closed

MattTriano mentioned this issue May 6, 2023

Remove the report schema and its models MattTriano/analytics_data_where_house#122

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Relation name '*__dbt_tmp' is longer than 63 characters #2869

Relation name '*__dbt_tmp' is longer than 63 characters #2869

moltar commented Nov 6, 2020

jtcohen6 commented Nov 8, 2020

moltar commented Nov 8, 2020

ghost commented Dec 7, 2020

jtcohen6 commented Dec 7, 2020

ghost commented Dec 7, 2020

danielefrigo commented Jun 28, 2021

jtcohen6 commented Jun 28, 2021

epapineau commented Feb 28, 2022

jtcohen6 commented Mar 2, 2022

Relation name '*__dbt_tmp' is longer than 63 characters #2869

Relation name '*__dbt_tmp' is longer than 63 characters #2869

Comments

moltar commented Nov 6, 2020

Describe the bug

Steps To Reproduce

Expected behavior

Screenshots and log output

System information

Additional context

jtcohen6 commented Nov 8, 2020

moltar commented Nov 8, 2020

ghost commented Dec 7, 2020

jtcohen6 commented Dec 7, 2020

ghost commented Dec 7, 2020

danielefrigo commented Jun 28, 2021

jtcohen6 commented Jun 28, 2021

epapineau commented Feb 28, 2022

jtcohen6 commented Mar 2, 2022