Inconsistent datasets name case in UI #9276
Comments
@remisalmon I have a hypothesis for this, related to #9227. If you re-run the Snowflake/Postgres ingestion, does this problem appear to go away?
Hi @hsheth2, thanks for taking a look! Yes, this does go away when I re-run the ingestion (even with stateful ingestion enabled). This is with:
Hi! We have the same issue! DataHub CLI version: 0.12.0.3. But when we re-run the ingestion, the issue is not solved. Any suggestions? Thanks!
This issue is stale because it has been open for 30 days with no activity. If you believe this is still an issue on the latest DataHub release please leave a comment with the version that you tested it with. If this is a question/discussion please head to https://slack.datahubproject.io. For feature requests please use https://feature-requests.datahubproject.io
We are also seeing the same issue on Snowflake, and it doesn't disappear when re-running the ingestion. DataHub version: 0.12.1.3
@JoakimNil that's surprising! Can we get some more details about your setup? Are you using quickstart or a helm deployment? Have you tweaked any settings, e.g. GMS replicas, async ingest, standalone mae/mce consumers? Edit: also, it would help if you could use a file sink and send us the resulting JSON (can be shared in a private DM in Slack if needed). I mainly want to look at the
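For anyone who wants to check the file sink output locally before sharing it, here is a minimal sketch. It assumes the file is a JSON document in which dataset URNs appear as plain string values of the form `urn:li:dataset:(urn:li:dataPlatform:<platform>,<name>,<env>)`. The helper names (`iter_dataset_urns`, `dataset_name`, `split_by_case`) are made up for this example and are not part of the DataHub API. URNs embedded inside serialized aspect strings would not be found by this walk.

```python
import json

DATASET_URN_PREFIX = "urn:li:dataset:"


def iter_dataset_urns(node):
    """Yield every dataset URN string found anywhere in a parsed JSON value."""
    if isinstance(node, str):
        if node.startswith(DATASET_URN_PREFIX):
            yield node
    elif isinstance(node, dict):
        for value in node.values():
            yield from iter_dataset_urns(value)
    elif isinstance(node, list):
        for item in node:
            yield from iter_dataset_urns(item)


def dataset_name(urn):
    """Extract the <name> part, e.g. 'db.schema.table', from a dataset URN."""
    body = urn[len(DATASET_URN_PREFIX):].strip("()")
    # body looks like '<platform urn>,<name>,<env>'
    return body.split(",", 1)[1].rsplit(",", 1)[0]


def split_by_case(records):
    """Return (lowercase_urns, other_urns), each sorted and de-duplicated."""
    lower, other = set(), set()
    for urn in iter_dataset_urns(records):
        name = dataset_name(urn)
        (lower if name == name.lower() else other).add(urn)
    return sorted(lower), sorted(other)


def load_records(path):
    """Load the file sink output (assumed to be a single JSON document)."""
    with open(path, encoding="utf-8") as handle:
        return json.load(handle)
```

With `convert_urns_to_lowercase: true`, one would expect the second list returned by `split_by_case(load_records(path))` to be empty. Any URNs listed there are candidates for the uppercase entries seen in the UI.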
@hsheth2 We're using quickstart, with all default settings. The only thing that's changed is adding the Snowflake source, with this config:

    source:
      type: snowflake
      config:
        account_id: [REMOVED]
        warehouse: COMPUTE_WH
        username: datahub
        password: '${SNOWFLAKE_DATAHUB_PASSWORD}'
        incremental_lineage: true
        profiling:
          enabled: false
        stateful_ingestion:
          enabled: true

I will send you the resulting JSON in a DM in Slack.
Describe the bug
The DataHub UI shows a (random?) mix of uppercase and lowercase dataset names for the same database/schema.
See the attached screenshot where Snowflake has only 1 SNOWPIPE.PUBLIC database/schema but DataHub shows 2 of those. The tables in this schema are split between those two.
To Reproduce
Steps to reproduce the behavior:
convert_urns_to_lowercase: true)
Expected behavior
The DataHub UI should not split datasets between uppercase and lowercase names if their Snowflake identifiers are not explicitly uppercase or lowercase.
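To illustrate the symptom, here is a small sketch of how grouping tables by an exact-case `database.schema` key yields two groups, while a case-folded key yields one. The table names are hypothetical and the grouping logic is only an illustration, not how DataHub builds its browse paths.

```python
from collections import defaultdict


def schema_key(dataset_name, fold_case):
    """Return the 'database.schema' prefix of a 'database.schema.table' name."""
    database, schema, _table = dataset_name.split(".", 2)
    key = f"{database}.{schema}"
    return key.lower() if fold_case else key


def group_by_schema(dataset_names, fold_case):
    """Group dataset names by their schema key."""
    groups = defaultdict(list)
    for name in dataset_names:
        groups[schema_key(name, fold_case)].append(name)
    return dict(groups)


# Hypothetical names, mixing the two cases seen in the UI.
names = [
    "SNOWPIPE.PUBLIC.ORDERS",
    "snowpipe.public.events",
    "snowpipe.public.users",
]

exact = group_by_schema(names, fold_case=False)
folded = group_by_schema(names, fold_case=True)
```

Here `exact` has two keys (`SNOWPIPE.PUBLIC` and `snowpipe.public`) and `folded` has a single `snowpipe.public` key holding all three tables, which is the grouping the expected behavior describes.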
Snowflake query:
returns
(21 = 13+8 in the screenshot...)
Screenshots
Desktop (please complete the following information):