fix(siblingsHook): add logic to account for non dbt upstreams #9154

Merged
merged 5 commits into from Dec 4, 2023

Contributor

@ethan-cartwright ethan-cartwright commented Oct 31, 2023

Description

Right now, when a dataset's upstream lineage contains no dbt upstreams (for example kafka -> snowflake), dbtUpstreams.size() is zero and we print log.error("{} has an unexpected number of dbt upstreams: {}. Not adding any as siblings.", sourceUrn.toString(), dbtUpstreams.size());. This is spamming MAE logs.

The desired behavior is for this error to be logged only when there are multiple dbt upstreams, since that is the case that is actually unexpected.

This PR makes it so that non-dbt upstream lineage is ignored in handleSourceDatasetEvent.
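
For reviewers, here is a minimal, self-contained sketch of the intended behavior. It is not the actual hook code: the class SiblingUpstreamSketch, its methods, and the use of plain URN strings are hypothetical stand-ins chosen for illustration. It assumes a dataset URN embeds its platform as urn:li:dataPlatform:<name>. Only the log message text is taken from the description above.

```java
import java.util.List;
import java.util.Optional;
import java.util.stream.Collectors;

// Hypothetical stand-in for the sibling hook logic; not DataHub's real classes.
class SiblingUpstreamSketch {
  static final String DBT_PLATFORM = "dbt";
  private static final String PLATFORM_MARKER = "urn:li:dataPlatform:";

  // Extracts the platform name from a dataset URN string such as
  // urn:li:dataset:(urn:li:dataPlatform:dbt,my_model,PROD).
  // Returns an empty string when no platform marker is present.
  static String platformOf(String datasetUrn) {
    int start = datasetUrn.indexOf(PLATFORM_MARKER);
    if (start < 0) {
      return "";
    }
    start += PLATFORM_MARKER.length();
    int end = datasetUrn.indexOf(',', start);
    return end < 0 ? datasetUrn.substring(start) : datasetUrn.substring(start, end);
  }

  // Keeps only upstreams on the dbt platform; kafka, snowflake, etc. are ignored.
  static List<String> dbtUpstreams(List<String> upstreamUrns) {
    return upstreamUrns.stream()
        .filter(urn -> DBT_PLATFORM.equals(platformOf(urn)))
        .collect(Collectors.toList());
  }

  // Returns the dbt sibling when there is exactly one dbt upstream.
  // Zero dbt upstreams is treated as normal and produces no log output;
  // more than one is unexpected and is the only case that logs an error.
  static Optional<String> pickDbtSibling(String sourceUrn, List<String> upstreamUrns) {
    List<String> dbt = dbtUpstreams(upstreamUrns);
    if (dbt.size() == 1) {
      return Optional.of(dbt.get(0));
    }
    if (dbt.size() > 1) {
      System.err.printf(
          "%s has an unexpected number of dbt upstreams: %d. Not adding any as siblings.%n",
          sourceUrn, dbt.size());
    }
    return Optional.empty();
  }
}
```

With this shape, a kafka -> snowflake edge yields an empty result silently, a single dbt upstream is returned as the sibling candidate, and only the multiple-dbt case reaches the error log.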

Checklist

  • The PR conforms to DataHub's Contributing Guideline (particularly Commit Message Format)
  • Links to related issues (if applicable)
  • Tests for the changes have been added/updated (if applicable)
  • Docs related to the changes have been added/updated (if applicable). If a new feature has been added a Usage Guide has been added for the same.
  • For any breaking change/potential downtime/deprecation/big changes an entry has been made in Updating DataHub

Collaborator

@pedro93 pedro93 left a comment

I'm confused how this PR accounts for non DBT upstreams. Can you add some details in the PR description?

@ethan-cartwright
Contributor Author

> I'm confused how this PR accounts for non DBT upstreams. Can you add some details in the PR description?

Updated! Thanks

@pedro93
Collaborator

pedro93 commented Nov 30, 2023

Thank you for clarifying

Collaborator

@anshbansal anshbansal left a comment

Lint needs to be fixed

@anshbansal anshbansal merged commit 7857944 into datahub-project:master Dec 4, 2023
32 of 34 checks passed

3 participants