Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix(ingestion/clickhouse) move to two tier sqlalchemy #8300

Conversation

alplatonov
Copy link
Contributor

Checklist

There 2 big improvements:

  1. ClickHouseConfig inherits TwoTierSQLAlchemyConfig, ClickHouseSource inherit TwoTierSQLAlchemySource
  2. Added uri_opt params, old uri_opts cannot be changed or extended via config.

@github-actions github-actions bot added docs Issues and Improvements to docs ingestion PR or Issue related to the ingestion of metadata labels Jun 26, 2023
Copy link
Collaborator

@mayurinehate mayurinehate left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Could you please also update integration tests / golden files with respect to the changes ?

@anshbansal anshbansal added the community-contribution PR or Issue raised by member(s) of DataHub Community label Jun 28, 2023
…qlalchemy

# Conflicts:
#	docs/how/updating-datahub.md
@maggiehays maggiehays added the pending-submitter-response Issue/request has been reviewed but requires a response from the submitter label Jul 17, 2023
docs/how/updating-datahub.md Outdated Show resolved Hide resolved


@freeze_time(FROZEN_TIME)
@pytest.mark.integration
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thank for adding the test. This is not the test I was looking for. I was looking for unit test similar to test_clickhouse_uri_native_secure in test_clickhouse_source.py for clickhouse secure and protocol config that demonstrate backward compatibility if these configs are used.

The current test looks good too, however, I believe the golden files are exactly same as existing clickhouse_mces_golden.json, except the run_id. I'd suggest using the same golden files in check_golden_file by using same run_id in new sqlalchemy uri recipe, rather than adding a duplicate 99% same golden file.

Copy link
Collaborator

@mayurinehate mayurinehate left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm.

metadata-ingestion/tests/unit/test_clickhouse_source.py Outdated Show resolved Hide resolved
metadata-ingestion/tests/unit/test_clickhouse_source.py Outdated Show resolved Hide resolved
alplatonov and others added 2 commits August 11, 2023 09:40
Co-authored-by: Mayuri Nehate <33225191+mayurinehate@users.noreply.github.com>
Co-authored-by: Mayuri Nehate <33225191+mayurinehate@users.noreply.github.com>
@asikowitz
Copy link
Collaborator

Merging as should not be related to smoke test failure

@asikowitz asikowitz merged commit 11fdfcf into datahub-project:master Aug 11, 2023
43 of 44 checks passed
yoonhyejin pushed a commit that referenced this pull request Aug 24, 2023
Co-authored-by: Mayuri Nehate <33225191+mayurinehate@users.noreply.github.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
community-contribution PR or Issue raised by member(s) of DataHub Community docs Issues and Improvements to docs ingestion PR or Issue related to the ingestion of metadata pending-submitter-response Issue/request has been reviewed but requires a response from the submitter
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

5 participants