-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix(sdk/dataflow): deprecate cluster and use env and platform_instance instead #8201
fix(sdk/dataflow): deprecate cluster and use env and platform_instance instead #8201
Conversation
@@ -35,11 +52,24 @@ class DataFlow: | |||
url: Optional[str] = None | |||
tags: Set[str] = field(default_factory=set) | |||
owners: Set[str] = field(default_factory=set) | |||
platform_instance: Optional[str] = None | |||
env: Optional[str] = None |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Previously we used cluster
as the env value - why do we have both cluster and env now? I don't think we need this new instance variable
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Plus this is a breaking change and not deprecation. This needs to be documented in the PR doc. Plus we should not be changing the default as part of deprecation as that will cause existing deployments to have problems due to possible change of URNs.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This isn't a breaking change yet, just a deprecation of cluster
in favor of env
+ addition of platform_instance
However, we still need to make some changes:
- the deprecation should still be noted in the PR description + our updating datahub doc
- we should be printing a deprecation warning if folks use the old
cluster
param - we should throw an error if both env and cluster are provided
- all users of this class within datahub should be updated to use
env
- I believe airflow is one, but there might be more
metadata-ingestion/src/datahub/api/entities/datajob/dataflow.py
Outdated
Show resolved
Hide resolved
@@ -35,11 +52,24 @@ class DataFlow: | |||
url: Optional[str] = None | |||
tags: Set[str] = field(default_factory=set) | |||
owners: Set[str] = field(default_factory=set) | |||
platform_instance: Optional[str] = None | |||
env: Optional[str] = None |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This isn't a breaking change yet, just a deprecation of cluster
in favor of env
+ addition of platform_instance
However, we still need to make some changes:
- the deprecation should still be noted in the PR description + our updating datahub doc
- we should be printing a deprecation warning if folks use the old
cluster
param - we should throw an error if both env and cluster are provided
- all users of this class within datahub should be updated to use
env
- I believe airflow is one, but there might be more
metadata-ingestion/src/datahub/api/entities/datajob/dataflow.py
Outdated
Show resolved
Hide resolved
metadata-ingestion/src/datahub/api/entities/datajob/dataflow.py
Outdated
Show resolved
Hide resolved
metadata-ingestion/src/datahub/api/entities/datajob/dataflow.py
Outdated
Show resolved
Hide resolved
metadata-ingestion/src/datahub/api/entities/datajob/dataflow.py
Outdated
Show resolved
Hide resolved
metadata-ingestion/src/datahub/api/entities/datajob/dataflow.py
Outdated
Show resolved
Hide resolved
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Deployment failed with the following error:
|
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
…e instead (datahub-project#8201) Co-authored-by: mohdsiddique <mohdsiddiquebagwan@gmail.com> Co-authored-by: Harshal Sheth <hsheth2@gmail.com>
Deprecation
In Dataflow, cluster argument is deprecated. Please use env instead.
Checklist