-
Notifications
You must be signed in to change notification settings - Fork 2.9k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
fix(ingest): respect max_threads for ingestion reporter #8521
Conversation
@@ -42,7 +42,9 @@ def create( | |||
elif config_dict is None: | |||
raise ConfigurationError("Missing provider configuration.") | |||
else: | |||
provider_config = DatahubIngestionStateProviderConfig.parse_obj(config_dict) | |||
provider_config = ( | |||
DatahubIngestionStateProviderConfig.parse_obj_allow_extras(config_dict) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@hsheth2, Thank you for the fix! I'm wondering since max_thread
is also used in DatahubRestSinkConfig
, which I think the one that handles the rest sink, should parse_obj_allow_extras
also be called when creating DatahubRestSinkConfig
?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That one actually should be fine as-is, since that class has a max threads option
The main fix here is the removal of that pydantic field removed validator, which was deleting the user provided max threads config before it got parsed
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you!
Merging through unrelated CI failures. |
Slack ref: https://datahubspace.slack.com/archives/CUMUWQU66/p1690491969370859
Checklist