Prod 411 hui adobe parser error databricks on azure #11
Kudos, SonarCloud Quality Gate passed!
# Sometimes an SQL event will be missing. To be informative, both
# events must be present. But this information is not critical, so
# if either event is missing then simply reject the SQL data
if "start_time" not in sql.keys() or "end_time" not in sql.keys():
Not critical, but just to understand:
Could you check just if "end_time" not in sql.keys()? This assumes that if an end_time exists, then a start_time exists as well.
This won't work because "start_time" and "end_time" come from separate events in the Spark eventlog. So it's possible for one, or the other, or both to be missing.
Ah, I see.
NKSync
left a comment
Looks good to me. Nice 👍
rmoneys
left a comment
Looks great!








Sometimes the SQL Submission or Completion event is missing from the eventlog, which causes the parser to throw an error. However, these events are not critical for the predictor run. This solution bypasses the problem by discarding the SQL data for any query for which either the Submission or the Completion event is missing.
Also added a test for this case which indicates the effect on the sqlData dataframe.
Also modified version.
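
For illustration, here is a minimal sketch of the idea described above, not the parser's actual code. It assumes the start and end times are collected per execution ID from two separate Spark listener events and that incomplete entries are then dropped. The helper names `collect_sql` and `complete_sql`, and the simplified event dictionaries, are hypothetical.

```python
from collections import defaultdict

# Event names as they appear in a Spark eventlog (the sample events below are
# simplified; real events carry many more fields).
START = "org.apache.spark.sql.execution.ui.SparkListenerSQLExecutionStart"
END = "org.apache.spark.sql.execution.ui.SparkListenerSQLExecutionEnd"


def collect_sql(events):
    """Group start/end times by execution ID; either key may end up absent."""
    sql_by_id = defaultdict(dict)
    for event in events:
        if event.get("Event") == START:
            sql_by_id[event["executionId"]]["start_time"] = event["time"]
        elif event.get("Event") == END:
            sql_by_id[event["executionId"]]["end_time"] = event["time"]
    return dict(sql_by_id)


def complete_sql(sql_by_id):
    """Keep only the executions that have both a start and an end time."""
    return {
        sql_id: sql
        for sql_id, sql in sql_by_id.items()
        if "start_time" in sql and "end_time" in sql
    }


events = [
    {"Event": START, "executionId": 0, "time": 100},
    {"Event": END, "executionId": 0, "time": 250},
    {"Event": START, "executionId": 1, "time": 300},  # end event missing
    {"Event": END, "executionId": 2, "time": 400},    # start event missing
]

kept = complete_sql(collect_sql(events))
print(kept)
```

Because the two timestamps arrive independently, checking only for "end_time" would still let execution 2 through with no "start_time", which is why both keys are tested.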