New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feat: Onboard Mimiciii dataset #449
Conversation
This PR includes 2 activities w.r.t. the dataset -
|
Use the BQ data transfer service. There are examples in the repo that you can just reuse: |
Also, please rename to mimic_iii |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please use BQ data transfer and rename to mimic_iii
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please remove extra lines
"Fetching the source tables from bq. Each pipeline will be undergoing ETL" | ||
) | ||
source_table_names = fetch_source_tables(source_project, source_dataset) | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please remove extra lines
Closing this pull request as there is a new one, and this one is obsolete. |
Description
This is to onboard mimiciii dataset with 25 pipelines using Airflow v2 operators only.
Checklist
Use the sections below based on what's applicable to your PR and delete the rest:
Feature
README
accordinglyData Onboarding
datasets/mimiciii
and nothing outside of that directoryCode cleanup or refactoring