Feat: Onboard Human Variant Annotation dataset #438

vijay-google · 2022-08-09T10:09:30Z

Description

Dataset: human_variant_annotation pipeline: clinvar
Dataset: human_variant_annotation pipeline: db_snp

Checklist

(Required) This pull request is appropriately labeled
Please merge this pull request after it's approved
I'm adding or editing a dataset
The Google Cloud Datasets team is aware of the proposed dataset
I put all my code inside datasets/human_variant_annotation and nothing outside of that directory

nlarge-google

Minor changes. Please make the changes and retest.

nlarge-google · 2022-08-10T15:24:33Z

datasets/human_variant_annotation/pipelines/_images/run_csv_transform_kub/csv_transform.py

+    gcs_bucket: str,
+    target_gcs_folder: str,
+    pipeline: str,
+):


Always annotate the return type. For a function that returns nothing, use:

def fn(...) -> None:
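
As a minimal sketch of that convention (not the code from this PR), the two cases could look like the following. The parameter names `gcs_bucket`, `target_gcs_folder`, and `pipeline` are borrowed from the diff above; the function names `build_target_path` and `log_upload`, and their bodies, are hypothetical and exist only for illustration.

```python
import pathlib


def build_target_path(target_gcs_folder: str, file_name: str) -> str:
    """Hypothetical helper: returns a value, so the return type is annotated as str."""
    return f"{target_gcs_folder.rstrip('/')}/{file_name}"


def log_upload(
    source_file: pathlib.Path,
    gcs_bucket: str,
    target_gcs_folder: str,
    pipeline: str,
) -> None:
    """Hypothetical helper: only prints, returns nothing, so the annotation is None."""
    target = build_target_path(target_gcs_folder, source_file.name)
    print(f"[{pipeline}] {source_file} -> gs://{gcs_bucket}/{target}")
```

With the explicit `-> None`, a type checker can flag callers that try to use the result of `log_upload`, which an unannotated signature would not catch.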

nlarge-google · 2022-08-10T15:25:07Z

datasets/human_variant_annotation/pipelines/_images/run_csv_transform_kub/csv_transform.py

+    source_url = base_url + f"archive_{version}/{date_time.strftime('%Y')}/{file_name}"
+    source_file = f"./files/{folder}/{file_name}"
+    status_code = download_gzfile(source_url, source_file)
+    if status_code == 200:


nlarge-google · 2022-08-10T15:25:30Z

datasets/human_variant_annotation/pipelines/_images/run_csv_transform_kub/csv_transform.py

+    folder: pathlib.Path,
+    gcs_bucket: str,
+    target_gcs_folder: str,
+):


Same here: annotate the return type as None.

nlarge-google

Great job. In the future, though, please reduce the number of blank lines between code blocks to zero. Thanks!

vijay-google added 5 commits August 8, 2022 18:43

pipeline.yaml files are ready for clinvar & db_snp pipelines

7a7128e

fix: yamllint errors

8909da6

feat: Onboard Human Variant Annotation dataset

a75a989

fix: function annotation for csv_transform.py

899fdcb

ready for PR

8db3422

vijay-google requested review from nlarge-google and adlersantos August 9, 2022 10:09

vijay-google self-assigned this Aug 9, 2022

vijay-google added the "data onboarding" label (Onboard a dataset or submit a pipeline) Aug 9, 2022

vijay-google changed the title ~~Human variant annotation~~ Feat: Onboard Human Variant Annotation dataset Aug 9, 2022

nlarge-google requested changes Aug 10, 2022

fix: function annotation

fb76413

nlarge-google approved these changes Aug 25, 2022

nlarge-google merged commit ebfe4de into GoogleCloudPlatform:main Aug 25, 2022

release-please bot mentioned this pull request Aug 25, 2022

chore(main): release 5.2.0 #435

Merged
