-
Notifications
You must be signed in to change notification settings - Fork 59
Adds an option to split annotation fields. #107
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
Pull Request Test Coverage Report for Build 294
💛 - Coveralls |
arostamianfar
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks, Bashir! I looked at the high level design and I a few concerns:
- We should not modify vcfio. We should keep vcfio as a very low level transform (think of it as just a text reader). Any higher level semantics should be done outside.
- I think we should decouple annotations from the main flow as much as possible so that it's easy to change it later on. Is it possible to add a PTransform for this?
- The design should be flexible enough to support multiple annotation fields.
Let's discuss this in person when you're here. I think we need a short design doc now that you have the prototype working.
| _JSON_CONCATENATION_OVERHEAD_BYTES = 5 | ||
|
|
||
|
|
||
| # TODO(bashir2): Using type identifiers like ``HeaderFields`` does not seem |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yeah, the type identifiers have been on my mind for some time. I inherited it from Beam conventions, but have not actually set up automated documentation generation to verify that it finds the correct class.
I think the 'correct' way is something like :class:`HeaderFields`. In any case, I filed Issue #108 for this. You can remove the TODO from here as it's awkward to single out this particular field.
|
To document follow-up discussion on the design decisions:
|
|
This was superseded by PR #109 . |
Please note that some more unit-tests need to be added and there are a few TODOs (some for follow-up PRs), but this is in a reasonable shape for the first pass review.
Tested:
Added some unit-tests and one integration test (valid_4_2_VEP.json).
More interesting query: https://bigquery.cloud.google.com/results/gcp-variant-transforms-test:bquijob_7d798216_1616efded18
Issue #81