-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Check the output in light of Reece's comments #1
Comments
Hi Jules- The basic structure of Model.kt is understandable and looks right to me. So far, so good I think. You have a comment "How's this supposed to work?". I think you're referring to the issue that in order to generate a Computed Identifier for a Location, one needs the CI for the referenced sequence. And, since VMC doesn't specify sequences, where do those come from? Right? To start the answer, see this notebook. You'll see this: ir = models.Identifier(namespace="NCBI", accession="NC_000019.10") Okay, so what should get_vmc_sequence_id() do? The naive implementation would be to fetch the sequence, run it through the truncated digest, and construct an Identifier. Obviously, that's going to stink for large sequences and/or a lot of variants. But, that's the base implementation. An improvement on that would be to keep a 1:1 map of VMC ids and external identifiers. That is, cache the lookup. (Reversible is nice so that variants can emitted with familiar sequence ids.) I use SeqRepo, a package I wrote to efficiently maintain and update a superset of essentially all sequences. (You might be interested in the design doc.) I have ambitions for a REST interface, but, alas, it's just a dream. Does that answer your question? And, since you're actually banging your head on this, please give feedback... especially if you see something that's ill-conceived. |
Thanks for the feedback, especially on the sequence id part. There are a few issues I though of/encountered.
|
Adding to the above, have you tried doing a round-trip from VCF->VMC->VCF on a genome? |
See/follow ga4gh/vrs#4 for vcf->vmc. I hadn't thought about vmc->vcf, but that's an interesting idea. |
ga4gh/vrs#11 (comment)
The text was updated successfully, but these errors were encountered: