Explore if SeqDict data can be factored out more aggressively #983
Hoping to get feedback/discussion on below:
There remains a substantial chunk of repeating Contig/SequenceRecord metadata that we are carrying around within
The situation looks different though for SequenceDictionary and how Contig and mateContig continue to exist in their entirety in
, when these instead could be represented just as well in
I'm curious to know if there is a reason why we are not normalizing this Contig and mateContig data in the same way as we do for the
Also curious why the RecordGroupMetadata gets an entry in bdg.avdl
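To make the normalization being asked about concrete, here is a minimal sketch. It is not ADAM's actual schema or API: `Contig`, `DenormalizedRead`, `NormalizedRead`, `normalize`, and `resolve` are hypothetical names invented for illustration, and the field set is deliberately tiny. The idea is that each record keeps only a contig name, while the full contig metadata lives once in a shared dictionary.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional, Tuple


@dataclass(frozen=True)
class Contig:
    # Hypothetical stand-in for per-contig metadata (name, length, ...).
    name: str
    length: int


@dataclass(frozen=True)
class DenormalizedRead:
    # Every record carries full copies of its contig and mate contig.
    read_name: str
    start: int
    contig: Contig
    mate_contig: Optional[Contig]


@dataclass(frozen=True)
class NormalizedRead:
    # Every record carries only the contig names, used as lookup keys.
    read_name: str
    start: int
    contig_name: str
    mate_contig_name: Optional[str]


def normalize(
    reads: List[DenormalizedRead],
) -> Tuple[List[NormalizedRead], Dict[str, Contig]]:
    """Factor repeated contig metadata out into one shared dictionary."""
    seq_dict: Dict[str, Contig] = {}
    slim: List[NormalizedRead] = []
    for r in reads:
        # Record each distinct contig once, keyed by name.
        for c in (r.contig, r.mate_contig):
            if c is not None:
                seq_dict.setdefault(c.name, c)
        mate_name = r.mate_contig.name if r.mate_contig is not None else None
        slim.append(NormalizedRead(r.read_name, r.start, r.contig.name, mate_name))
    return slim, seq_dict


def resolve(read: NormalizedRead, seq_dict: Dict[str, Contig]) -> Contig:
    """Recover the full contig metadata for a normalized record."""
    return seq_dict[read.contig_name]
```

The trade-off in this sketch is that a normalized record is no longer self-describing: anything that needs the contig length has to have the dictionary on hand as well.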
I would +1 that change. (And the same change for all other records, e.g., Genotype, Variant, etc.)
I don't remember entirely why I prioritized the
I'm still not entirely happy with the separate SequenceDictionary/RecordGroupDictionary stuff in our API; for some context expand the comments in closed PR #906.
A lot of our API is based on