New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Subsetting the SharedRaw_Pool #17
Comments
I think you could do a |
Yep, this is the consequence of the copy-less semantic of
The sequences you "see" in your DNAStringSet object are just (internal) views on the pool of string data. And since the pool of string data is a list of external pointers to raw vectors, the entire thing gets serialized :-/ So before you serialize it, pass your object thru
@lawremi The problem with
Hope this helps. |
Thanks Herve, as(as.character(sub.X), class(X)) |
Parts of my workflow involve taking a subsequence and saving it to file in a RDS object. However, the way that
subseq
works on theSharedRaw_Pool
means that the entire sequence is still in the object:I understand the rationale for this - I do a very similar thing in InteractionSet - but is there a function to reconstruct the pool from only the parts of sequence that are in use? This will reduce the size of the objects being passed around, which would make life a lot easier for analyzing long-read sequencing data.
Session information
The text was updated successfully, but these errors were encountered: