New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Best way to represent the protein data #20
Comments
I would have thought Do you want a single data structure for all the points above? What do you mean by protein domain ID? Functional domains, or transcript exons start/end sites? |
regarding the protein domain ID: from Ensembl I get for each protein coding transcript its translation, which is in fact a protein sequence (AA) along with its ID (the Ensembl In a first version I will return protein results from the database as a |
Fantastic! Let me know when this becomes available and I will update |
Extracting the protein annotations in form of a
data.frame
andDataFrame
is straight forward, the question however is what type of object could best represent the protein annotation.The object should be something similar to a
GRanges
, eventually theProteins
class from thePbase
(https://github.com/ComputationalProteomicsUnit/Pbase) package?I've got:
@lgatto any suggestions/preferences here?
The text was updated successfully, but these errors were encountered: