New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Guidelines for ISS protein binding #1876
Comments
Sometimes we might need to infer a binding relationship to fill a gap in a model? @vanaukenk ? |
@ValWood how do you capture the protein you inferred from in species A as well as the inferred binding partner in species B (where the annotation object is from species B)? I.e. |
Good point. we haven't ever actually done it so I don't have an example, but we would probably need to to fill annotation gaps .... |
Thanks, Pascale |
The way I had been doing this was to annotate the mouse protein in the model with ISS and the with field filled in for the human protein. Then I would make a separate model for the human protein and the experimental evidence. |
But the mouse-human interaction is not 'normal', so it seems inconsistent with that basic principle of GO annotations ? Pascale |
We divide the capture process in two. First, we annotate a multispecies reaction perhaps with multispecies physical entities in it to capture the actual experimental data. That reaction is labeled "chimeric". (We asked NCBI taxonomy for a species instance "chimeric" and they refused.) That reaction is also accompanied by citations to the relevant primary literature. Then, we annotate the inferred human reaction, populated entirely by human entities and tag it as inferred_from (manually, by curator) the chimeric one. If the second reaction has any literature references, they should be ones in which experts spell out their reasons for believing that the chimeric (or other non-human model system) accurately reflects the behavior of the human one. Maybe an analogy in GO would be likewise to create two models, one that corresponds exactly to the evidence, one that represents the curator's inference to the species of interest. Easy to say; probably a lot of work to implement. |
My example has evidence for one species (e.g. human-human) that is being inferred to another species (e.g. mouse-mouse). So BOTH GPs that are binding are being inferred. |
We have had, for some time now, an annotation rule that disallows ISS to 'protein binding' for exactly the reason that what could be captured in the With/From field was limited: https://github.com/geneontology/go-site/blob/master/metadata/rules/gorule-0000005.md http://wiki.geneontology.org/index.php/Inferred_from_Sequence_or_structural_Similarity_(ISS) I think this is another good argument for capturing the inputs of protein binding with an annotation extension of has_input. That way, the 'input' would be species-specific and physiologically relevant while the With/From value would, as always, reflect the source of the sequence-based similarity inference. |
Yes FYI If I do an ISS for: Ruth |
Why would someone want to infer a human-mouse (or yeast-pombe etc) binding when the experimental data is species-specific for one species already? I would only use mixed-species evidence if that's what the authors used in the lab and then I see the point at inferring to the 2 species-specific interactions by ISS. I.e.: You'd want to get to GP-1(human) | protein binding | IPI | with GP-2(mouse) Or what did I miss conceptually? |
Hi Birgit I'm afraid you do not appreciate that the 'with' field is used for multiple purposes. So we would not create |
Ok, and AE has_input GP-x is the actual binding partner in the same species? But who's actually using this data and how? Unless you go through the pain of parsing out the annotated GP and the GP in the AE if ECO is ISS all you get is "protein binding" - and I'd challenge anyone to find a protein that will never ever bind another protein ;-) Just playing devil's advocate here as so far I've been more of a user of GO annotations than a hard-core contributor. |
yes you have got it. this is why the GOC has said don't create ISS protein binding annotations. But if it was possible to provide information about the predicted binding partner in the AE field and this data could be extracted by uses then this might be useful |
Summary from call on 23/4/18: Options:
Next step: Discuss on next Noctua call (today, Wed, 25/4/18?) as they have more use cases and this is not complex-annotation-specific. |
@vanaukenk & @pgaudet : were we meant to create some guidelines and stick them on the Wiki? Did the Noctua group ever discuss this? |
Thanks. Let me know if and where/when I can help. Memory is fading ;-( |
Added Cambridge GOC meeting as milestone. Further discussions in geneontology/go-ontology#16833 |
Presented at Cambridge GOC mtg on 11/4/19: Agreed to no longer use pre-composed "x complex binding" terms but with/from field (and AE where required).
Closing as new ticket will capture remaining tasks. |
ISS to 'protein binding' should NOT be allowed, since we cannot capture the 'with' (only the 'from').
There are 91 existing annotations:
https://www.ebi.ac.uk/QuickGO/annotations?goUsage=exact&goUsageRelationships=is_a,part_of,occurs_in&goId=GO:0005515&evidenceCode=ECO:0000250&evidenceCodeUsage=descendants
Similarly, ISS to 'protein-containing complex binding' should NOT be allowed for the same reason.
@vanaukenk
@bmeldal
@ValWood
@RLovering
Does that seem OK to you ?
Thanks, Pascale
The text was updated successfully, but these errors were encountered: