Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

evidence determination for gene-product's involvement with downstream processes #99

Open
ukemi opened this issue Mar 10, 2017 · 10 comments

Comments

@ukemi
Copy link

ukemi commented Mar 10, 2017

When we generate complex models, we want to be able to retrieve correct evidence to support inferences or assertions for processes that are far downstream of intitial annotons. This becomes problematic when the evidence for the association between a gene product's function and the evidence that links the two processes together is different. It is compounded when multiple annotons all feed into a complicated downstream pathway. We need to be able to specify which evidence statement goes with which annoton and we would like to be able to put multiple pieces of evidence on some statements (even inferred ones) if there is experimental evidence to support the direct inference between the action of a gene product and a downstream process.

Here are a couple example to look at for next Thursday's meeting:

simple http://noctua.berkeleybop.org/editor/graph/gomodel:586fc17a00000729
complex http://noctua.berkeleybop.org/editor/graph/gomodel:5745387b00001750

computational combinatorial evidence used in automatic assertion(ECO:0000246)?

@balhoff
Copy link
Member

balhoff commented Mar 10, 2017

For reference, one of the ideas we had to support this is in this Noctua request: geneontology/noctua#407

Alternatively, curators could simply add redundant direct relations to the model (which would have otherwise been inferred) and add evidence to them. Ideally a reasoner would inform the curator when any inconsistencies are introduced.

@ukemi
Copy link
Author

ukemi commented Mar 10, 2017 via email

@kltm
Copy link
Member

kltm commented Jun 14, 2017

So, post meeting, the idea now is to add an annotation to the evidence individual that is a virtual edge back to the desired GP individual, correct?

@ukemi
Copy link
Author

ukemi commented Jun 15, 2017

Yes. Although I think this is still a compromise in representation, it is the way that we will proceed based on curator input. So the idea is that we will allow an additional annotation to the evidence that will refer to a gene product (could also be a complex) that is used in the model.

@kltm
Copy link
Member

kltm commented Jun 15, 2017

But a single individual, right? Or do we want allow for the option to have an evidence individual contain a (virtual) edge back to multiple entities to support the evidence.

For @cmungall and @balhoff , instead of virtual edge annotations, which are easy to deal with as they are not "real", would it be conceptually easier to fold this into the evidence refactor or geneontology/noctua#141 and realize them as actual edges?

@ukemi
Copy link
Author

ukemi commented Jun 15, 2017

I think it should go to a single individual. This leaves open the possibility for what we brought up yesterday concerning multiple instances of the same gene-product class being in a model. Although I think this will be very rare, I think when it happens we will need the link to a single individual for the reasons that @balhoff pointed out on the call. That being, that the final annotation is always inferred from an instance representation of a gene product executing some function at the beginning of a chain of relations. If this could all be somehow behind the scenes so curators wouldn't have to think about it unless it becomes necessary, that would be good.

@vanaukenk
Copy link

Just FYI, I have constructed a subset of my Notch signaling model that, for simplicity, only contains the function annotations for the ligand and receptor.

It is fully evidenced and still has all of the features we'd like for testing evidence on downstream processes, but being simpler, is easier to dissect and troubleshoot.

http://noctua.berkeleybop.org/editor/graph/gomodel:5941cdbd00000002

The more complete Notch signaling pathway actually may be an interesting use case for potentially having multiple instances of the same gene class in a model since at least three distinct instances of C. elegans Notch activity, localized to: 1) extracellular space, 2) plasma membrane, and 3) nucleus, are involved in the pathway. In this case, I can imagine having a piece of evidence that supports involvement of all three activities (e.g. mutant phenotype based on a deletion allele) in regulation of a downstream process, as well as evidence that may only support involvement of a specific activity (e.g. mutant phenotype based on a mutation only affecting formation of the transcription factor complex) and I think I'd want to be able to clearly differentiate between the two.

@cmungall
Copy link
Member

@vanaukenk - thanks for the simpler model

Do we have a list of what we consider to be errors of omission or comission from:
http://noctua.berkeleybop.org/workbench/annpreview/?model_id=gomodel:5941cdbd00000002&barista_token=

@vanaukenk
Copy link

@cmungall - No, but I will create one.

@vanaukenk
Copy link

From 2019-01-31:
Discussed this issue again at hackathon.
Proposal:
For GPAD output, nodes more than 2 steps downstream will get the evidence between the MF and the first BP or the evidence between the MF and the first CC.
Curators can put evidence on as many edges as needed in the actual model.
This needs to be tested with different models that chain BPs and CCs with different relations.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Development

No branches or pull requests

5 participants