Define structure for validation report #14

goodb · 2019-07-10T18:13:03Z

When we apply the shapes to a GO-CAM model, we need to formalize what the code should return. The ShEx libraries provide a mapping from the RDF nodes in the model to the labels of the shapes in the provided schema. This alone seems insufficient for users. I'm thinking of a response that requires some additional logic and contains additional elements like:

1. A boolean for whether the model as a whole should be called 'valid' according to the schema, similar to the OWL consistency check. This might be refined into subtypes of model-level quality.
2. A human readable explanation of 1.
3. Anything else? I was thinking it would be useful to integrate the shape validation with the OWL validation so the OWL inference report could go in here as well.
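
To make the three elements above concrete, here is a minimal sketch of what such a report could look like. Everything in it is hypothetical: the class names `NodeResult` and `ValidationReport`, the field names, and the rule that an OWL inconsistency makes the model non-conformant are illustrative assumptions, not something the ShEx libraries provide.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Set


@dataclass
class NodeResult:
    """Per-node outcome (hypothetical): matched shape labels plus any problems."""
    node: str  # IRI of the named individual
    matched_shapes: Set[str] = field(default_factory=set)
    problems: List[str] = field(default_factory=list)


@dataclass
class ValidationReport:
    """Hypothetical model-level report combining shape and OWL results."""
    model_id: str
    node_results: Dict[str, NodeResult] = field(default_factory=dict)
    owl_consistent: Optional[bool] = None  # None means the OWL check was not run
    owl_explanation: str = ""

    @property
    def conformant(self) -> bool:
        # Element 1: a single boolean for the model as a whole.
        shapes_ok = all(not r.problems for r in self.node_results.values())
        return shapes_ok and self.owl_consistent is not False

    def explanation(self) -> str:
        # Element 2: a human readable explanation of the boolean.
        lines = [f"{r.node}: {p}"
                 for r in self.node_results.values() for p in r.problems]
        if self.owl_consistent is False:
            # Element 3: fold the OWL report into the same response.
            lines.append(f"OWL: {self.owl_explanation}")
        return "\n".join(lines) if lines else "model conforms to the schema"
```

Keeping the per-node results inside the report means the boolean and the explanation are both derived from the same data, so they cannot disagree.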

On computing model-level validity, I'm thinking something like:
For each named individual in the model:

1. It must have an RDF type and a biolink category (these should probably be added to the root gocamentity shape).
2. The BL:category annotation should match a predefined shape. e.g. anything tagged bl:category [GoMolecularFunction:] must match the shape and must not match anything else.
3. Anything else?
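
The per-individual rules above could be sketched roughly as follows. This assumes the set of matched shape labels for each node has already been obtained from the ShEx library's node-to-shape mapping; the `CATEGORY_TO_SHAPE` table, the function names, and the message strings are hypothetical placeholders.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical lookup: the single shape each bl:category value is expected to imply.
CATEGORY_TO_SHAPE: Dict[str, str] = {
    "GoMolecularFunction": "MolecularFunction",
    "GoMolecularEntity": "MolecularEntity",
}


def check_individual(rdf_types: Set[str], categories: Set[str],
                     matched_shapes: Set[str]) -> List[str]:
    """Return the problems found for one named individual (empty means OK)."""
    problems: List[str] = []
    if not rdf_types:
        problems.append("missing rdf:type")
    if not categories:
        problems.append("missing bl:category")
    expected: Set[str] = set()
    for category in sorted(categories):
        shape = CATEGORY_TO_SHAPE.get(category)
        if shape is None:
            problems.append(f"no shape defined for category {category}")
        else:
            expected.add(shape)
    # The node must match the shape implied by its category...
    for shape in sorted(expected - matched_shapes):
        problems.append(f"does not match expected shape {shape}")
    # ...and must not match anything else.
    for shape in sorted(matched_shapes - expected):
        problems.append(f"matches unexpected shape {shape}")
    return problems


def check_model(individuals: Dict[str, Tuple[Set[str], Set[str], Set[str]]]
                ) -> Tuple[bool, Dict[str, List[str]]]:
    """The model is valid only if no individual reports a problem."""
    problems: Dict[str, List[str]] = {}
    for node, (types, categories, shapes) in individuals.items():
        found = check_individual(types, categories, shapes)
        if found:
            problems[node] = found
    return (not problems, problems)
```

The returned dictionary of problems per node is the raw material for a "geeky" explanation of why the model-level boolean came out false.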

balhoff · 2019-07-10T19:20:28Z

> The BL:category annotation should match a predefined shape. e.g. anything tagged bl:category [GoMolecularFunction:] must match the shape and must not match anything else.

How does this interact with "inheritance"/shape intersection? The following definitions imply to me that a node matching the <Complex> shape will have two values for bl:category: GoComplex: and GoMolecularEntity:. Is that a problem for this principle?

<Complex> @<MolecularEntity> {
   bl:category [GoComplex:]  ;
}// rdfs:comment  "a protein complex"

<MolecularEntity>  EXTRA bl:category {
   bl:category [GoMolecularEntity:]  ;
}// rdfs:comment  "a molecular entity (a gene product, chemical, or complex typically)"
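
One way to see the concern is to walk the shape references and collect the bl:category values a node would need. The sketch below is only an illustration of the reading given above: the `SHAPE_PARENT` and `SHAPE_CATEGORY` tables are hand-written mirrors of the two definitions, not output from any ShEx library.

```python
from typing import Dict, List, Optional

# Hand-written mirror of the two shape definitions above (illustrative only).
SHAPE_PARENT: Dict[str, Optional[str]] = {
    "Complex": "MolecularEntity",
    "MolecularEntity": None,
}
SHAPE_CATEGORY: Dict[str, str] = {
    "Complex": "GoComplex",
    "MolecularEntity": "GoMolecularEntity",
}


def required_categories(shape: Optional[str]) -> List[str]:
    """Collect the bl:category values required by a shape and the shapes it references."""
    required: List[str] = []
    while shape is not None:
        required.append(SHAPE_CATEGORY[shape])
        shape = SHAPE_PARENT[shape]
    return required


def violates_single_category(shape: str) -> bool:
    """True if conforming to the shape forces more than one bl:category value."""
    return len(required_categories(shape)) > 1
```

Under this reading, `required_categories("Complex")` yields both `GoComplex` and `GoMolecularEntity`, so a strict one-category-per-node rule would reject every conforming complex.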

goodb · 2019-07-10T20:16:22Z

I think it is, but it's something we could work around if we needed to. Basically, do we allow multiple BL categories for individual nodes or not? I feel like we probably do not want to recreate hierarchies with category tags. So here we should either make a subshape of @ if we need to refer to complexes in shapes or just eliminate the shape and use only .

cmungall · 2019-07-11T06:28:59Z

I think explanations will be massively important in the long run but we have some time to defer on this as we can make do with geeky explanations in the short term while the modeling group iterates over some of the basics.

I do think we will need to refer to complexes in the schema, for example to state the expected has-part structure.

goodb · 2019-07-11T16:03:49Z

@cmungall the key thing is to get the computation of the multi-node, model-level validity in place. Once that is done, the explanations, geeky or otherwise, will fall out easily.

goodb mentioned this issue Jul 10, 2019

bl:category should point to a biolink model URI #5

Closed

cmungall mentioned this issue Jul 11, 2019

Attempting to combine CLOSED with inheritance fails cmungall/obo-shapes#1

Open
