(Metric Impl) Ontology is a member of the OBO Foundry #20

Closed
jerdeb opened this issue Jun 4, 2014 · 2 comments

Comments

@jerdeb
Contributor

jerdeb commented Jun 4, 2014

This is a Reputation Dimension.

In this metric we need to check whether the ontologies used in the dataset are part of the OBO Foundry (therefore we need to check only for the type of an instance).

For more information check D5.1

In the class comment, mention that this metric is specific to the EBI use-case

@jerdeb jerdeb changed the title Ontology is a member of the OBO Foundry (Metric Impl) Ontology is a member of the OBO Foundry Jun 4, 2014
@jerdeb jerdeb added this to the July Deliverable milestone Jun 4, 2014
@clange
Contributor

clange commented Jun 13, 2014

@muhammadaliqasmi I discussed this metric with @nfriesen and will give you a more detailed guide for its implementation below. Please stop reading at the horizontal line. @jerdeb below that line there are some further questions that still need clarification (by you looking into the literature or even talking to the use case partners).

This metric is actually very similar to UndefinedClassesOrProperties. We need to compute the ratio of resources referenced in our dataset that are defined in ontologies that are members of the OBO Foundry.

@jerdeb's comment above that “we need to check only for the type of an instance” (i.e. for the objects of triples whose predicate is rdf:type) is wrong: there are OBO Foundry ontologies (e.g. http://svn.code.sf.net/p/obi/code/releases/2014-03-29/obi.owl) that also define properties and individuals. We therefore need to do the check described below for all predicates and objects of all triples in our dataset. (By the reasoning of #31 I think we can skip the subjects.)
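To make “all predicates and objects, but not subjects” concrete, here is a minimal sketch in Python (chosen only for brevity). Triples are modelled as plain 3-tuples of strings, and `is_uri` and `candidate_resources` are names made up for this sketch; treating every non-http(s) term as a literal or blank node is an assumption, not the project's data model.

```python
from typing import Iterable, Iterator, Tuple

# (subject, predicate, object), all as plain strings in this sketch
Triple = Tuple[str, str, str]


def is_uri(term: str) -> bool:
    # Assumption: URIs are http(s) strings; anything else is a literal or blank node.
    return term.startswith(("http://", "https://"))


def candidate_resources(triples: Iterable[Triple]) -> Iterator[str]:
    """Yield every predicate and every URI object; subjects are skipped."""
    for _subject, predicate, obj in triples:
        yield predicate
        if is_uri(obj):
            yield obj
```

Each URI yielded here would then go through the membership check described below.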

Therefore, @muhammadaliqasmi, I think we can do the following:

  1. check whether the URI starts with one of the strings in a hard-coded list, which for now contains http://purl.obolibrary.org/obo/ as its only element.
  2. check, using VocabularyReader and similarly to the UndefinedClassesOrProperties metric, whether the class/property with this URI is defined.

From this we know that

  • the resource that our dataset reuses is defined in an ontology, …
  • … which is a member of the OBO Foundry.

E.g., for http://purl.obolibrary.org/obo/OBI_9991118, this is the case.
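The two steps above can be sketched roughly as follows, again in Python for brevity. `is_defined` is a hypothetical stand-in for whatever lookup VocabularyReader offers; its real interface is not shown in this thread, so the sketch takes it as a callable.

```python
from typing import Callable, Sequence

# Hard-coded for now, with a single entry, as proposed in step 1.
OBO_PREFIXES: Sequence[str] = ("http://purl.obolibrary.org/obo/",)


def has_obo_prefix(uri: str) -> bool:
    """Step 1: does the URI start with one of the known OBO prefixes?"""
    return uri.startswith(tuple(OBO_PREFIXES))


def is_obo_foundry_term(uri: str, is_defined: Callable[[str], bool]) -> bool:
    """Step 1 and step 2 combined.

    The definition lookup (hypothetical `is_defined`) only runs when the
    cheap prefix test has already passed.
    """
    return has_obo_prefix(uri) and is_defined(uri)
```

Running the prefix test first keeps the potentially expensive vocabulary lookup away from URIs that cannot qualify anyway.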


----

@jerdeb open questions start here:

Should this metric return "true or false", or a ratio (i.e. how many of the classes/properties (also individuals?) used in a dataset are from ontologies in the OBO Foundry)? A ratio makes more sense IMHO, because with "true or false" the question is when we should return true: if all classes/properties/individuals that our dataset reuses are from OBO Foundry ontologies, or if only some of them are?
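One way to look at the trade-off: both boolean readings can be derived from the ratio, but not the other way round. A small Python sketch, using only the prefix test for simplicity; the function names and the choice to return 0.0 for an empty input are assumptions of this sketch.

```python
from typing import List

OBO_PREFIX = "http://purl.obolibrary.org/obo/"


def obo_ratio(uris: List[str]) -> float:
    """Share of the given URIs that start with the OBO prefix."""
    if not uris:
        return 0.0  # assumption: an empty dataset scores 0.0
    hits = sum(1 for uri in uris if uri.startswith(OBO_PREFIX))
    return hits / len(uris)


def all_from_obo(uris: List[str]) -> bool:
    # "true" only if every reused term is an OBO term
    return obo_ratio(uris) == 1.0


def any_from_obo(uris: List[str]) -> bool:
    # "true" as soon as at least one reused term is an OBO term
    return obo_ratio(uris) > 0.0
```

For a dataset where half of the terms are OBO terms, `obo_ratio` gives 0.5, `all_from_obo` is false and `any_from_obo` is true, so the two boolean variants already disagree.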

Secondly, is the above “OBO Foundry membership check” actually right? Dereferencing http://purl.obolibrary.org/obo/OBI_9991118 takes me to http://www.ontobee.org/browser/rdf.php?o=OBI&iri=http://purl.obolibrary.org/obo/OBI_9991118 (and it also gives some meaningful RDF/XML to a linked data client). So I think our job is to see whether the stuff we can download by dereferencing that URI has a certain structure that looks like “OBO Foundry”.
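The dereferencing step itself could be sketched like this, using only the Python standard library. `build_rdf_request` and `fetch_rdf` are names made up for this sketch. What the returned RDF has to look like to count as “OBO Foundry” is exactly the open question, so the sketch stops at fetching.

```python
import urllib.request
from typing import Tuple

RDF_XML = "application/rdf+xml"


def build_rdf_request(uri: str) -> urllib.request.Request:
    """Prepare a GET request that asks for RDF/XML via content negotiation."""
    return urllib.request.Request(uri, headers={"Accept": RDF_XML})


def fetch_rdf(uri: str, timeout: float = 10.0) -> Tuple[str, bytes]:
    """Dereference the URI and return (final URL after redirects, raw body).

    urlopen follows HTTP redirects on its own, so the first element shows
    where the PURL finally resolved to.
    """
    with urllib.request.urlopen(build_rdf_request(uri), timeout=timeout) as response:
        return response.geturl(), response.read()
```

Comparing the final URL against a list of known hosts would be one possible heuristic, but whether that is reliable still needs to be checked.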

I don't know whether this mechanism

  • fails for ontologies in the list of “candidate ontologies and other ontologies of interest” on http://www.obofoundry.org/. These are less reputable ontologies, and we'd like our metric to “fail” on them.
  • works with those ontologies that are written in the language OBO, not in OWL. I know OBO is somehow compatible with OWL, but I don't know what URIs the stuff declared in these OBO ontologies has, and whether the stuff from OBO is also published as linked data.

@muhammadaliqasmi
Contributor

This metric detects non-reputable resources by retrieving the URIs of the resources in a dataset and prefix-matching them against "http://purl.obolibrary.org/obo/".

Metric value = (total number of NOT reputable resources) / (total number of resources)

Metric value range: [0, 1]; best case: 0; worst case: 1
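As an illustration of the formula (not the actual implementation), a counter-based sketch in Python. The class and method names are hypothetical, and returning 0.0 when nothing has been observed is an assumption of this sketch.

```python
class ReputableResourceCounter:
    """Counts resources incrementally and reports the share of non-reputable ones."""

    OBO_PREFIX = "http://purl.obolibrary.org/obo/"

    def __init__(self) -> None:
        self.total = 0
        self.not_reputable = 0

    def observe(self, uri: str) -> None:
        """Record one resource URI taken from the dataset."""
        self.total += 1
        if not uri.startswith(self.OBO_PREFIX):
            self.not_reputable += 1

    def metric_value(self) -> float:
        """NOT reputable / total; 0 is the best case, 1 the worst."""
        if self.total == 0:
            return 0.0  # assumption: nothing observed counts as the best case
        return self.not_reputable / self.total
```

With one OBO URI and three others observed, `metric_value()` returns 0.75. Note that the polarity is inverted compared to a "share of OBO terms" ratio: lower is better here.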

(for further changes/improvements, kindly re-open this issue).
