This repository collects the supplementary materials to the article titled "Ontology Reuse: the Real Test of Ontological Design", published in the proceedings of the SOMET 2022 conference, Frontiers in Artificial Intelligence and Applications vol. 355. A preprint version of the article can be accessed (here).
The code in this repo consists of mostly Jupyter notebooks and SPARQL queries. For the former, you will need to set up a Python environment using the attached requirements.txt
file. We used Python 3.9 on Linux. Running this on Windows will most likely fail.
In the subdirectories, additional README.md
files explain the sequence in which the files should be run and how to set up the triple store. In our experiments, we used Blazegraph due to its high performance on large KBs, but please do note that it is currently unmaintained. It may be possible to replicate the queries using other triple stores, e.g., Apache Jena.
In the subdirectories you will find the code used to perform the analyses, as well as the results and (in some cases) suggested patches.
obo/
– analysis of OBO Foundry ontologiesobo/1_rare_props
– rarely used propertiesobo/2_prop_values
– properties with inconsistent value typesobo/3_cross_refs
– cross-ontology referencesobo/summary
– summary of the identified issues
cso/
– analysis of the Computer Science Ontologycso/1_synonym_structure
– CSO's structure transformation demonstrated in the papercso/2_ext_ref_consistency
– externally-checked consistency of references to DBpediacso/3_missing_refs
– missing references to corresponding KBscso/4_refs_reasoning
– logically inconsistent external referencescso/5_intra_ref_consistency
– intra-cluster inconsistencies in external referencescso/6_conflation
– term conflation in clusterscso/summary
– summary of the identified issues
We attempted to describe the experiment well enough, so that interested researchers can reproduce them in the future. However, in case of any unclear documentation or other issues, we will be happy to help. Feel free to contact us, contact details are in the paper.
The materials in this repository are licensed under the MIT License. You can cite this code using this DOI:
Piotr Sowiński, Katarzyna Wasielewska-Michniewska, Maria Ganzha, Marcin Paprzycki