Option to cache ontologies #62

cthoyt · 2020-03-02T17:04:02Z

I've been debugging the CHIRO (CHEBI Integrated Role Ontology) OBO export, and it had a few issues. First, I had to manually add some Typedef stanzas for its ad-hoc relations. Second, I had to switch the imports from its slimmed versions to the originals since the slim versions were missing several entities.

This lead me to the problem that it has to download each ontology file each time, and this takes a loooong time. Therefore, I'd like to request an option to cache OBO files (either the source .obo or a pre-compiled version as a pickle or OBO JSON)

I understand there could be problems to keeping the caches up-to-date, but maybe there's a simple way to add a dictionary argument to Ontology.__init__ so I can specify where I have my own copies like

from pronto import Ontology

Ontology.from_obo_library('chiro.obo', cache_files={
    'http://purl.obolibrary.org/obo/chiro/imports/chebi_import.owl': '/Users/cthoyt/obo/chebi.owl',
    'http://purl.obolibrary.org/obo/chiro/imports/envo_import.owl' : '/Users/cthoyt/obo/envo.owl',
    ...
})

Or alternatively, maybe you have an idea that could take care of this kind of caching for me so I don't have to look into the imports of the OBO file specifically.

The text was updated successfully, but these errors were encountered:

althonos · 2020-03-02T19:53:22Z

Hi @cthoyt ,

you currently have a (hacky) workaround if you replace import: http://purl.obolibrary.org/obo/chiro/imports/chebi_import.owl with import: chebi_import.owl in the source OBO; then pronto will try to use a local file named chebi_import.owl.

I could add something to provide a source as a interface (like an SourceProvider interface), it would indeed be a better way to let the user implement it, plus the current code could benefit from refactoring.

althonos added the enhancement An issue to request an enhancement, or a pull request implementing one. label Mar 2, 2020

althonos self-assigned this Mar 2, 2020

althonos added this to the v2.3.0 milestone Jul 18, 2020

althonos modified the milestones: v2.3.0, v2.4.0 Sep 21, 2020

althonos modified the milestones: v2.4.0, v3.0.0 Feb 18, 2021

althonos mentioned this issue Sep 7, 2022

Add supports for catalogs in imports #186

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Option to cache ontologies #62

Option to cache ontologies #62

cthoyt commented Mar 2, 2020

althonos commented Mar 2, 2020

Option to cache ontologies #62

Option to cache ontologies #62

Comments

cthoyt commented Mar 2, 2020

althonos commented Mar 2, 2020