-
Notifications
You must be signed in to change notification settings - Fork 33
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Mapping to QUDT2 #60
Comments
Hi,
For a detailed list of individuals you can have a look at this list. This is the result of a tool we built to compare two versions of an ontology. I abused it here to show the difference between Sweet 2.3 and QUDT 1.1, so PS: A publication for the complete ontology evaluation is currently under review. Most of the (data) results, however, are already publicly available as are the scripts to generate them. |
Thank you @SirkoS would you be interested in joining the ESIPFed Org on Github and being notified of updates to SWEET? |
@lewismc I already added this repo to my watchlist. I think this should send me messages regarding any updates. |
The Geological Survey of Queensland where I work is keen to see SWEET/QUDT integration. They will use QUDT for basic units of measure and will declare specialised geo units in QUDT form as well as contribute a QUDT Geo discipline (http://qudt.org/doc/2019/10/DOC_VOCAB-DISCIPLINES-v2.1.html). They are already using SWEET objects here and there, e.g.
But now we have to choose units of measure and we see QUDT as a deep ontology with strong mathematical backing where SWEET units are a bit lighter on. Sweet units do have mathematical elements, e.g. Some touch points are:
Classes: Properties: Unit instances: |
@SirkoS do you have an update of your work? Thank you |
We only created mappings for the unit instances. Other classes etc would have to be added. The mappings we created at the time can be found in the respective Github repo in the file I extracted the mappings between SWEET and QUDT into the following file: SWEET3 - QUDT mapping.txt. This also contains the mappings within SWEET and QUDT themselves, as we found some duplicates within the ontologies at the time. This is also supported by the provided scripts. So if you need another mapping subset can either run the scripts yourself (available here - again the I was not able to rerun the whole analysis but just reused our old results. So the mappings are based on SWEET 3.1 - I don't know if there were any major changes to the units themselves since then. We were/are also planning to push all our mappings to Wikidata, which might be relevant for #156 . However, there are two obstacles right now:
As for the general outlook: @jmkeil is working on generalizing the approach, but I have no clue about progress and schedule for that. |
So I'm not sure if these tabulations are helpful at present. |
I guess you are referring to QUDT2. As I tried to mention, I just ran the respective script based on our old results, which only included QUDT1.1 at the time. QUDT1.1 still has those units in (see http://qudt.org/1.1/vocab/OVG_units-qudt-(v1.1).ttl ):
I will try a rerun for QUDT2, but this will take some time. Also I currently do not have the time for a manual revision. So there might be some mappings missing, but maybe it can serve as the basis for your efforts here. |
I was looking at the thing that the base URI http://qudt.org/vocab/unit resolves to. Now I fully understand that QUDT was a bit late to the party in publishing their resources as linked data, but anyone who doesn't know about the OVG file would likely try to resolve the URIs in the mapping file first. |
Meanwhile, I've submitted an issue to QUDT to add |
So I managed to rerun the scripts now also for QUDT2. Again the warning that this is not validated as carefully as our previous paper, but may be the starting point for you. I found some issues in QUDT2 (see the issue in their repo), which should not affect the mapping - none of the affected IRIs are contained in the mapping. PS: Due to the structure of our scripts this also includes the mappings between duplicates within SWEET (e.g., PPS: Right now the scripts find ~71% of SWEET's units in QUDT2, whereas it only sees 10% of QUDT2's units in SWEET. So right now QUDT2 has a lot more units: 901 vs 137 (just by IRIs). |
@SirkoS this is excellent. I have a few questions
Thanks |
QUDT development has recently moved to an open GitHub repository here: https://github.com/qudt/qudt-public-repo - @SirkoS and I have contributed issues there |
I exported a full list of "duplicates" here. However, I want to add some word of caution: I think most (all?) of them are already connected via
The first seems to be an undetected duplicate across different namespaces.
Right now I don't think that there is one ontology with the claim to be the universal unit ontology (although most names suggest otherwise). Each one caters to their specific domain and will in time contain the units relevant there. As QUDT2 now has a more open process compared to the first version, I expect them to stabilize as well in a while. [1] I'm currently unable to update the results for Wikidata, as I keep running into timeouts. I fear, this will require at least a substantial rewrite of the queries, maybe even of the scripts, so we can split the queries in multiple smaller ones.
In theory, all scripts are openly available here. I had to make some minor adjustments to make them run again, but in general the instructions should suffice. However, it still requires quite some manual effort to validate all results, which we could not automate at the time. Regarding you proposition, @jmkeil and I had discussed about something similar in the past already. As mentioned before, he is also still working in that direction. I'll try to talk to him, whether he sees this as a potential application of his work and to join the discussions here. Nevertheless, I think this can only be some kind of dashboard giving you hints towards issues and not a fully automated quality-check pipeline. |
This issue relates to #23 and #55. What is the extent of QUDT's units coverage as compared to SWEET? Could QUDT be imported--either partially (modular) or entirely--instead of having replication?
An initial mapping should provide a basis for further informed discussion.
The text was updated successfully, but these errors were encountered: