use IRIs uniformly in HETS #1596

mcodescu · 2016-03-14T14:41:57Z

We currently have IRIs defined in OWL2 and in Common.IRI, and in addition we have CASL identifiers. This complicates things, e.g. when translating from OWL to CASL or when using Common.IRIs for OWL2. Ideally, there should be just one type of identifier, which we could create by extending Common.IRI with some mixfix annotations to cover CASL identifiers as well. We should also have a convention, documented in the wiki, about implicit values for fields in the IRI type. As a result, all logics (including CASL and OWL2) should use the new Common.IRI type.

Hets modules to look at: Common.IRI (datatype IRI), OWL2.AS (datatype QName, same as IRI, but different from IRI in Common.IRI), Common.Id (datatype Id). These need to be integrated into a uniform datatype. Later on, look at the parsers: Common.IRI, OWL2.Parse, Common.Token (e.g. parseId).


tillmo · 2017-09-18T10:11:39Z

What happens with fully qualified names f:s->t, infixes __+__ and compound IDs List[Elem]? First idea: just include them verbatim. Does the IRI syntax allow this?
Some answers: in compound IDs, only List could be an IRI, not Elem. If Elem is later instantiated by an IRI, only the local part of that IRI will be substituted for Elem. This of course can lead to name clashes.
Special symbols like : would be escaped with their hex code (while still being nicely displayed in Ontohub).
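
A minimal sketch of the hex-escaping idea, assuming plain percent-encoding as implemented by Python's `urllib.parse` is what is meant. The function names `escape_casl_id` and `unescape_casl_id` are hypothetical and are not part of Hets or Ontohub.

```python
from urllib.parse import quote, unquote


def escape_casl_id(casl_id: str) -> str:
    """Percent-encode every character except ASCII letters, digits and _.-~"""
    # safe="" makes quote() escape ':' and '/' as well, which it would
    # otherwise leave untouched ('/' is in its default safe set).
    return quote(casl_id, safe="")


def unescape_casl_id(escaped: str) -> str:
    """Inverse of escape_casl_id."""
    return unquote(escaped)


if __name__ == "__main__":
    for name in ["f:s->t", "__+__", "List[Elem]"]:
        print(name, "=>", escape_casl_id(name))
```

Note that this escapes more than RFC 3987 strictly requires (for instance `+`), so it only illustrates the round trip, not the minimal set of characters to escape.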

tillmo · 2017-09-20T16:54:37Z

The problem is that in the standard http://www.ietf.org/rfc/rfc3987.txt brackets [] are forbidden in IRIs, while some symbols like + and : are allowed (at least in the local part). However, all unicode characters beyond ASCII are allowed, too. Hence we could replace square brackets by some unicode brackets. Alternatively, round brackets () are allowed, too (but these are already used for application to arguments), as well as curly braces {} (but these already are used for DOL structured OMS).

Possible solution: use, when displaying the OMS (i.e. for "show theory"):
Id〚X﹐Y〛 for Id[X,Y]
f：s×t→u for f:s*t->u
× for *
／ for /
＼ for \
＜ for <
＞ for >
？ for ?
： for :
＃ for #
＾ for ^
❙ for |
The CASL SIGNs _+-&=!.$@~¡¿÷£©±¶§¹²³·¢◦¬μ are legal in IRIs.
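
The replacement table above could be applied roughly as follows. This is only a sketch: the names `DISPLAY_CHARS`, `to_display` and `from_display` are made up for illustration, and the table is copied from the list above rather than from any Hets source.

```python
# ASCII character -> Unicode look-alike, as proposed in the list above.
DISPLAY_CHARS = {
    "[": "〚", "]": "〛", ",": "﹐",
    "*": "×", "/": "／", "\\": "＼",
    "<": "＜", ">": "＞", "?": "？",
    ":": "：", "#": "＃", "^": "＾", "|": "❙",
}
# The only two-character sequence in the proposal.
ARROW_ASCII, ARROW_DISPLAY = "->", "→"

_TO_DISPLAY = str.maketrans(DISPLAY_CHARS)
_FROM_DISPLAY = str.maketrans({v: k for k, v in DISPLAY_CHARS.items()})


def to_display(casl_id: str) -> str:
    """Replace IRI-unfriendly characters by their look-alikes."""
    # Handle "->" first, otherwise its '>' would become a full-width '＞'.
    return casl_id.replace(ARROW_ASCII, ARROW_DISPLAY).translate(_TO_DISPLAY)


def from_display(shown: str) -> str:
    """Map the look-alikes back to the ASCII characters."""
    return shown.translate(_FROM_DISPLAY).replace(ARROW_DISPLAY, ARROW_ASCII)


if __name__ == "__main__":
    for name in ["Id[X,Y]", "f:s*t->u"]:
        print(name, "=>", to_display(name))
```

The round trip only works for identifiers that do not already contain one of the look-alike characters (for instance a literal ×), so a real implementation would need a rule for that case.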

Another question is whether all CASL SIGNs should be legal in full DOL IRIs. Probably it would be better to allow them only in unprefixed identifiers.

tillmo · 2017-09-25T08:44:17Z

For mapping CASL IDs to IRIs, if there is no empty prefix, we need a default. This could be the IRI of the library and should be consistent with Ontohub. However, in Ontohub, we need to disambiguate between OMS, mappings, symbols, axiom names etc. (see ontohub/ontohub-backend#9 (comment)). We could append the entity kind (OMS, symbol etc.) to the IRI, which is quite unusual, but from a Hets perspective this would work better than prepending it, because Hets needs to work not only with Ontohub, but also with other sources on the web. We could allow the omission of the appended entity kind if there is no overloading, i.e. if a resolution to a unique entity is possible. There could be a redirection mechanism from the default to the version with the appended entity kind. For example, in a library http://ontohub.org/user/pizza-repo/pizza-library, a symbol PeperoniPizza in OMS pizza-oms would be expanded to the loc/id (IRI)

http://ontohub.org/user/pizza-repo/pizza-library//pizza-oms//PeperoniPizza//symbol

but there also would be the default loc/id

http://ontohub.org/user/pizza-repo/pizza-library//pizza-oms//PeperoniPizza

Note that this mechanism is different from the usual DOL prefixing mechanism. For example, if we had

prefix : http://ontohub.org/user/pizza-repo/pizza-library//

a symbol PeperoniPizza would be expanded to the loc/id

http://ontohub.org/user/pizza-repo/pizza-library//PeperoniPizza

which does not follow the DOL/Hets convention that a loc/id should include the library, the OMS and the entity name. The general form of a loc/id therefore is

:iri-of-library//:oms//:entity-kind
:iri-of-library//:oms//:entity-name//:entity-kind
:iri-of-native_document//:entity-name//:entity-kind

and in Ontohub, this specialises to

http://ontohub.org/:user/:repo/:path-to-library//:oms//:entity-kind
http://ontohub.org/:user/:repo/:path-to-library//:oms//:entity-name//:entity-kind
http://ontohub.org/:user/:repo/:path-to-native_document//:entity-name//:entity-kind
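
To make the suffix scheme concrete, here is a small sketch that assembles a loc/id from its parts. The function name `loc_id` and its parameters are hypothetical. Only the `//` separator and the ordering of the components are taken from the patterns above.

```python
from typing import Optional

SEP = "//"  # separator between the components of a loc/id


def loc_id(base_iri: str,
           oms: Optional[str] = None,
           entity_name: Optional[str] = None,
           entity_kind: Optional[str] = None) -> str:
    """Join library (or native document) IRI, OMS, entity name and kind.

    Components that are None are simply left out, which covers the
    native-document form (no OMS) and the default form (no kind).
    """
    parts = [base_iri.rstrip("/")]
    for part in (oms, entity_name, entity_kind):
        if part is not None:
            parts.append(part)
    return SEP.join(parts)


if __name__ == "__main__":
    lib = "http://ontohub.org/user/pizza-repo/pizza-library"
    print(loc_id(lib, "pizza-oms", "PeperoniPizza", "symbol"))
    print(loc_id(lib, "pizza-oms", "PeperoniPizza"))
```

One thing the sketch makes visible: with the kind omitted, `:iri-of-library//:oms//:entity-name` has the same shape as `:iri-of-library//:oms//:entity-kind`, so a consumer cannot tell the two apart from the syntax alone.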

Alternatively, we could use the usual web convention to prefix each name with a kind:

:iri-of-library//oms//:oms
:iri-of-library//oms//:oms//symbol//:symbol-name
:iri-of-native_document//symbol//:symbol-name

Note that we also have to process native OWL, RDFa documents etc. The symbols there have IRIs that do not follow the above mechanism. But these symbols do not follow linked data principles anyway. We provide alternative IRIs (loc/ids) for these symbols in the above form, which follow linked data principles.

tillmo · 2017-09-25T10:37:26Z

Another question is whether we use the IRI of the location of the document or the IRI in the library declaration within the document. (This is comparable to: IRI of the location of an OWL document versus IRI declared in the Ontology declaration.) It seems that we should use the latter, because this is the explicitly declared one, even if this breaks linked data principles in cases where the two IRIs differ.

BerndKB · 2017-09-26T11:43:48Z

Since the problem of incomplete character sets seems to occur with several people, we might try
"‚" - 1738 - U+201A single lower-9 quotation mark
that displays well here, in Protégé and in my TextEdit editor.
I have not found a really FAT comma ...

BerndKB · 2017-10-11T15:01:15Z

I encountered another problem, perhaps only with Protégé (?): if we use the following names
C［XˌY］
D〚X‚Y〛
E〚X❜Y〛
then the full IRI (with the "Show Full IRI" option) will show correctly, but without this option Protégé displays
C［XˌY］
‚Y〛
❜Y〛
i.e. the text before the resp. "comma" is taken to be part of the URI prefix.
If it stays that way, it will be extremely confusing for the user.

tillmo · 2017-10-11T15:27:37Z

yes, I can confirm this. The problem is the comma. Only ˌ seems to work. So for example F〚XˌY〛works for me.

BerndKB · 2017-10-11T16:21:46Z

Yes, F〚XˌY〛works for me also.

clange · 2017-11-05T13:53:23Z

Some thoughts while I'm reading this. I have not yet read the comments to the end. @BerndKB BTW good to "see" you here. I was also at ISWC (just the main conference); didn't manage to talk to you there, but I heard some of my colleagues did.

Another alternative character for enclosing type arguments might be <...> (think of Java or C++ generics), but this is even less allowed in IRIs than other bracket characters, probably because it's commonly used to enclose complete IRIs.

clange · 2017-11-05T14:08:26Z

The discussion about prepending the entity kind to IRIs reminds me of another possible solution: following the approach of punning in OWL 2, i.e. allowing the use of the same name for different entities having different entity kinds, where the kind of an entity is determined from the syntactic context.
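
A rough sketch of how kind-based resolution could look, under the assumption that a symbol is identified by the pair of name and entity kind. The class `SymbolTable` and the kind names used in the demo are invented for illustration and do not correspond to Hets or OWL API types.

```python
from collections import defaultdict
from typing import Optional, Tuple


class SymbolTable:
    """Names may be reused across kinds; (name, kind) must be unique."""

    def __init__(self) -> None:
        self._kinds_by_name = defaultdict(set)

    def declare(self, name: str, kind: str) -> None:
        self._kinds_by_name[name].add(kind)

    def resolve(self, name: str,
                kind: Optional[str] = None) -> Optional[Tuple[str, str]]:
        """Return (name, kind), or None if no such symbol is declared.

        If the kind is not given (no syntactic context), resolution
        succeeds only when the name is not overloaded.
        """
        kinds = self._kinds_by_name.get(name, set())
        if kind is not None:
            return (name, kind) if kind in kinds else None
        if not kinds:
            return None
        if len(kinds) > 1:
            raise LookupError(f"{name!r} is overloaded: {sorted(kinds)}")
        return (name, next(iter(kinds)))


if __name__ == "__main__":
    table = SymbolTable()
    table.declare("Pizza", "class")
    table.declare("Pizza", "individual")
    table.declare("hasTopping", "object-property")
    print(table.resolve("hasTopping"))      # unique, kind can be omitted
    print(table.resolve("Pizza", "class"))  # kind taken from the context
```

This also matches the earlier suggestion that the entity kind may be omitted whenever resolution to a unique entity is possible.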

clange · 2017-11-05T14:19:22Z

In any case I would not mess with the mechanism of "prefix expansion by concatenation", which DOL reuses from the specification of CURIEs in RDFa, and which occurs similarly in related standards, including SPARQL and Turtle. The beauty of this mechanism is its simplicity. Or, on the contrary, if we wanted to deviate from this mechanism, we should be bold instead of half-hearted and do away with it completely, i.e. devise a powerful mechanism for abbreviating long identifiers that doesn't have to respect the restrictions of any existing standard (except maybe RFC 3987). As an analogy, compare the URI syntax of MMT, which, IIRC, has its own approach to relative paths, which is not supported by any other implementation but is self-contained and makes a lot of sense for the MMT use cases.
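
For reference, "prefix expansion by concatenation" amounts to little more than the following sketch. The function name `expand_curie` and the treatment of unprefixed names as belonging to the empty prefix are assumptions made for this example.

```python
from typing import Dict


def expand_curie(curie: str, prefixes: Dict[str, str]) -> str:
    """Expand 'prefix:local' by concatenating the prefix IRI and 'local'.

    A name without any ':' is treated as belonging to the empty prefix.
    An undeclared prefix raises KeyError.
    """
    prefix, sep, local = curie.partition(":")
    if not sep:
        prefix, local = "", curie
    return prefixes[prefix] + local


if __name__ == "__main__":
    prefixes = {"": "http://ontohub.org/user/pizza-repo/pizza-library//"}
    print(expand_curie(":PeperoniPizza", prefixes))
    print(expand_curie("PeperoniPizza", prefixes))
```

The sketch deliberately does not try to tell a CURIE apart from a full IRI such as `http://...`, whose scheme would be mistaken for a prefix here.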

clange · 2017-11-05T14:26:03Z

On whether or not to follow linked data principles, one can also take inspiration from MMT's approach of deliberately not following them. Not following them might of course cause confusion because DOL intends to be compatible with languages whose best practice is to follow them, and DOL should also be inviting to the users of such languages.

clange · 2017-11-05T14:29:12Z

Done with my comments. No straightforward solution I could offer, but I hope you'll find my input useful at least.

tillmo · 2017-11-05T20:09:49Z

many thanks for your comments. Do you have the impression that we mess with the mechanism of "prefix expansion by concatenation"?

mcodescu mentioned this issue May 8, 2016

fail if names of sentences appear as names of symbols #1630

Merged

mcodescu mentioned this issue Jun 1, 2016

Module extraction does not keep prefixes #1641

Closed

tillmo mentioned this issue Aug 13, 2016

Use save_file for external repositories ontohub/ontohub#1763

Open

mcodescu mentioned this issue Mar 22, 2017

disambiguate OWL symbols with their kind #1697

Open

tillmo mentioned this issue Sep 20, 2017

Stratify names of instantiations of compound ids in OWL2 #1754

Open

This was referenced Oct 15, 2017

[WIP] 1691 dg calculus on database #1752

Merged

names for nodes in unit specs #1762

Merged

tillmo mentioned this issue Oct 24, 2017

1596 iris #1763

Merged

tillmo mentioned this issue Dec 11, 2017

flattening for (generic) specifications definition #1755

Open

tillmo closed this as completed in #1763 Jan 4, 2018
