-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Revise terminology to align with current RFCs. #15
Comments
There is also the WHATWG URL living standard that is mostly RFC 3987 with some changes for web backward compatibility. |
Yes - I nearly put HTML URLs in. I couldn't find a grammar to link to. They do come from a different design space and have various features related to that (e.g. display, normalization of output). They do discuss "c:"! It is "living" so may change. |
There have been issues raised before on using the URL spec rather than IRI. The main advantage is that it is properly citable, but the lack of a BNF grammar. My thought is that we continue to use the term "IRI", defined in RDF Concepts for others specs to cite. That is, in turn, defined in an appendix based on the URL living standard with a historical reference to RFC 3987 (also in changes). The appendix can include the ABNF from RFC397, but probably should be non-normative. Validation comes from the URL spec, with informative reference to the historical ABNF and note that there are valid IRIs not handled by the ABNF. We also update our terminology here, and elsewhere as noted in RDF1.1 Erratum 29. |
Alternatively, we could join with consensus and define
|
Some of the historical naming is not great. It works in the specs (2396) but get lost when used outside. An "Absolute URI" (3986, 4.3) is with-scheme and without-fragment. In the RDF context, it is not very useful. |
The URL standard brings in a lot of material that is related to its use domain. It is something RDF applications may wish to use. RDF is compatible with the URL spec. I don't see what advantages it brings to the RDF platform (RDF specs) to be based on the URl spec. We need less than what the URL spec covers. We end up with two sets of terminology, confusing the situation. That does not help the material on the web in books and training materials already in existence.
validation == the parsing algorithm? The URl spec defines "validation error". For example, what will be the RDF position on percent-encoding? "Code points greater than U+007F DELETE will be converted to percent-encoded bytes by the URL parser." That seems to imply a change to RDF even if we, somehow, explain it does not apply. "A URL is a struct that represents a universal identifier". That is confusing for our usage. RDF is a data format, we don't need to be involved with the parsed presentation.
I think it would raise questions. One of the stated URL goals is to superseded the terminology of URI and IRIs. We still have to include the grammar (3987 section 2) and the URI algorithms extended for unicode (section5) in RDF Concepts. "As the editors learn more about the subject matter the goals might increase in scope somewhat." As far as I can see, we can probably use some of the URL spec (not all of it) if we do the necessary checking. |
I'll add the Note that URL is a living spec, so could be updated to address our needs, given the time and expertise to spend on it.
For other obsolete terms still in use we've used
One of the things to explore with the I18N group is the lack of an ABNF grammar, which I presume has some reasoning behind it. Distinguishing
This seems to be the output of the parser, which I don't believe we would use directly. Interesting, though, that the description of URL serializing does not decode the characters beyond U+007F, which does not seem very friendly to international users.
Presuming that it is compatible with URL. I'd hate to do this without fully understanding why the URL spec does not include such a grammar. Perhaps the spec, or one related, could be updated to deal with the specific needs of RDF. |
Pointer please.
I'm wasn't talking about the specs. I mean material that has been blogged, video'ed, and printed.
Why would it differ? (aside from the false statements about spaces in the goals section). |
Notably IRIStatus from the I18N wiki.
Why does URL have a grammar? I don't have anything to point to, but I have some recollection that ABNF was inadequate, thus the procedural description of parsing URLs. I'd like to get clarification on this.
The IRIStatus doc seems to describe the motivation. But it is certainly heavily biased to the needs of browser vendors. We need to have a good rationale for either subsuming the RFC3987 work relevant to RDF, or adopting/adapting the URL spec. |
IRIStatus was last updated was 2015. It is mainly concerned about presentation of international domain names and bidi URLs. That, and conversion (i.e. changing the string) to ASCII are important for browsers. There isn't a problem with RFC 3987 We should (and, currently, we are) compatible with the URL spec. There are several RFCs and W3C for specific URI schemes. We don't reference those nor discuss their specifics. |
Also notable, IRIStatus was authored and edited over ~18 months by a single person (Aphillip who did not see fit to create a user page in the i18n wiki, but appears to have been Addison Phillips whose personal pages all appear to be outdated and/or offline). Addison Phillips's chairmanship of the i18n WG notwithstanding, this does not suggest to me that the apparently unratified IRIStatus is the "intended direction for W3C", a group with hundreds of member organizations and not-easily-counted hundreds if not thousands of Member-representing and Invited Expert individual participants. |
Clearly, we need to understand if W3C has a policy about international identifiers and how to handle the state of RFC3987 which presents an issue; is this a TAG question, or an I18N question? As previously noted, we can either choose adopt the pertinent core of RFC3987 and use RDF Concepts as the citable reference for Updating or adopting RFC3987 seems like serious scope creep, but not having a normative specification to reference isn't acceptable either. |
Please note that most IETF specifications are at Proposed Standard. These get referenced all the time. It's not like a CR or PR at W3C.
Please note that RFC 8820 obsoletes RFC 7320, so as a result, RFC 3986 is really only updated by RFC 6874 and RFC 8820. Also, there's an update to RFC 6874 at https://datatracker.ietf.org/doc/draft-ietf-6man-rfc6874bis/. That also updates RFC 3987 (if it gets approved). But please note that IPv6 zone identifiers are something extremely local, definitely not suited for RDF. As for RFC 8820, that's essentially just best current practice, and there, RDF has its own considerations. |
The URL spec is not the right spec. It is concerned with processing URLs, and the parsing algorithm changes the input string. It has specific percent-encoding handling which must only be performed once between original IRI and any deference action or the meaning has been changed (and it's a security risk). As previously discussed (RDF 1.1 timeframe - whether IRIs be normalized) RDF specs are transporting IRIs and not concerned with "producing" or "dereference" (application use). The title here is "Revise terminology to align with current RFCs." - e.g. IRI references and absolute IRIs. #17 mentions the WG may change core terminology based on URL spec - that is completely different and should not be in FPWD when the WG hasn't indicated it will even consider doing this. #17 does not mention "adopt the pertinent core of RFC3987". We need a grammar for IRIs. It would be useful (but is not essential) to have a complete updated IRI grammar and not just the changes. |
Yes. The "Proposed Standard level" are defined in RFC 2026#section-4.1.1 (which is not changed by RFC 6410#section-2.1). I found URNs are "proposed standard":
|
Perhaps it's worth an informative note describing the relationship/compatibility of RFC3987 and the URL standard.
Would you suggest an appendix that merges and updates the ABNF grammar in this document?
It is great that we can put this to bed and continue to use RFC3987, as the issue of referencing that and using the URL standard instead has come up a number of times in different places. |
Good idea - give me a task to do it. |
I have tooling to generate HTML for an ABNF grammar; I created #20 for adding an ABNF grammar we can collaborate on. |
This should be split into two issues. One for aligning terminology and one for including the combined IRI grammar. |
From #21. Ensure that "IRI" in the section title "IRI Grammar" does not have under-dots (the abbreviation style). |
* Described IRIs as "resolved" rather than "absolute". For #15. --------- Co-authored-by: Ted Thibodeau Jr <tthibodeau@openlinksw.com> Co-authored-by: Andy Seaborne <andy@apache.org>
RFC 3987 is a "proposed standard" in the RFC process.
RDF relies on the syntax extensions that RFC 3987 makes to RFC 3986.
RFC 3986 is updated by RFC 6874 and RFC 8820.
This should include RDF1.1 Erratum 29 which refers to the use of certain URI/IRI terminolgy. ("absolute IRI", "relative IRI reference")
The text was updated successfully, but these errors were encountered: