-
Notifications
You must be signed in to change notification settings - Fork 57
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
URL comparisons when retrieving metadata files #562
Comments
This is based on Table Compatibility:
So, they're not canonicalized, but are made absolute by joining to the base URL. I think the text is clear, can we close this? |
The question is rather whether a modern W3C spec ought to be comparing URIs/(IRIs?) as strings, vs some other deeper check for equivalence. I'm not thinking of semweb "sameAs" for refers to the same real world thing, but rather equivalencies baked into the URI family of specs, e.g. meaning of :80, case insensitivity of domain names, perhaps some internationalization concerns. |
Probably the best route is to simply reference [[RFC3968]] section 6 "Normalization and Comparison". There is an unbounded set of potential scheme normalization rules, but we might want to limit ourselves to just HTTP and HTTPS normatively. This would affect Table Compatibility as well as Metadata Discovery (where it determines that the metadata refers to the URL used for discovery. I've marked it for discussion on the next call. |
…tadata compatibility and discovery. For consistency, `link` property normalization includes URL normalization. Note the issue marker in the section to find a better way of describing the schemes for which scheme-based normalization should/must be defined. This should affect tests, both by exploring cases of link-property normalization, metadata discovery and compatibility. #562.
I will raise this with TAG when I speak with them. |
TAG has no objection to using [[RFC3968]] section 6 "Normalization and Comparison", so please go ahead with that PR. |
When comparing URLs for this purpose, what do we say if anything about non-string-identical URIs eg. http://example.COM/ vs http://example.COM:80/, rel=canonical etc?
See http://www.w3.org/2015/05/20-csvw-irc#T14-27-34
The text was updated successfully, but these errors were encountered: