You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In this case, if we were to use percent-escaping to transform the (same) characters in the address so that they to conform to the URI requirements, we would base the escapes on the bytes that represent 引き割り.html in Shift-JIS.
"they to conform to" above should be "they conform to".
mod_fileiri looks unmaintained, should we keep the reference to it?
The reference to Internet Explorer and Netscape should probably be removed.
You can run a basic check to see whether IDNs work on your system using this simple test.
^ There should be a more up-to-date test.
Different browsers use different strategies to determine whether the URI should be shown in Unicode or punycode.
There is a similar issue with the use of simplified vs. traditional characters in the Chinese Han script.
This isn't a huge problem, because if a character isn't unified, most people who know Simplified Chinese or Traditional Chinese can easily see the difference. The bigger problem are things like Kangxi radicals (such as U+2F04 乙 and U+4E59 乙) and duplicate encoded characters (such as 㘽 U+363D and 㦳 U+39B3), because the glyphs are often the same. Also, some registries solves this by making the simplified and traditional characters equivalent (see 1 and 2).
There are some improvements needed to the specifications for IDN and IRIs, and these are currently being discussed. For example, there is a need to extend the range of Unicode characters that can be used in domain names to cover later versions of Unicode, and to allow combining characters at the end of labels in right to left scripts.
What's the status of this? Is this essentially IDNA2008? ^
"ICANN Guidelines for the Implementation of Internationalized Domain Names Version 2.1" should be updated. There is now a newer version.
The link to "IDN and IRI test pages" has been moved.
The link to "IDN-enabled TLDs supported by Mozilla.org" should be updated.
It might be useful to add or link to related information about the differences between IDNA2003, IDNA2008, and UTS #46. For example, 2003 is locked to Unicode version 3.2, while 2008 supports code points that appear in new versions of Unicode; 2003 normalizes ß to ss while 2008 makes it a valid character.
A link to UTS #46 should be added in the Further Reading section.
Examples of registered IDNs
IDN and URI [PDF], Michel Suignard
Opera International Domain Name support
Safari International Domain Name support
These four links are broken. ^
I can raise a PR to fix some of the issues above.
The text was updated successfully, but these errors were encountered:
[source] (https://www.w3.org/International/articles/idn-and-iri/) [en]
Some information in this article needs to be updated, like:
mod_fileiri
looks unmaintained, should we keep the reference to it?UTS #46
. For example, 2003 is locked to Unicode version 3.2, while 2008 supports code points that appear in new versions of Unicode; 2003 normalizes ß to ss while 2008 makes it a valid character.UTS #46
should be added in the Further Reading section.I can raise a PR to fix some of the issues above.
The text was updated successfully, but these errors were encountered: