Commit

[] (0) Elaborate on the rules for ASCII-compatible encodings (see last checkin). (credit: pt)

git-svn-id: http://svn.whatwg.org/webapps@3332 340c8d12-0b0e-0410-8428-c7bf67bfef74
Hixie committed Jun 28, 2009
1 parent bdae7b7 commit 3ff70fb
Showing 2 changed files with 7 additions and 6 deletions.
6 changes: 3 additions & 3 deletions index
@@ -1705,13 +1705,13 @@ a.<a href=#dom-uda-protocol title=dom-uda-protocol>protocol</a> = 'https'; // ch
the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
character sets we want to support do things outside that range?
- -->, ignoring cases where those bytes would be part of multibyte
- sequences. <a href=#refsRFC1345>[RFC1345]</a></p>
+ -->, ignoring the second and later bytes of multibyte sequences. <a href=#refsRFC1345>[RFC1345]</a></p>

<p class=note>This includes such exotic encodings as Shift_JIS and
variants of ISO-2022, even though it is possible for bytes like 0x70
to be part of longer sequences that are unrelated to their
- interpretation as ASCII.</p>
+ interpretation as ASCII. It excludes such encodings as UTF-7,
+ UTF-16, HZ-GB-2312, GSM03.38, and EBCDIC variants.</p>

<!--
We'll have to change that if anyone comes up with a way to have a
7 changes: 4 additions & 3 deletions source
@@ -730,13 +730,14 @@ a.<span title="dom-uda-protocol">protocol</span> = 'https'; // change just the s
the set 0x09, 0x0A, 0x0C, 0x0D, 0x20 - 0x22, 0x26, 0x27, 0x2C -
0x3F, 0x41 - 0x5A, and 0x61 - 0x7A<!-- is that list ok? do any
character sets we want to support do things outside that range?
- -->, ignoring cases where those bytes would be part of multibyte
- sequences. <a href="#refsRFC1345">[RFC1345]</a></p>
+ -->, ignoring the second and later bytes of multibyte sequences. <a
+ href="#refsRFC1345">[RFC1345]</a></p>

<p class="note">This includes such exotic encodings as Shift_JIS and
variants of ISO-2022, even though it is possible for bytes like 0x70
to be part of longer sequences that are unrelated to their
- interpretation as ASCII.</p>
+ interpretation as ASCII. It excludes such encodings as UTF-7,
+ UTF-16, HZ-GB-2312, GSM03.38, and EBCDIC variants.</p>

<!--
We'll have to change that if anyone comes up with a way to have a
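Editorial sketch (not part of the commit): the rule in the hunks above can be roughly approximated in Python by decoding each listed byte on its own and comparing the result with its ASCII interpretation. The helper name `looks_ascii_compatible`, the `CHECKED_BYTES` constant, and the reliance on Python's codec names (`shift_jis`, `cp037`, and so on) are assumptions made for illustration; the spec text does not define an algorithm. Each byte is decoded from the decoder's initial state, so this is at best a necessary condition and may not flag stateful encodings such as UTF-7 or HZ-GB-2312.

```python
# Byte values listed in the spec text: 0x09, 0x0A, 0x0C, 0x0D,
# 0x20-0x22, 0x26, 0x27, 0x2C-0x3F, 0x41-0x5A, and 0x61-0x7A.
CHECKED_BYTES = (
    [0x09, 0x0A, 0x0C, 0x0D, 0x26, 0x27]
    + list(range(0x20, 0x23))   # 0x20 - 0x22
    + list(range(0x2C, 0x40))   # 0x2C - 0x3F
    + list(range(0x41, 0x5B))   # 0x41 - 0x5A
    + list(range(0x61, 0x7B))   # 0x61 - 0x7A
)


def looks_ascii_compatible(codec_name: str) -> bool:
    """Hypothetical helper: True if every checked byte, decoded alone,
    yields the same character it would in ASCII.

    Only the initial decoder state is probed, so bytes that appear as the
    second or later byte of a multibyte sequence are never examined.
    An unknown codec name raises LookupError, which is left to propagate.
    """
    for value in CHECKED_BYTES:
        try:
            decoded = bytes([value]).decode(codec_name)
        except UnicodeDecodeError:
            # The byte is not a complete character on its own in this codec.
            return False
        if decoded != chr(value):
            return False
    return True
```

Under these assumptions, `looks_ascii_compatible("shift_jis")` returns `True`, while `"utf-16"` and the EBCDIC code page `"cp037"` return `False`, which matches the note's examples for those three encodings.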
