Skip to content

Commit

Permalink
Minor tweak to ascending code unit sort.
Browse files Browse the repository at this point in the history
Made the example clear that the code units are in the UTF-16 character
encoding.
  • Loading branch information
aphillips committed Feb 7, 2023
1 parent e16100d commit 6da1a39
Showing 1 changed file with 1 addition and 1 deletion.
2 changes: 1 addition & 1 deletion index.html
Original file line number Diff line number Diff line change
Expand Up @@ -2864,7 +2864,7 @@ <h4>Program Internal Sorting</h4>
&#x1f63a; (U+1F63A)
</pre>

<p>In ascending <em>code unit order</em>, the character U+1F63A is encoded as the code unit sequence <code>0xD83D 0xDE3A</code>, so the strings sort like:</p>
<p>In ascending <em>code unit order</em> in the UTF-16 character encoding, the code point U+1F63A is encoded as the code unit sequence <code>0xD83D 0xDE3A</code> (a [=surrogate pair=]), so the strings sort like:</p>
<pre>
&#x1f63a; (0xD83D 0xDE3A)
&#xff5e; (0xFF5E)
Expand Down

0 comments on commit 6da1a39

Please sign in to comment.