@@ -4,27 +4,30 @@
 
 =SUBTITLE Unicode Support in Perl 6
 
-Perl 6 has a high level of support of Unicode. It supports extended grapheme
-clusters for breaking up words.
+Perl 6 has a high level of support of Unicode.
 
 You can access Unicode codepoints by name:
 Rakudo supports all Unicode 9.0 names.
 
     say "\c[PENGUIN]" # 🐧
 
-By name alias:
+By name alias. For full list see L<here|http://www.unicode.org/Public/UCD/latest/ucd/NameAliases.txt>.
 
     say "\c[ALERT]" # Not visible (U+0007 control code (also accessible as \a))
     say "\c[BELL]" # 🔔 (U+1F514 BELL)
 
-You can also use any of the L<Named Sequences|http://www.unicode.org/Public/UCD/latest/ucd/NamedSequences.txt>, these are not single codepoints, but sequences of them.
+You can also use any of the L<Named Sequences|http://www.unicode.org/Public/UCD/latest/ucd/NamedSequences.txt>,
+these are not single codepoints, but sequences of them.
 (In Rakudo blead but not in the latest release of the compiler yet).
 
     say "\c[LATIN CAPITAL LETTER E WITH VERTICAL LINE BELOW AND ACUTE]" # É̩
     say "\c[LATIN CAPITAL LETTER E WITH VERTICAL LINE BELOW AND ACUTE]".ords # (201 809)
 
-You can also use L<Emoji ZWJ Sequences|http://www.unicode.org/Public/emoji/4.0/emoji-zwj-sequences.txt> and L<Emoji Sequences|http://www.unicode.org/Public/emoji/4.0/emoji-sequences.txt>
+You can also use
+L<Emoji ZWJ Sequences|http://www.unicode.org/Public/emoji/4.0/emoji-zwj-sequences.txt>
+and L<Emoji Sequences|http://www.unicode.org/Public/emoji/4.0/emoji-sequences.txt>
 
-Additionally, in Rakudo blead and in the next monthly release of Perl 6 these names are all now case-insensitive:
+Additionally, in Rakudo blead and in the next monthly release of Perl 6 all
+Unicode codepoint names/named seq/emoji sequences are now case-insensitive:
 
     say "\c[boy]" # 👦 (U+1F466)