package Task::Unicode;
use 5.006;
use strict;
use warnings;
our $VERSION = '0.01';
1;
__END__

=encoding utf8

=head1 NAME

Task::Unicode - Everything needed to work with Unicode data

=head1 VERSION

This document describes Task::Unicode version 0.01.

=head1 DESCRIPTION

This L<Task> module installs everything needed for working with Unicode and
UTF-8 strings as well as a number of modules and utilities to assist in
development and debugging. It is not intended to be an all-inclusive bundle
of Unicode modules on the CPAN. Instead, it is a collection of the essential
and some of the most useful general-purpose modules.

This is an early release of Task::Unicode and the bundled module list is not
yet complete. Modules may be added or removed. Please contact the author
with suggestions for upcoming releases.

The brief module descriptions are currently taken directly from each module.
In the future, these will be slightly expanded and explained for those with
less Unicode knowledge.

=head2 MODULES

=over

=item L<Unicode::Collate::Locale>

Linguistic tailoring for DUCET via Unicode::Collate

=item L<Unicode::GCString>

String as Sequence of UAX #29 Grapheme Clusters

=item L<Unicode::LineBreak>

Line Folding for Plain Text

=item L<Unicode::Stringprep>

Preparation of Internationalized Strings (RFC 3454)

=item L<utf8::all>

Turn on Unicode—all of it

=back

=head2 DEBUGGING

=over

=item L<String::Dump>

Dump strings of characters or bytes for printing and debugging

=item L<Encode::DoubleEncodedUTF8>

Fix double-encoded UTF-8 bytes to the correct one

=back

=head2 UTILITIES

=over

=item L<App::Uni>

Command-line utility to grep UnicodeData.txt

=item L<Unicode::Tussle>

Tom’s Unicode Scripts So Life is Easier (only installed with Perl 5.14+)

=back

=head2 SEE ALSO

This task requires Perl 5.8.0, which bundles the following important modules.

=over

=item charnames

Access to Unicode character names and named character sequences; also define
character names

=item utf8

Perl pragma to enable/disable UTF-8 (or UTF-EBCDIC) in source code

=item open

Perl pragma to set default PerlIO layers for input and output

=item Encode

Character encodings in Perl

=item Unicode::Collate

Unicode Collation Algorithm

=item Unicode::Normalize

Unicode Normalization Forms

=item Unicode::UCD

Unicode character database

=back
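
As a minimal sketch of how several of the bundled core modules listed above
can be combined, the following normalizes a few strings with
Unicode::Normalize, sorts them with Unicode::Collate, and looks up a
character name with charnames. The sample words and variable names are
illustrative assumptions and are not part of this distribution.

    use strict;
    use warnings;
    use utf8;                               # this source contains UTF-8 literals
    use open qw( :std :encoding(UTF-8) );   # UTF-8 layers on STDIN/STDOUT/STDERR
    use charnames ();                       # load charnames::viacode()
    use Unicode::Normalize qw( NFC );
    use Unicode::Collate;

    # Hypothetical sample data; the first word uses a combining acute accent.
    my @words = ("e\x{301}clair", 'zebra', 'Apple', 'éclat');

    # Compose to NFC so equivalent sequences share one representation.
    my @normalized = map { NFC($_) } @words;

    # Sort with the default Unicode Collation Algorithm ordering
    # instead of raw code point order.
    my $collator = Unicode::Collate->new;
    print "$_\n" for $collator->sort(@normalized);

    # Report the code point and official name of a single character.
    my $char = 'é';
    printf "U+%04X %s\n", ord($char), charnames::viacode(ord($char));

Locale-specific ordering would instead go through
L<Unicode::Collate::Locale>, which this task installs.
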

=head1 AUTHOR

Nick Patch <patch@cpan.org>

=head1 COPYRIGHT AND LICENSE

© 2012 Nick Patch

This library is free software; you can redistribute it and/or modify it under
the same terms as Perl itself.

=cut