Task::Unicode - Everything needed to work with Unicode data
This document describes Task::Unicode version 0.04.
This Task module installs everything needed for working with Unicode and UTF-8 strings, as well as a number of modules and utilities to assist in development and debugging. It is not intended to be an all-inclusive bundle of the Unicode modules on CPAN. Instead, it is a collection of the essential modules and some of the most useful general-purpose ones.
This is an early release of Task::Unicode, and the bundled module list is not yet complete. Modules may be added or removed. Please contact the author with suggestions for upcoming releases.
The brief module descriptions are currently taken directly from each module. In the future, these will be slightly expanded and explained for those with less Unicode knowledge.
Unicode case-folding for case-insensitive lookups
Linguistic tailoring for DUCET via Unicode::Collate
String as Sequence of UAX #29 Grapheme Clusters
Line Folding for Plain Text
Subtraction and Intersection of Character Sets in Unicode Regular Expressions
Preparation of Internationalized Strings (RFC 3454)
Turn on Unicode—all of it
Dump strings of characters or bytes for printing and debugging
Fix double-encoded UTF-8 bytes to the correct one
Command-line utility to grep UnicodeData.txt
Tom’s Unicode Scripts So Life is Easier (only installed with Perl 5.14+)
This task requires Perl 5.8.0 or later, which bundles the following important modules.
Access to Unicode character names and named character sequences; also define character names
Perl pragma to enable/disable UTF-8 (or UTF-EBCDIC) in source code
Perl pragma to set default PerlIO layers for input and output
Character encodings in Perl
Unicode Collation Algorithm
Unicode Normalization Forms
Unicode character database
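As a rough illustration of how a few of the bundled core modules fit together, here is a minimal sketch. It assumes the descriptions above for character names, character encodings, and normalization forms refer to the core modules charnames, Encode, and Unicode::Normalize. The sample string and variable names are illustrative only and do not come from Task::Unicode itself.

```perl
use strict;
use warnings;
use Encode qw(decode encode);
use Unicode::Normalize qw(NFC NFD);
use charnames ();

# Raw UTF-8 octets for "cafe" with an acute accent on the final letter
# (an illustrative sample string).
my $bytes = "caf\xC3\xA9";

# Decode octets into a character string before doing any text processing.
my $chars = decode('UTF-8', $bytes);

# Normalize to the composed (NFC) and decomposed (NFD) forms.
my $nfc = NFC($chars);
my $nfd = NFD($chars);

# Look up the name of the last code point in the decomposed form.
my $last_name = charnames::viacode(ord substr($nfd, -1));

# Encode back to octets for output or storage.
my $round_trip = encode('UTF-8', $nfc);

printf "NFC length: %d\n", length $nfc;
printf "NFD length: %d\n", length $nfd;
printf "Last NFD code point: %s\n", $last_name;
printf "Round trip matches: %s\n", ($round_trip eq $bytes ? 'yes' : 'no');
```

The two lengths differ because NFD splits the accented letter into a base letter plus a combining mark, which is why normalizing before comparing or measuring strings is usually worthwhile.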
Nick Patch <firstname.lastname@example.org>
© 2012 Nick Patch
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.