Unicode::UTF8 - Encoding and decoding of UTF-8 encoding form.
    use Unicode::UTF8 qw[decode_utf8 encode_utf8];

    $string = decode_utf8($octets);
    $octets = encode_utf8($string);
This module provides functions to encode and decode UTF-8 encoding form as defined by Unicode and ISO/IEC 10646:2011.
Returns a decoded representation of $octets in UTF-8 encoding as a character string.
Throws an exception if $octets contains an ill-formed UTF-8 sequence or code points which can't be interchanged.
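A minimal sketch of decoding with error handling, assuming the exception behaviour described above. The sample octet strings are illustrative. The `use warnings FATAL => 'utf8'` line is an assumption: some later releases of the module may report these conditions as warnings in the `utf8` category instead of dying, and this pragma should turn them into exceptions either way.

```perl
use strict;
use warnings;
use warnings FATAL => 'utf8';   # assumption: also fatalizes utf8-category warnings
use Unicode::UTF8 qw[decode_utf8];

# Well-formed input: U+263A encoded as three octets.
my $string = decode_utf8("\xE2\x98\xBA");
printf "decoded %d character(s)\n", length $string;

# Ill-formed input: a lone continuation octet cannot start a sequence.
my $ok = eval { decode_utf8("\x80"); 1 };
print "decode failed: $@" unless $ok;
```

Wrapping the call in `eval` keeps one bad input from terminating the program; the error text in `$@` can then be logged or rethrown.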
Returns an encoded representation of $string in UTF-8 encoding as an octet string.
Throws an exception if $string contains code points which can't be interchanged or represented in UTF-8 encoding form.
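As a rough illustration of how the two functions pair up, the sketch below encodes a character string and decodes it back. The sample text and variable names are illustrative only.

```perl
use strict;
use warnings;
use Unicode::UTF8 qw[decode_utf8 encode_utf8];

# A character string: "caf", U+00E9, a space and U+263A (6 characters).
my $string = "caf\x{E9} \x{263A}";

# Encoding yields an octet string; U+00E9 takes 2 octets, U+263A takes 3.
my $octets = encode_utf8($string);
printf "%d characters, %d octets\n", length $string, length $octets;

# Decoding the octets gives back the original character string.
my $round_trip = decode_utf8($octets);
print "round trip ok\n" if $round_trip eq $string;
```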
None by default. All functions can be exported using the :all tag or individually.
(F) Wide character in octets.
(F) Encountered an ill-formed octet sequence.
(F) Noncharacters are permanently reserved for internal use and should never be interchanged. Noncharacters consist of the values U+nFFFE and U+nFFFF (where n is from 0 to 10 hexadecimal, that is 0x10) and the values U+FDD0..U+FDEF.
(F) Surrogate code points are designated only for surrogate code units in the UTF-16 character encoding form. Surrogates consist of code points in the range U+D800 to U+DFFF.
(F) Code points in the range U-00110000 to U-7FFFFFFF.
ISO/IEC 10646 originally defined codespace up to U-7FFFFFFF. This was restricted by JTC1/SC2/WG2 Resolution M38.6 (Restriction of encoding space) to U-0010FFFF in 2000.
(F) Code points in the range 2^31 to 2^64-1, which belong to Perl's extended codespace.
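The ranges named in these diagnostics can be summarised in a few lines of plain Perl. This is only a sketch: `classify_code_point` and its category labels are hypothetical and not part of Unicode::UTF8, and the module's own checks may be implemented differently.

```perl
use strict;
use warnings;

# Hypothetical helper (not part of Unicode::UTF8): returns the category a
# code point falls into, using the ranges given in the diagnostics above.
sub classify_code_point {
    my ($cp) = @_;
    return 'surrogate'     if $cp >= 0xD800 && $cp <= 0xDFFF;
    return 'perl-extended' if $cp >= 0x8000_0000;          # 2^31 and above
    return 'non-unicode'   if $cp >= 0x11_0000;            # U-00110000..U-7FFFFFFF
    return 'noncharacter'  if $cp >= 0xFDD0 && $cp <= 0xFDEF;
    return 'noncharacter'  if ($cp & 0xFFFE) == 0xFFFE;    # U+nFFFE and U+nFFFF
    return 'interchangeable';
}

printf "U+%04X: %s\n", $_, classify_code_point($_)
    for 0x41, 0xD800, 0xFDD0, 0xFFFE, 0x10FFFF, 0x110000, 0x80000000;
```

The mask test works because the low 16 bits of every U+nFFFE and U+nFFFF value are FFFE or FFFF, whatever the plane number n is.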
Please report any bugs or feature requests by email to bug-unicode-utf8 at rt.cpan.org, or through the web interface at http://rt.cpan.org/Public/Dist/Display.html?Name=Unicode-UTF8. You will be automatically notified of any progress on the request by the system.
This is open source software. The code repository is available for public review and contribution under the terms of the license.
http://github.com/chansen/p5-unicode-utf8
git clone http://github.com/chansen/p5-unicode-utf8
Christian Hansen chansen@cpan.org
Copyright 2011 by Christian Hansen.
This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.