NAME

Unicode::Normalize - normalized forms of Unicode text

SYNOPSIS

  use Unicode::Normalize;

  $string_NFD  = NFD($raw_string);  # Normalization Form D
  $string_NFC  = NFC($raw_string);  # Normalization Form C
  $string_NFKD = NFKD($raw_string); # Normalization Form KD
  $string_NFKC = NFKC($raw_string); # Normalization Form KC

   or

  use Unicode::Normalize 'normalize';

  $string_NFD  = normalize('D',  $raw_string);  # Normalization Form D
  $string_NFC  = normalize('C',  $raw_string);  # Normalization Form C
  $string_NFKD = normalize('KD', $raw_string);  # Normalization Form KD
  $string_NFKC = normalize('KC', $raw_string);  # Normalization Form KC

DESCRIPTION

$string_NFD = NFD($raw_string)

Returns Normalization Form D of $raw_string (formed by canonical decomposition).

$string_NFC = NFC($raw_string)

Returns Normalization Form C of $raw_string (formed by canonical decomposition, followed by canonical composition).

$string_NFKD = NFKD($raw_string)

Returns Normalization Form KD of $raw_string (formed by compatibility decomposition).

$string_NFKC = NFKC($raw_string)

Returns Normalization Form KC of $raw_string (formed by compatibility decomposition, followed by canonical composition).
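
The difference between canonical and compatibility forms is easiest to see on concrete characters. The following is a minimal sketch, not part of the original documentation; the helper codepoints() is a hypothetical name introduced here only to print code points as ASCII.

```perl
use strict;
use warnings;
use Unicode::Normalize;   # NFD, NFC, NFKD, NFKC are exported by default

# Hypothetical helper (not part of Unicode::Normalize):
# render a string as a space-separated list of code points.
sub codepoints { join ' ', map { sprintf 'U+%04X', ord } split //, shift }

my $raw = "\x{E9}";       # LATIN SMALL LETTER E WITH ACUTE
my $nfd = NFD($raw);      # canonical decomposition: "e" + COMBINING ACUTE ACCENT
my $nfc = NFC($nfd);      # decomposition, then canonical composition

print codepoints($nfd), "\n";   # U+0065 U+0301
print codepoints($nfc), "\n";   # U+00E9

my $lig = "\x{FB01}";     # LATIN SMALL LIGATURE FI (compatibility character)

print codepoints(NFD($lig)),  "\n";   # U+FB01 (no canonical decomposition)
print codepoints(NFKD($lig)), "\n";   # U+0066 U+0069 (plain "fi")
```

The canonical forms (D and C) leave the ligature alone, while the compatibility forms (KD and KC) replace it with the plain letters.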

$normalized_string = normalize($form_name, $raw_string)

$form_name must be one of the following names:

  'C'  or 'NFC'  for Normalization Form C
  'D'  or 'NFD'  for Normalization Form D
  'KC' or 'NFKC' for Normalization Form KC
  'KD' or 'NFKD' for Normalization Form KD
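
As a rough illustration of the table above, the sketch below passes each short name and its long equivalent to normalize() and checks that both give the same result. The sample string and variable names are assumptions chosen for this example.

```perl
use strict;
use warnings;
use Unicode::Normalize 'normalize';   # normalize is exported only on request

# Sample input (an assumption for this sketch):
# "A" followed by COMBINING RING ABOVE.
my $raw = "A\x{30A}";

my %result;
for my $form (qw(C D KC KD)) {
    my $short = normalize($form,     $raw);   # e.g. 'C'
    my $long  = normalize("NF$form", $raw);   # e.g. 'NFC'
    die "names disagree for form $form" unless $short eq $long;
    $result{$form} = $short;
    printf "%-2s => %d character(s)\n", $form, length $short;
}
```

Selecting the form by name is convenient when it comes from configuration or user input rather than being fixed in the code.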

EXPORT

NFC, NFD, NFKC, NFKD: by default.

normalize: on request.
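
A short sketch of how the two export groups can be combined. It assumes the module follows the usual Exporter behaviour, where an explicit import list replaces the default one, so the default functions must be listed alongside normalize if both are wanted.

```perl
use strict;
use warnings;

# Request normalize explicitly and list NFC as well, on the assumption
# that an explicit import list suppresses the default exports.
use Unicode::Normalize qw(NFC normalize);

my $raw = "e\x{301}";                      # "e" + COMBINING ACUTE ACCENT

my $via_function  = NFC($raw);             # default-exported function
my $via_normalize = normalize('C', $raw);  # on-request function

print $via_function eq $via_normalize ? "same\n" : "different\n";   # same
```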

AUTHOR

SADAHIRO Tomoyuki, <SADAHIRO@cpan.org>

  http://homepage1.nifty.com/nomenclator/perl/

  Copyright(C) 2001, SADAHIRO Tomoyuki. Japan. All rights reserved.

  This program is free software; you can redistribute it and/or 
  modify it under the same terms as Perl itself.

SEE ALSO

Lingua::KO::Hangul::Util

utility functions for Hangul Syllables

http://www.unicode.org/unicode/reports/tr15/

Unicode Normalization Forms - UAX #15