Author image 二十日☆鼠 - IKEDA Soji
and 1 contributors

NAME

Encode::ISO2022::CCS - coded character sets for ISO/IEC 2022 character encoding scheme

DESCRIPTION

This document lists encodings which may be used as coded character set (CCS) for ISO/IEC 2022 character encoding scheme.

From view of Perl, a CCS is the Encode::Encoding object performing stateless and fixed-length conversion over 94^n or 96^n code table (By now, Encode::ISO2022 does not support variable length multibyte CCS).

In tables below, "reg#" column indicates the number of registry in ISO International Registry of Coded Character Sets [ISO-IR].

Built-in encodings

  reg# Name                  Description
  ----------------------------------------------------------------
    6  ascii                 ANSI X3.4-1986, equivalent to ISO 646 IRV
   -   ascii-ctrl            ISO 6429 control characters (C0)
   -   c1-ctrl               ISO 6429 control characters (C1)
  ----------------------------------------------------------------

Encode::ISOIRSingle

  reg# Name                  Description
  ----------------------------------------------------------------
   13  jis-x-0201-right      JIS X 0201 katakana set
   14  iso-646-jp            JIS X 0201 Latin set, ISO/IEC 646 JP
  100  iso-8859-1-right      ISO/IEC 8859 no. 1, Latin 1
  101  iso-8859-2-right      ISO/IEC 8859 no. 2, Latin 2
  109  iso-8859-3-right      ISO/IEC 8859 no. 3, Latin 3
  110  iso-8859-4-right      ISO/IEC 8859 no. 4, Latin 4
  144  iso-8859-5-right      ISO/IEC 8859 no. 5, Cyrilic set
  127  iso-8859-6-right      ISO/IEC 8859 no. 6, Arabic set
  126  iso-8859-7-right      ISO/IEC 8859 no. 7, Greek set
  138  iso-8859-8-right      ISO/IEC 8859 no. 8, Hebrew set
  148  iso-8859-9-right      ISO/IEC 8859 no. 9, Latin 5
  157  iso-8859-10-right     ISO/IEC 8859 no. 10, Latin 6
  166  iso-8859-11-right     ISO/IEC 8859 no. 11, Thai set
  179  iso-8859-13-right     ISO/IEC 8859 no. 13, Latin 7
  199  iso-8859-14-right     ISO/IEC 8859 no. 14, Latin 8
  203  iso-8859-15-right     ISO/IEC 8859 no. 15, Latin 9
  226  iso-8859-16-right     ISO/IEC 8859 no. 16, Latin 10
  ----------------------------------------------------------------

Encode::JISLegacy

  reg# Name                  Description
  ----------------------------------------------------------------
   42  jis-x-0208-1978       JIS C 6226-1978, 1st rev. of JIS X 0208
 (159) jis-x-0212-ascii      JIS X 0212-1990 cooperable with ASCII
 (159) jis-x-0212-cp932      JIS X 0212-1990 cooperable with CP932
  ----------------------------------------------------------------

Encodings provided by external modules

Encode::CN

  reg# Name                  Description
  ----------------------------------------------------------------
   58  gb2312-raw            GB2312-80
  n/a  gb12345-raw           GB12345-90
  165  iso-ir-165            CCITT Chinese set
  ----------------------------------------------------------------

Encode::HanExtra

  reg# Name                  Description
  ----------------------------------------------------------------
  n/a  cccii                 CCCII
  171  cns11643-1            CNS 11643 plane 1
  172  cns11643-2            CNS 11643 plane 2
  183  cns11643-3            CNS 11643 plane 3
  184  cns11643-4            CNS 11643 plane 4
  185  cns11643-5            CNS 11643 plane 5
  186  cns11643-6            CNS 11643 plane 6
  187  cns11643-7            CNS 11643 plane 7
  n/a  cns11643-f            CNS 11643 plane 15 or 8
  ----------------------------------------------------------------

Encode::JISX0213::CCS

  reg# Name                  Description
  ----------------------------------------------------------------
   87  jis-x-0208            JIS X 0208-1983, 2nd rev. of JIS X 0208
  168      ditto             JIS X 0208-1990, 3rd rev. of JIS X 0208
       jis-x-0208-ascii
       jis-x-0208-jis
  228  jis-x-0213-2000-plane1  JIS X 0213:2000 level 3 (plane 1)
       jis-x-0213-2000-plane1-ascii
       jis-x-0213-2000-plane1-jis
  233  jis-x-0213-plane1     JIS X 0213:2004 level 3 (plane 1)
       jis-x-0213-plane1-ascii
       jis-x-0213-plane1-jis
  229  jis-x-0213-plane2     JIS X 0213:2000/2004 level 4 (plane 2)
  ----------------------------------------------------------------
  • About "-ascii" and "-jis" variants see documentation of module.

Encode::JP

  reg# Name                  Description
  ----------------------------------------------------------------
   87  jis0208-raw           JIS X 0208-1983, 2nd rev. of JIS X 0208
  168      ditto             JIS X 0208-1990, 3rd rev. of JIS X 0208
  159  jis0212-raw           JIS X 0212-1990
  ----------------------------------------------------------------
  • About compatible encodings taking care of characters with combining marks, see Encode::JISX0213::CCS.

  • Note that GL mapping of jis0201-raw provided by this module is broken. Use iso-646-jp provided by Encode::ISOIRSingle.

Encode::KR

  reg# Name                  Description
  ----------------------------------------------------------------
  149  ksc5601-raw           KS C5601-1987, 1st rev. of KS X 1001
  ----------------------------------------------------------------

SEE ALSO

See modules described above for specific CCS.

[ISO-IR] International Register of Coded Character Sets to be Used with Escape Sequences, http://www.itscj.ipsj.or.jp/ISO-IR/.

Encode, Encode::ISO2022.

AUTHOR

Hatuka*nezumi - IKEDA Soji, <nezumi@cpan.org>

COPYRIGHT AND LICENSE

Copyright (C) 2013 by Hatuka*nezumi - IKEDA Soji

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.