The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

uni - command-line utility to find or display Unicode characters

VERSION

version 9.005

DESCRIPTION

  $ uni ☺
  263A ☺ WHITE SMILING FACE

  # Only on Perl 5.14+
  $ uni wry
  1F63C <U+1F63C> CAT FACE WITH WRY SMILE

uni has several modes of operation:

DWIM Mode

By default, uni will interpret your arguments usefully. If the only argument is a single character, it will be looked up. Otherwise, it will work in name search mode, with the exception that search terms comprised entirely of hex digits are allowed to match against the codepoint's numeric value.

Single Character Mode

  $ uni -s SINGLE-CHAR

This will print out the name and codepoint of the character.

  $ uni -s ¿
  ¿ - U+000BF - INVERTED QUESTION MARK

Name Search Mode

  $ uni -n SOME /SEARCH/ TERMS

This one will look for codepoints where each term appears as a (\b-bounded) word in the name. If the term is bounded by slashes, it's treated as a regular expression and is used to filter candidate codepoints by name.

  $ uni -n roman five
  Ⅴ - U+02164 - ROMAN NUMERAL FIVE
  Ⅾ - U+0216E - ROMAN NUMERAL FIVE HUNDRED
  ⅴ - U+02174 - SMALL ROMAN NUMERAL FIVE
  ⅾ - U+0217E - SMALL ROMAN NUMERAL FIVE HUNDRED
  ↁ - U+02181 - ROMAN NUMERAL FIVE THOUSAND

String Decomposition

  $ uni -c SOME STRINGS

This prints out the codepoints in each string, with a blank line between each argument's codepoints.

  $ uni -c Hey リコ
  H - U+00048 - LATIN CAPITAL LETTER H
  e - U+00065 - LATIN SMALL LETTER E
  y - U+00079 - LATIN SMALL LETTER Y

  リ- U+030EA - KATAKANA LETTER RI
  コ- U+030B3 - KATAKANA LETTER KO

Lookup By Codepoint

  $ uni -u NUMBERS IN HEX

This prints out the codepoint for each given hex value.

  $ uni -u FF 1FF 10FF
  ÿ - U+000FF - LATIN SMALL LETTER Y WITH DIAERESIS
  ǿ - U+001FF - LATIN SMALL LETTER O WITH STROKE AND ACUTE
  ჿ - U+010FF - GEORGIAN LETTER LABIAL SIGN

PERL VERSION SUPPORT

This module has the same support period as perl itself: it supports the two most recent versions of perl. (That is, if the most recently released version is v5.40, then this module should work on both v5.40 and v5.38.)

Although it may work on older versions of perl, no guarantee is made that the minimum required version will not be increased. The version may be increased for any reason, and there is no promise that patches will be accepted to lower the minimum required perl.

NOTES

If you'd like to search for Emojis in Unicode 6.0, please upgrade to Perl 5.14!

ACKNOWLEDGEMENTS

This is a re-implementation of a program written by Audrey Tang in Taiwan. I used that program for years before deciding I wanted to add a few features, which I did by rewriting from scratch.

That program, in turn, was a re-implementation of a same-named program Larry copied to me, which accompanied Audrey for years. However, that program was lost during a hard disk failure, so she coded it up from memory.

Thank-you, Larry, for everything. ♡

AUTHOR

Ricardo Signes <rjbs@semiotic.systems>

COPYRIGHT AND LICENSE

Ricardo Signes has dedicated the work to the Commons by waiving all of his or her rights to the work worldwide under copyright law and all related or neighboring legal rights he or she had in the work, to the extent allowable by law.

Works under CC0 do not require attribution. When citing the work, you should not imply endorsement by the author.