The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Text::TEI::Collate::Lang - base class for collation language-specific extensions

DESCRIPTION

Text::TEI::Collate::Lang is the base package for any language extension (e.g. Text::TEI::Collate::Lang::Latin) to be used by Text::TEI::Collate. The base package provides three subroutines as default; any implementation should re-implement one or more of these functions, and can use the ones defined here otherwise. This would be a base class to subclass if we ever had a reason to instantiate it.

SUBROUTINES

distance

This is a rudimentary, and hopefully pretty quick, word distance function. It counts the occurrence of each letter in a word, and returns the sum of lettercount differences between the two passed words.

canonizer

This is essentially just the lc() builtin function.

comparator

This is a function that replaces all characters with their base character after an NFKD (Normalization Form Compatibility Decomposition) operation.

AUTHOR

Tara L Andrews <aurum@cpan.org>

1 POD Error

The following errors were encountered while parsing the POD:

Around line 37:

Non-ASCII character seen before =encoding in ''αι̣τια̣ν̣','. Assuming CP1252