The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Changes for version 0.83

  • UTF8::is_sane_utf8() now runs through entire string instead of stopping at first suspect sequence.
  • add Query->unique_terms, ->num_unique_terms, ->phrases, and ->non_phrases methods in aid to HeatMap, which needed a refactor to fix a bug affecting duplicate terms in phrases when stemming was on.

Modules

high-performance tools for building search applications
locate the best matches in a snippet extract
highlight terms in text
Class::Accessor::Fast-compatible accessors
base class for Search::Tools objects
objectified string for highlighting, snipping, etc.
convert string queries into objects
regular expressions for terms
extract terms in context
offer spelling suggestions
a token object returned from a TokenList
a bunch of tokens from a Tokenizer
a bunch of tokens from a Tokenizer
mixin methods for TokenList and TokenListPP
a token object returned from a TokenList
split a string into meaningful tokens
transliterations of UTF-8 chars
UTF8 string wrangling
methods for playing nice with XML and HTML