Changes for version 0.67

  • bolster Tokenizer sentence detection, adding list of abbreviations from Linga::EN::Tagger.
  • fix missing 'lang' param for SpellCheck
  • fix placement of dSP macro in tokenize() C func to properly scope stack variables.
  • add slurp() method to Search::Tools

Modules

high-performance tools for building search applications
locate the best matches in a snippet extract
highlight terms in text
(**DEPRECATED**) extract keywords from a search query
Class::Accessor::Fast-compatible accessors
base class for Search::Tools objects
objectified string for highlighting, snipping, etc.
convert string queries into objects
regular expressions for terms
(**DEPRECATED**) build regular expressions from search queries
(**DEPRECATED**) access regular expressions for a keyword
(**DEPRECATED**) access regular expressions for keywords
extract terms in context
offer spelling suggestions
a token object returned from a TokenList
a bunch of tokens from a Tokenizer
a bunch of tokens from a Tokenizer
mixin methods for TokenList and TokenListPP
a token object returned from a TokenList
split a string into meaningful tokens
transliterations of UTF-8 chars
UTF8 string wrangling
methods for playing nice with XML and HTML