Marvin Humphrey
and 1 contributors

NAME

KinoSearch::Analysis::Stemmer - Reduce related words to a shared root.

DEPRECATED

The KinoSearch code base has been assimilated by the Apache Lucy project. The "KinoSearch" namespace has been deprecated, but development continues under our new name at our new home: http://lucy.apache.org/

SYNOPSIS

    my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' );
    
    my $polyanalyzer = KinoSearch::Analysis::PolyAnalyzer->new(
        analyzers => [ $case_folder, $tokenizer, $stemmer ],
    );

This class is a wrapper around Lingua::Stem::Snowball, so it supports the same languages.

DESCRIPTION

Stemmer is an Analyzer which reduces related words to a root form (using the "Snowball" stemming library). For instance, "horse", "horses", and "horsing" all become "hors" -- so that a search for 'horse' will also match documents containing 'horses' and 'horsing'.

CONSTRUCTORS

new( [labeled params] )

    my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' );
  • language - A two-letter ISO code identifying a language supported by Snowball.

INHERITANCE

KinoSearch::Analysis::Stemmer isa KinoSearch::Analysis::Analyzer isa KinoSearch::Object::Obj.

COPYRIGHT AND LICENSE

Copyright 2005-2011 Marvin Humphrey

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.