The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

KinoSearch::Analysis::Stemmer - Reduce related words to a shared root.

SYNOPSIS

    my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' );
    
    my $polyanalyzer = KinoSearch::Analysis::PolyAnalyzer->new(
        analyzers => [ $case_folder, $tokenizer, $stemmer ],
    );

This class is a wrapper around Lingua::Stem::Snowball, so it supports the same languages.

DESCRIPTION

Stemmer is an Analyzer which reduces related words to a root form (using the "Snowball" stemming library). For instance, "horse", "horses", and "horsing" all become "hors" -- so that a search for 'horse' will also match documents containing 'horses' and 'horsing'.

CONSTRUCTORS

new( [labeled params] )

    my $stemmer = KinoSearch::Analysis::Stemmer->new( language => 'es' );
  • language - A two-letter ISO code identifying a language supported by Snowball.

INHERITANCE

KinoSearch::Analysis::Stemmer isa KinoSearch::Analysis::Analyzer isa KinoSearch::Object::Obj.

COPYRIGHT AND LICENSE

Copyright 2005-2010 Marvin Humphrey

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.