17 results (1.07 seconds)
Lingua::Stem::Snowball - Perl interface to Snowball stemmers.

Stemming reduces related words to a common root form -- for instance, "horse", "horses", and "horsing" all become "hors". Most commonly, stemming is deployed as part of a search application, allowing searches for a given term to match documents which...

CREAMYG/Lingua-Stem-Snowball-0.952 - 28 Aug 2008 08:17:59 GMT

Lingua::Stem::Snowball::Da - Porter's stemming algorithm for Denmark

The stem function takes a scalar as a parameter and stems the word according to Martin Porter's Danish stemming algorithm, which can be found at the Snowball website: <http://snowball.tartarus.org/>. It also supports caching if you pass the use_cache ...

CINE/Lingua-Stem-Snowball-Da-1.01 - 05 Mar 2003 02:47:51 GMT

Lingua::Stem::Snowball::No - Porter's stemming algorithm for Norwegian

The stem function takes a scalar as a parameter and stems the word according to Martin Porter's Norwegian stemming algorithm, which can be found at the Snowball website: <http://snowball.tartarus.org/>. It also supports caching if you pass the use_cac...

ASKSH/Snowball-Norwegian-1.2 - 08 May 2007 19:55:54 GMT

Lingua::Stem::Snowball::Se - Porter's stemming algorithm for Swedish

The stem function takes a scalar as a parameter and stems the word according to Martin Porter's Swedish stemming algorithm, which can be found at the Snowball website: <http://snowball.tartarus.org/>. It also supports caching if you pass the use_cache...

ASKSH/Snowball-Swedish-1.2 - 08 May 2007 20:03:51 GMT

Lingua::Stem::Snowball::Lt - Perl interface to Snowball stemmer for the Lithuanian language.

Stemming reduces related words to a common root form -- for instance, "horse", "horses", and "horsing" all become "hors". Most commonly, stemming is deployed as part of a search application, allowing searches for a given term to match documents which...

LVALIUKAS/Lingua-Stem-Snowball-Lt-0.03 - 27 Nov 2013 15:34:22 GMT

Plucene::Plugin::Analyzer::SnowballAnalyzer - Stemmed analyzer with Lingua::Stem::Snowball and Lingua::StopWords

Filters StandardTokenizer with SnowballAnalyzer. Change $Plucene::Plugin::Analysis::SnowballAnalyzer::LANG to the language of your choice. (see Lingua::Stem::Snowball documentation for all available languages). EXAMPLE #!/usr/bin/perl use strict; use...

FABPOT/Plucene-Plugin-Analyzer-SnowballAnalyzer-1.1 - 01 May 2004 09:12:49 GMT

Lingua::Stem - Stemming of words

This routine applies stemming algorithms to its parameters, returning the stemmed words as appropriate to the selected locale. You can import some or all of the class methods. use Lingua::Stem qw (stem clear_stem_cache stem_caching add_exceptions del...

SNOWHARE/Lingua-Stem-0.84 (1 review) - 29 Apr 2010 21:19:59 GMT

Lingua::Stem::Any - Unified interface to any stemmer on CPAN

This module aims to provide a simple unified interface to any stemmer on CPAN. It will provide a default available source module when a language is requested but no source is requested. Attributes All attribute-setting methods can be chained. $stem =...

PATCH/Lingua-Stem-Any-0.03 - 05 Jun 2014 17:32:28 GMT

Text::MultiPhone - Package to retrieve the phonetics of a word

This is yet another solution to the problem of phonetic similarities. In contrast to Soundex or Metaphone, vowels matter, and it is thus more useful for other (Germanic?) languages. In languages, there are often cases where an automated phonetic anal...

HEIKOK/Text-MultiPhone-0.01 - 11 Apr 2005 06:37:37 GMT

Lingua::Stem::UniNE::CS - Czech stemmer

Light and aggressive stemmers for the Czech language. The light stemmer removes grammatical case endings from nouns and adjectives, possessive adjective endings from names, and takes care of palatalization. The aggressive stemmer also removes diminut...

PATCH/Lingua-Stem-UniNE-0.07 - 14 May 2014 20:38:04 GMT

Lingua::StopWords - Stop words for several languages.

In keyword search, it is common practice to suppress a collection of "stopwords": words such as "the", "and", "maybe", etc. which exist in a large number of documents and do not tell you anything important about any document which contains them. T...

CREAMYG/Lingua-StopWords-0.09 (1 review) - 22 Aug 2008 15:34:58 GMT

Text::StemTagPOS - Computes stemmed/POS tagged lists of text.

"Text::StemTagPOS" uses the modules Lingua::Stem::Snowball and Lingua::EN::Tagger to do part-of-speech tagging and stemming of English text. It was developed to pre-process text for other modules. Encoding of all text should be in Perl's internal for...

KUBINA/Text-StemTagPOS-0.61 - 31 Dec 2011 13:41:21 GMT

Search::Tokenizer - Decompose a string into tokens (words)

This module builds an iterator function that will progressively extract terms from a given input string. Terms are defined by a regular expression (for example "\w+"). Term matching relies on the builtin "global match" operator of Perl (the 'g' flag)...

DAMI/Search-Tokenizer-1.01 - 15 Feb 2013 18:57:49 GMT

KinoSearch1::Analysis::Stemmer - reduce related words to a shared root

Stemming reduces words to a root form. For instance, "horse", "horses", and "horsing" all become "hors" -- so that a search for 'horse' will also match documents containing 'horses' and 'horsing'. This class is a wrapper around Lingua::Stem::Snowball...

CREAMYG/KinoSearch1-1.01 (2 reviews) - 28 Oct 2010 05:26:42 GMT

KinoSearch::Analysis::Stemmer - Reduce related words to a shared root.

Stemmer is an Analyzer which reduces related words to a root form (using the "Snowball" stemming library). For instance, "horse", "horses", and "horsing" all become "hors" -- so that a search for 'horse' will also match documents containing 'horses' ...

CREAMYG/KinoSearch-0.315 (4 reviews) - 16 Apr 2012 21:20:13 GMT

Text::Categorize::Textrank::En - Find potential keywords in English text.

"Text::Categorize::Textrank::En" provides methods for ranking the words in English text as potential keywords. It implements a version of the textrank algorithm from the report *TextRank: Bringing Order into Texts* by R. Mihalcea and P. Tarau. Encodi...

KUBINA/Text-Categorize-Textrank-0.51 - 12 Mar 2012 17:12:15 GMT

Data::Classifier::NaiveBayes::Tokenizer

Data::Classifier::NaiveBayes METHODS SEE ALSO Moose, Lingua::Stem::Snowball AUTHOR Logan Bell, "<logie@cpan.org>" COPYRIGHT & LICENSE Copyright 2012, Logan Bell This program is free software; you can redistribute it and/or modify it under the same te...

LOGIE/Data-Classifier-NaiveBayes-0.001 - 05 Apr 2012 06:02:07 GMT



