This routine applies the Porter Stemming Algorithm to its parameters, returning the stemmed words. It is derived from the C program "stemmer.c" as found in freewais and elsewhere, which contains these notes: Purpose: Implementation of the Porter stem...
ULPFR/perlindex-1.606 - 06 Jan 2013 21:39:59 GMTA pangram is a text that contains every letter of the alphabet. This module provides utilities for identifying pangrams. $pangram->new Constructor. Receives the text that will be analyzed. $pangram->is_pangram Returns true if the supplied text is a p...
JESTER/Text-Pangram-0.01 - 16 Jul 2014 20:59:24 GMTText::DAWG implements implements string set recognition by way of directed acyclic word graphs....
BLGL/Text-DAWG-0.001 - 20 Feb 2011 17:32:11 GMTThis started as a spell and quality check for my novel, "Manuel the Magnificent Mechanical Man". Eventually, it can be used for checking any kind of markdown-formatted text, be it fiction or non-fiction. The first version included as documentation, t...
JMERELO/Test-Text-0.5.0 - 10 Sep 2017 18:11:26 GMTParallel to the idea of JSP and PHP, PORE was designed to separate data from presentation. The data is encapsulated in Perl objects, like Java Beans. The presentation is represented by PORE::Template, like JSP files. PORE consists of a templating lan...
ZTANG/Text-PORE-1.02 - 23 Feb 2004 00:21:00 GMTn-Gram analysis is a field in textual analysis which uses sliding window character sequences in order to aid topic analysis, language determination and so on. The n-gram spectrum of a document can be used to compare and filter documents in multiple l...
AMBS/Text-Ngram-0.15 - 17 Jul 2014 15:52:11 GMTThis modules provides a number of name normalization routines, plus high-level parsing and name comparison utilities such as those illustrated in the synopsis. While it tries to accommodate non-Western names, this module definitely works better with ...
DBOURGET/Text-Names-0.46 - 06 Jan 2017 19:16:44 GMTKAPPA/Text-Hyphen-0.2 - 06 Oct 2015 17:38:03 GMT
Collins English Dictionary defines a "widow" in typesetting as: A short line at the end of a paragraph, especially one that occurs as the top line of a page or column. For example, in the text... How much wood could a woodchuck chuck if a woodchuck c...
DCARDWELL/Text-Widont-0.01 - 01 Apr 2007 14:33:42 GMTThis module implement text n-gram analysis, supporting several types of analysis, including character and word n-grams. The module Text::Ngrams is very flexible. For example, it allows a user to manually feed a sequence of any tokens. It handles seve...
VLADO/Text-Ngrams-2.006 - 12 Feb 2017 15:18:35 GMTThis module provides a Perl interface to the GNU Aspell library. This module is to meet the need of looking up many words, one at a time, in a single session, such as spell-checking a document in memory. The GNU C interface is described at: http://as...
HANK/Text-Aspell-0.09 - 21 Sep 2007 00:30:49 GMTThe "Text::BibTeX" module serves mainly as a high-level introduction to the "Text::BibTeX" library, for both code and documentation purposes. The code loads the two fundamental modules for processing BibTeX files ("Text::BibTeX::File" and "Text::BibT...
AMBS/Text-BibTeX-0.88 - 29 Apr 2019 15:03:09 GMTText::CSV_XS provides facilities for the composition and decomposition of comma-separated values. An instance of the Text::CSV_XS class will combine fields into a "CSV" string and parse a "CSV" string into fields. The module accepts either strings or...
HMBRAND/Text-CSV_XS-1.40 - 15 Sep 2019 12:50:52 GMT"Text::Summarize" contains routines for ranking the sentences in English text for inclusion in a summary using the sumBasic algorithm....
KUBINA/Text-Summarize-0.50 - 12 Mar 2012 17:24:35 GMTText::Toalpha converts arbitary characters into letters. The interface is the functions toalpha($var) and fromalpha($alpha). They do what there names suggest. NOTE: This module does not use a code format used anywhere else. NOTE 2: The code for this ...
SAMUELL/Text-Toalpha-0.02 - 06 May 2005 02:47:05 GMTThe module provides a way to extract shingles from a piece of text. Shingles can then be used for other operations such as clustering, deduplication, etc. Given a document, the w-shingles represent a set of sorted groups of *w* adjacent words in the ...
NIDS/Text-Shingle-0.03 - 07 Jun 2018 10:24:33 GMTSoundex is a phonetic algorithm for indexing names by sound, as pronounced in English. The goal is for names with the same pronunciation to be encoded to the same representation so that they can be matched despite minor differences in spelling. Sound...
RJBS/Text-Soundex-3.05 - 10 Nov 2015 16:52:53 GMT