**Text::Ngram - Ngram analysis of text**

n-Gram analysis is a field in textual analysis which uses sliding window character sequences in order to aid topic analysis, language determination and so on. The n-gram spectrum of a document can be used to compare and filter documents in multiple l...

AMBS/Text-Ngram-0.15 - 17 Jul 2014 15:52:11 UTC**Text::TFIDF::Ngram - Compute the TF-IDF measure for ngram phrases**

This module computes the TF-IDF ("term frequency - inverse document frequency") measure for a corpus of text documents. This module will only work when given more than one document. Because the idf method is computed based on all documents, a single ...

GENE/Text-TFIDF-Ngram-0.0505 - 15 Apr 2020 20:58:43 UTC**Text::Positional::Ngram**

1. Introduction The Text::Positional::Ngram module is a module that allows for the retrieval of variable length ngrams. An ngram is defined as a sequence of 'n' tokens that occur within a window of at leaste 'n' tokens in the text. What constitutes a...

BTMCINNES/Text-Positional-Ngram-0.5 - 28 Aug 2007 19:33:28 UTC**Text::Mining::Algorithm::Ngram - Perl Tools for Text Mining**

ROGERHALL/Text-Mining-0.08 - 15 Mar 2009 17:06:03 UTC

**Text::Ngrams - Flexible Ngram analysis (for characters, words, and more)**

This module implement text n-gram analysis, supporting several types of analysis, including character and word n-grams. The module Text::Ngrams is very flexible. For example, it allows a user to manually feed a sequence of any tokens. It handles seve...

VLADO/Text-Ngrams-2.006 - 12 Feb 2017 15:18:35 UTC**Text::NSP - Extract collocations and Ngrams from text**

The Ngram Statistics Package (NSP) is a collection of perl modules that aid in analyzing Ngrams in text files. We define an Ngram as a sequence of 'n' tokens that occur within a window of at least 'n' tokens in the text; what constitutes a "token" ca...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NGrammer - Pure Perl extraction of n-grams and skip-grams**

The module provides a way to extract both n-grams and skip-grams from a text, a sentence or fro man array of tokens. A n-gram is defines as an ordered sequence of tokens in a piece or text. Some frequent n-grams such as 2-grams, are also called bigra...

NIDS/Text-NGrammer-0.03 - 07 Jun 2018 10:24:22 UTC**Text::Ngramize - Computes lists of n-grams from text.**

"Text::Ngramize" is used to compute the list of n-grams derived from the bytes, characters, or words of the text provided. Methods are included that provide positional information about the n-grams computed within the text....

KUBINA/Text-Ngramize-1.03 - 07 Mar 2010 04:04:11 UTC**Text::NSP::Measures - Perl modules for computing association scores of Ngrams. This module provides the basic framework for these measures.**

Introduction These modules provide perl implementations of mathematical functions (association measures) that can be used to interpret the co-occurrence frequency data for Ngrams. We define an Ngram as a sequence of 'n' tokens that occur within a win...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::SenseClusters - Cluster similar contexts using co-occurrence matrices and Latent Semantic Analysis**

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

**Text::NSP::Measures::4D - Perl module that provides basic framework for building measure of association for 4-grams.**

The methods in this module retrieve observed 4-gram frequency counts and marginal totals, and also compute expected values. They also provide support for error checking of the output produced by count.pl. These methods are used in all the 4-gram (4d)...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::2D - Perl module that provides basic framework for building measure of association for bigrams.**

This module is to be used as a foundation for building 2-dimensional measures of association. The methods in this module retrieve observed bigram frequency counts, marginal totals, and also compute expected values. They also provide error checks for ...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::3D - Perl module that provides basic framework for building measure of association for trigrams.**

The methods in this module retrieve observed trigram frequency counts and marginal totals, and also compute expected values. They also provide support for error checking of the output produced by count.pl. These methods are used in all the trigram (3...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::4D::MI - Perl module that provides error checks and framework to implement Loglikelihood for 4-grams.**

This module is the base class for the Loglikelihood and the True Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computations that are common to these m...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::3D::MI - Perl module that provides error checks and framework to implement Loglikelihood, Total Mutual Information, Pointwise Mutual Information and Poisson Stirling Measure for trigrams.**

This module is the base class for the Loglikelihood and the True Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computations that are common to these m...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::2D::MI - Perl module that provides error checks for Loglikelihood, Total Mutual Information, Pointwise Mutual Information and Poisson-Stirling Measure.**

This module is the base class for the Loglikelihood, Total Mutual Information and the Pointwise Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computat...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::2D::CHI - Perl module that provides error checks for the Pearson's chi squared, phi coefficient and the Tscore measures.**

This module is the base class for the Chi-squared and Phi coefficient measures. This module provides error checks specific for these measures, it also implements the computations that are common to these measures. Pearson's Chi-Squared x2 = 2 * [((n1...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::3D::MI::ll - Perl module that implements Loglikelihood measure of association for trigrams.**

The log-likelihood ratio measures the devitation between the observed data and what would be expected if <word1>, <word2> and <word3> were independent. The higher the score, the less evidence there is in favor of concluding that the words are indepen...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::2D::MI::ps - Perl module that implements Poisson-Stirling measure of association for bigrams.**

The log-likelihood ratio measures the deviation between the observed data and what would be expected if <word1> and <word2> were independent. The higher the score, the less evidence there is in favor of concluding that the words are independent. Assu...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC**Text::NSP::Measures::3D::MI::ps - Perl module that implements Poisson Stirling Measure for trigrams.**

The log-likelihood ratio measures the devitation between the observed data and what would be expected if <word1>, <word2> and <word3> were independent. The higher the score, the less evidence there is in favor of concluding that the words are indepen...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC