Text::Ngram - Ngram analysis of text River stage two • 4 direct dependents • 10 total dependents

n-Gram analysis is a field in textual analysis which uses sliding window character sequences in order to aid topic analysis, language determination and so on. The n-gram spectrum of a document can be used to compare and filter documents in multiple l...

AMBS/Text-Ngram-0.15 - 17 Jul 2014 15:52:11 UTC

Text::TFIDF::Ngram - Compute the TF-IDF measure for ngram phrases River stage zero No dependents

This module computes the TF-IDF ("term frequency - inverse document frequency") measure for a corpus of text documents. This module will only work when given more than one document. Because the idf method is computed based on all documents, a single ...

GENE/Text-TFIDF-Ngram-0.0505 - 15 Apr 2020 20:58:43 UTC

Text::Positional::Ngram River stage zero No dependents

1. Introduction The Text::Positional::Ngram module is a module that allows for the retrieval of variable length ngrams. An ngram is defined as a sequence of 'n' tokens that occur within a window of at leaste 'n' tokens in the text. What constitutes a...

BTMCINNES/Text-Positional-Ngram-0.5 - 28 Aug 2007 19:33:28 UTC

Text::Mining::Algorithm::Ngram - Perl Tools for Text Mining River stage zero No dependents

ROGERHALL/Text-Mining-0.08 - 15 Mar 2009 17:06:03 UTC

Text::Ngrams - Flexible Ngram analysis (for characters, words, and more) River stage zero No dependents

This module implement text n-gram analysis, supporting several types of analysis, including character and word n-grams. The module Text::Ngrams is very flexible. For example, it allows a user to manually feed a sequence of any tokens. It handles seve...

VLADO/Text-Ngrams-2.006 - 12 Feb 2017 15:18:35 UTC

Text::NSP - Extract collocations and Ngrams from text River stage one • 3 direct dependents • 5 total dependents

The Ngram Statistics Package (NSP) is a collection of perl modules that aid in analyzing Ngrams in text files. We define an Ngram as a sequence of 'n' tokens that occur within a window of at least 'n' tokens in the text; what constitutes a "token" ca...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NGrammer - Pure Perl extraction of n-grams and skip-grams River stage one • 1 direct dependent • 1 total dependent

The module provides a way to extract both n-grams and skip-grams from a text, a sentence or fro man array of tokens. A n-gram is defines as an ordered sequence of tokens in a piece or text. Some frequent n-grams such as 2-grams, are also called bigra...

NIDS/Text-NGrammer-0.03 - 07 Jun 2018 10:24:22 UTC

Text::Ngramize - Computes lists of n-grams from text. River stage zero No dependents

"Text::Ngramize" is used to compute the list of n-grams derived from the bytes, characters, or words of the text provided. Methods are included that provide positional information about the n-grams computed within the text....

KUBINA/Text-Ngramize-1.03 - 07 Mar 2010 04:04:11 UTC

Text::NSP::Measures - Perl modules for computing association scores of Ngrams. This module provides the basic framework for these measures. River stage one • 3 direct dependents • 5 total dependents

Introduction These modules provide perl implementations of mathematical functions (association measures) that can be used to interpret the co-occurrence frequency data for Ngrams. We define an Ngram as a sequence of 'n' tokens that occur within a win...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::SenseClusters - Cluster similar contexts using co-occurrence matrices and Latent Semantic Analysis River stage zero No dependents

TPEDERSE/Text-SenseClusters-1.05 - 03 Oct 2015 14:33:01 UTC

Text::NSP::Measures::4D - Perl module that provides basic framework for building measure of association for 4-grams. River stage one • 3 direct dependents • 5 total dependents

The methods in this module retrieve observed 4-gram frequency counts and marginal totals, and also compute expected values. They also provide support for error checking of the output produced by count.pl. These methods are used in all the 4-gram (4d)...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D - Perl module that provides basic framework for building measure of association for bigrams. River stage one • 3 direct dependents • 5 total dependents

This module is to be used as a foundation for building 2-dimensional measures of association. The methods in this module retrieve observed bigram frequency counts, marginal totals, and also compute expected values. They also provide error checks for ...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D - Perl module that provides basic framework for building measure of association for trigrams. River stage one • 3 direct dependents • 5 total dependents

The methods in this module retrieve observed trigram frequency counts and marginal totals, and also compute expected values. They also provide support for error checking of the output produced by count.pl. These methods are used in all the trigram (3...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::4D::MI - Perl module that provides error checks and framework to implement Loglikelihood for 4-grams. River stage one • 3 direct dependents • 5 total dependents

This module is the base class for the Loglikelihood and the True Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computations that are common to these m...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D::MI - Perl module that provides error checks and framework to implement Loglikelihood, Total Mutual Information, Pointwise Mutual Information and Poisson Stirling Measure for trigrams. River stage one • 3 direct dependents • 5 total dependents

This module is the base class for the Loglikelihood and the True Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computations that are common to these m...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::MI - Perl module that provides error checks for Loglikelihood, Total Mutual Information, Pointwise Mutual Information and Poisson-Stirling Measure. River stage one • 3 direct dependents • 5 total dependents

This module is the base class for the Loglikelihood, Total Mutual Information and the Pointwise Mutual Information measures. All these measure are similar. This module provides error checks specific for these measures, it also implements the computat...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::CHI - Perl module that provides error checks for the Pearson's chi squared, phi coefficient and the Tscore measures. River stage one • 3 direct dependents • 5 total dependents

This module is the base class for the Chi-squared and Phi coefficient measures. This module provides error checks specific for these measures, it also implements the computations that are common to these measures. Pearson's Chi-Squared x2 = 2 * [((n1...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D::MI::ll - Perl module that implements Loglikelihood measure of association for trigrams. River stage one • 3 direct dependents • 5 total dependents

The log-likelihood ratio measures the devitation between the observed data and what would be expected if <word1>, <word2> and <word3> were independent. The higher the score, the less evidence there is in favor of concluding that the words are indepen...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::2D::MI::ps - Perl module that implements Poisson-Stirling measure of association for bigrams. River stage one • 3 direct dependents • 5 total dependents

The log-likelihood ratio measures the deviation between the observed data and what would be expected if <word1> and <word2> were independent. The higher the score, the less evidence there is in favor of concluding that the words are independent. Assu...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

Text::NSP::Measures::3D::MI::ps - Perl module that implements Poisson Stirling Measure for trigrams. River stage one • 3 direct dependents • 5 total dependents

The log-likelihood ratio measures the devitation between the observed data and what would be expected if <word1>, <word2> and <word3> were independent. The higher the score, the less evidence there is in favor of concluding that the words are indepen...

TPEDERSE/Text-NSP-1.31 - 04 Oct 2015 16:42:20 UTC

41 results (0.025 seconds)