Ted Pedersen
and 1 contributor

Documentation

  • CHANGES
  • FAQ
  • INSTALL - Installation instructions for Text-NSP
  • README
  • TODO
  • USAGE
  • combig.pl - Combine frequency counts to determine co-occurrence
  • count.pl - Count the frequency of Ngrams in text
  • count2huge.pl - Convert the output of count.pl to the format of huge-count.pl.
  • find-compounds.pl - Find compound words in a text that are specified in a list.
  • huge-combine.pl - Combine two bigram files created by count.pl into a single file
  • huge-combine3.pl - Combine two trigram files created by count.pl into a single file
  • huge-count.pl - Divide huge text into pieces and run count.pl separately on each (and then combine)
  • huge-count.pl - Count all the bigrams in a huge text without using huge amounts of memory.
  • huge-count3.pl - Divide huge text into pieces and run huge-count3.pl for 3grams separately on each (and then combine)
  • huge-delete.pl - Delete bigrams found by huge-count.pl based on low/high frequency.
  • huge-merge.pl - Merge the results of multiple huge-sort generated files into a single sorted file.
  • huge-sort.pl - Sort a --tokenlist of bigrams from huge-count.pl in alphabetical order.
  • huge-split.pl - Split bigram files from huge-count.pl into pieces.
  • kocos.pl - Find the Kth order co-occurrences of a word
  • rank.pl - Calculate Spearman's Correlation on two ranked lists output by count.pl or statistic.pl
  • sort-bigrams.pl - Sort output from count.pl or statistic.pl in descending order based on frequency or association score
  • sort-trigrams.pl - Sort output from count.pl or statistic.pl in descending order based on frequency or association score
  • split-data.pl - Divide a text file into N approximately equal parts
  • statistic.pl - Measure the association of Ngrams in text

Modules