Lingua::EN::Summarize - A simple tool for summarizing bodies of English text.
This is a simple module which makes an unscientific effort at summarizing English text. It recognizes simple patterns which look like statements, abridges them, and concatenates them into something vaguely resembling a summary. It needs more work on ...FIMM/Lingua-EN-Summarize-0.2 - 20 Feb 2001 09:49:48 UTC - Search in distribution
Lingua::DE::Sentence - Perl extension for tokenizing german texts into their sentences.
The "Lingua::DE::Sentence" module contains the function get_sentences, which splits text into its constituent sentences. The result can be either the list of sentences in the text or the list of sentences plus and a list of their absolute positions i...HOLSTEN/Lingua-DE-Sentence-0.07 - 25 Apr 2003 07:46:43 UTC - Search in distribution
Lingua::LinkParser - Perl module implementing the Link Grammar Parser by Sleator, Temperley and Lafferty at CMU.
To quote the Link Grammar documentation, "the Link Grammar Parser is a syntactic parser of English, based on link grammar, an original theory of English syntax. Given a sentence, the system assigns to it a syntactic structure, which consists of set o...DBRIAN/Lingua-LinkParser-1.17 - 25 Mar 2014 22:27:08 UTC - Search in distribution
Lingua::HE::Sentence - Module for splitting Hebrew text into sentences.
The "Lingua::HE::Sentence" module contains the function get_sentences, which splits Hebrew text into its constituent sentences, based on regular expressions. The module assumes text encoded in UTF-8. Supporting other input formats will be added upon ...SHLOMOY/Lingua-HE-Sentence-0.13 - 25 Jan 2005 17:59:42 UTC - Search in distribution
Lingua::EN::SENNA - Perl wrapper for the SENNA NLP toolkit
This package wraps around and bundles with the SENNA NLP toolkit. SENNA performs sentence-level analysis, hence it expects each inidividual input to be a natural language sentence. Thus, one needs to independently discover sentences, e.g. by using Li...DGINEV/Lingua-EN-SENNA-0.04 - 02 Jan 2015 12:04:08 UTC - Search in distribution
Renard::Block::NLP - Natural language processing for English
17 Oct 2020 08:08:55 UTC
Search in distribution
Lingua::EO::Orthography - A orthography/substitute converter for Esperanto characters
6 letters in the Esperanto alphabet did not exist in ASCII. Their letters, which have supersigns (eo: supersignoj), are often spelled in substitute notations (eo: surogataj skribosistemoj) for the history, namely, for the ages of typography and typew...MORIYA/Lingua-EO-Orthography-0.04 - 24 Dec 2013 17:11:43 UTC - Search in distribution
Text::Corpus::Inspec::Document - Parse Inspec abstract for research.
"Text::Corpus::Inspec::Document" provides methods for accessing specific portions of Inspec abstracts for researching and testing of information processing methods....KUBINA/Text-Corpus-Inspec-1.00 - 09 Dec 2009 03:41:43 UTC - Search in distribution
Lingua::EN::CMUDict - Perl extension for utilizing the CMU dictionary file
This version of the CMU Pronouncing dictionary was generated from the original dictionary and designed to syllabify it. The paper *On the Syllabification of Phonemes* by Susan Bartlett, Grzegorz Kondrak and Colin Cherry (NAACL-HLT 2009) covers the me...LMETCALF/Lingua-EN-CMUDict-0.06 - 01 May 2019 00:00:51 UTC - Search in distribution
Lingua::EN::Opinion - Measure the emotional sentiment of text
A "Lingua::EN::Opinion" object measures the emotional sentiment of text and saves the results in the scores and nrc_scores attributes. When run against the positive and negative classified training reviews in the dataset referenced under "SEE ALSO", ...GENE/Lingua-EN-Opinion-0.1701 - 12 Mar 2021 14:48:26 UTC - Search in distribution
Text::Corpus::VoiceOfAmerica::Document - Parse a VOA article for research.
Text::Categorize::Textrank::En - Find potential keywords in English text.
"Text::Categorize::Textrank::En" provides methods for ranking the words in English text as potential keywords. It implements a version of the textrank algorithm from the report *TextRank: Bringing Order into Texts* by R. Mihalcea and P. Tarau. Encodi...KUBINA/Text-Categorize-Textrank-0.51 - 12 Mar 2012 17:12:15 UTC - Search in distribution
Lingua::Translate::Babelfish - Translation back-end for Altavista's Babelfish, version 0.01
Lingua::Translate::Babelfish is a translation back-end for Lingua::Translate that contacts babelfish.altavisa.com to do the real work. It is normally invoked by Lingua::Translate; there should be no need to call it directly. If you do call it directl...SAMV/Lingua-Translate-0.09 - 23 May 2008 09:02:47 UTC - Search in distribution
Lingua::EN::GeniaTagger - There's no fear with this elegant site scraper
22 Feb 2006 16:30:36 UTC
Search in distribution
This module represents a corpus. Basically, it allows to extract the number of occurrences of a given word or a given word combination in a "fast" way. "fast" hereby means faster than just iterating over the lines and matching patterns. The basic acc...REITER/Lingua-EN-WSD-CorpusBased-0.11 - 03 Oct 2006 17:24:00 UTC - Search in distribution
Lingua::EN::Tokenizer::Offsets - Finds word (token) boundaries, and returns their offsets.
17 Nov 2012 01:45:08 UTC
Search in distribution
Uplug::PreProcess::SentDetect - Moses/Europarl sentence boundary detector
This module is basically a copy of Lingua::Sentence by Achim Ruopp adapted to Uplug which is based on tools developed for Moses and the Europarl corpus. All credits go to the original authors. This version includes some additional non-breaking prefix...TIEDEMANN/uplug-main-0.3.8 - 16 Mar 2013 20:19:32 UTC - Search in distribution
POE::Component::Lingua::Translate - A non-blocking wrapper around Lingua::Translate
POE::Component::Lingua::Translate is a POE component that provides a non-blocking wrapper around Lingua::Translate. It accepts "translate" events and emits "translated" events back....HINRIK/POE-Component-Lingua-Translate-0.06 - 01 Sep 2010 16:52:30 UTC - Search in distribution