Lingua::EN::Summarize - A simple tool for summarizing bodies of English text. River stage zero No dependents

This is a simple module which makes an unscientific effort at summarizing English text. It recognizes simple patterns which look like statements, abridges them, and concatenates them into something vaguely resembling a summary. It needs more work on ...

FIMM/Lingua-EN-Summarize-0.2 - 20 Feb 2001 09:49:48 UTC - Search in distribution

Lingua::DE::Sentence - Perl extension for tokenizing german texts into their sentences. River stage zero No dependents

The "Lingua::DE::Sentence" module contains the function get_sentences, which splits text into its constituent sentences. The result can be either the list of sentences in the text or the list of sentences plus and a list of their absolute positions i...

HOLSTEN/Lingua-DE-Sentence-0.07 - 25 Apr 2003 07:46:43 UTC - Search in distribution

Lingua::LinkParser - Perl module implementing the Link Grammar Parser by Sleator, Temperley and Lafferty at CMU. River stage one • 1 direct dependent • 1 total dependent

To quote the Link Grammar documentation, "the Link Grammar Parser is a syntactic parser of English, based on link grammar, an original theory of English syntax. Given a sentence, the system assigns to it a syntactic structure, which consists of set o...

DBRIAN/Lingua-LinkParser-1.17 - 25 Mar 2014 22:27:08 UTC - Search in distribution

Lingua::HE::Sentence - Module for splitting Hebrew text into sentences. River stage zero No dependents

The "Lingua::HE::Sentence" module contains the function get_sentences, which splits Hebrew text into its constituent sentences, based on regular expressions. The module assumes text encoded in UTF-8. Supporting other input formats will be added upon ...

SHLOMOY/Lingua-HE-Sentence-0.13 - 25 Jan 2005 17:59:42 UTC - Search in distribution

Lingua::EN::SENNA - Perl wrapper for the SENNA NLP toolkit River stage zero No dependents

This package wraps around and bundles with the SENNA NLP toolkit. SENNA performs sentence-level analysis, hence it expects each inidividual input to be a natural language sentence. Thus, one needs to independently discover sentences, e.g. by using Li...

DGINEV/Lingua-EN-SENNA-0.04 - 02 Jan 2015 12:04:08 UTC - Search in distribution

Renard::Block::NLP - Natural language processing for English River stage one • 1 direct dependent • 1 total dependent

ZMUGHAL/Renard-Block-NLP-0.001 - 17 Oct 2020 08:08:55 UTC - Search in distribution

Lingua::EO::Orthography - A orthography/substitute converter for Esperanto characters River stage zero No dependents

6 letters in the Esperanto alphabet did not exist in ASCII. Their letters, which have supersigns (eo: supersignoj), are often spelled in substitute notations (eo: surogataj skribosistemoj) for the history, namely, for the ages of typography and typew...

MORIYA/Lingua-EO-Orthography-0.04 - 24 Dec 2013 17:11:43 UTC - Search in distribution

Text::Corpus::Inspec::Document - Parse Inspec abstract for research. River stage zero No dependents

"Text::Corpus::Inspec::Document" provides methods for accessing specific portions of Inspec abstracts for researching and testing of information processing methods....

KUBINA/Text-Corpus-Inspec-1.00 - 09 Dec 2009 03:41:43 UTC - Search in distribution

Lingua::EN::CMUDict - Perl extension for utilizing the CMU dictionary file River stage zero No dependents

This version of the CMU Pronouncing dictionary was generated from the original dictionary and designed to syllabify it. The paper *On the Syllabification of Phonemes* by Susan Bartlett, Grzegorz Kondrak and Colin Cherry (NAACL-HLT 2009) covers the me...

LMETCALF/Lingua-EN-CMUDict-0.06 - 01 May 2019 00:00:51 UTC - Search in distribution

Lingua::EN::Opinion - Measure the emotional sentiment of text River stage zero No dependents

A "Lingua::EN::Opinion" object measures the emotional sentiment of text and saves the results in the scores and nrc_scores attributes. When run against the positive and negative classified training reviews in the dataset referenced under "SEE ALSO", ...

GENE/Lingua-EN-Opinion-0.1701 - 12 Mar 2021 14:48:26 UTC - Search in distribution

Text::Corpus::VoiceOfAmerica::Document - Parse a VOA article for research. River stage zero No dependents

"Text::Corpus::VoiceOfAmerica::Document" provides methods for accessing the content of VOA news articles for the researching and testing of information processing techniques. Read the Voice of America's Terms of Use statement to ensure you abide by i...

KUBINA/Text-Corpus-VoiceOfAmerica-1.03 - 24 Aug 2010 14:15:50 UTC - Search in distribution

Text::Categorize::Textrank::En - Find potential keywords in English text. River stage one • 1 direct dependent • 1 total dependent

"Text::Categorize::Textrank::En" provides methods for ranking the words in English text as potential keywords. It implements a version of the textrank algorithm from the report *TextRank: Bringing Order into Texts* by R. Mihalcea and P. Tarau. Encodi...

KUBINA/Text-Categorize-Textrank-0.51 - 12 Mar 2012 17:12:15 UTC - Search in distribution

Lingua::Translate::Babelfish - Translation back-end for Altavista's Babelfish, version 0.01 River stage one • 7 direct dependents • 8 total dependents

Lingua::Translate::Babelfish is a translation back-end for Lingua::Translate that contacts babelfish.altavisa.com to do the real work. It is normally invoked by Lingua::Translate; there should be no need to call it directly. If you do call it directl...

SAMV/Lingua-Translate-0.09 - 23 May 2008 09:02:47 UTC - Search in distribution

Lingua::EN::GeniaTagger - There's no fear with this elegant site scraper River stage zero No dependents

XERN/Lingua-EN-GeniaTagger-0.01 - 22 Feb 2006 16:30:36 UTC - Search in distribution

Lingua::EN::WSD::CorpusBased::Corpus River stage zero No dependents

This module represents a corpus. Basically, it allows to extract the number of occurrences of a given word or a given word combination in a "fast" way. "fast" hereby means faster than just iterating over the lines and matching patterns. The basic acc...

REITER/Lingua-EN-WSD-CorpusBased-0.11 - 03 Oct 2006 17:24:00 UTC - Search in distribution

Lingua::EN::Tokenizer::Offsets - Finds word (token) boundaries, and returns their offsets. River stage zero No dependents

ANDREFS/Lingua-EN-Tokenizer-Offsets-0.03 - 17 Nov 2012 01:45:08 UTC - Search in distribution

Uplug::PreProcess::SentDetect - Moses/Europarl sentence boundary detector River stage two • 10 direct dependents • 10 total dependents

This module is basically a copy of Lingua::Sentence by Achim Ruopp adapted to Uplug which is based on tools developed for Moses and the Europarl corpus. All credits go to the original authors. This version includes some additional non-breaking prefix...

TIEDEMANN/uplug-main-0.3.8 - 16 Mar 2013 20:19:32 UTC - Search in distribution

POE::Component::Lingua::Translate - A non-blocking wrapper around Lingua::Translate River stage one • 1 direct dependent • 1 total dependent

POE::Component::Lingua::Translate is a POE component that provides a non-blocking wrapper around Lingua::Translate. It accepts "translate" events and emits "translated" events back....

HINRIK/POE-Component-Lingua-Translate-0.06 - 01 Sep 2010 16:52:30 UTC - Search in distribution
38 results (0.056 seconds)