Lingua::EN::Sentence - split text into sentences River stage one • 7 direct dependents • 8 total dependents

The "Lingua::EN::Sentence" module contains the function get_sentences, which splits text into its constituent sentences, based on a regular expression and a list of abbreviations (built in and given). Certain well know exceptions, such as abbreviatio...

KIMRYAN/Lingua-EN-Sentence-0.31 - 19 Aug 2018 08:28:30 GMT

Lingua::EN::Semtags::Sentence - a DTO used by Lingua::EN::Semtags::Engine River stage zero No dependents

A DTO used by "Lingua::EN::Semtags::Engine". Aggregates instances of "Lingua::EN::Semtags::LangUnit"s. METHODS add_lunit($lunit) Adds $lunit to "$self->{lunits}". lunits() Returns "$self->{lunits}". phrase_tokens() Returns "$self->{phrase_tokens}". R...

IGORM/Lingua-EN-Semtags-Engine-0.01 - 25 Apr 2008 17:48:16 GMT

Lingua::EN::Sentence::Offsets - Finds sentence boundaries, and returns their offsets. River stage zero No dependents

ANDREFS/Lingua-EN-Sentence-Offsets-0.03 - 03 Mar 2014 11:40:09 GMT

Lingua::EN::Semtags::LangUnit - a DTO used by Lingua::EN::Semtags::Sentence River stage zero No dependents

A DTO used by "Lingua::EN::Semtags::Sentence" and "Lingua::EN::Semtags::LangUnit". METHODS add_isa($lunit) Adds $isa to "$self->{isas}". is_phrase() Returns "true" is this language unit is a phrase. is_word() Returns true if this language unit is a w...

IGORM/Lingua-EN-Semtags-Engine-0.01 - 25 Apr 2008 17:48:16 GMT

Lingua::Align - Perl modules for the alignment of parallel corpora River stage zero No dependents

Lingua::Align contains modules for automatic tree alignment based on discriminative classification and alignment inference. More details about the tree aligner can be found in Lingua::Align::Trees. The following gives a general overview and motivatio...

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 GMT

Lingua::YaTeA - Perl extension for extracting terms from a corpus and providing a syntactic analysis in a head-modifier format. River stage one • 2 direct dependents • 2 total dependents

This module is the main module of the software named YaTeA. It aims at extracting noun phrases that look like terms from a corpus. It provides their syntactic analysis in a head-modifier representation. As an input, the term extractor requires a corp...

THHAMON/Lingua-YaTeA-0.626 - 26 Oct 2018 12:48:02 GMT

Lingua::Sentence - Perl extension for breaking text paragraphs into sentences River stage one • 3 direct dependents • 3 total dependents

This module allows splitting of text paragraphs into sentences. It is based on scripts developed by Philipp Koehn and Josh Schroeder for processing the Europarl corpus (<http://www.statmt.org/europarl/>). The module uses punctuation and capitalizatio...

CAPOEIRAB/Lingua-Sentence-1.100 - 26 Feb 2017 23:06:04 GMT

Lingua::NATools - A framework for Parallel Corpora processing River stage zero No dependents

This is a collection of functions used on the NATools tools. Some of them can be used independently. Check documentation bellow. "init" Use this function to initialize a parallel corpora repository. You must supply a "directory" where the repository ...

AMBS/Lingua-NATools-v0.7.10 - 31 Oct 2015 16:52:31 GMT

Lingua::LinkParser - Perl module implementing the Link Grammar Parser by Sleator, Temperley and Lafferty at CMU. River stage one • 1 direct dependent • 1 total dependent

To quote the Link Grammar documentation, "the Link Grammar Parser is a syntactic parser of English, based on link grammar, an original theory of English syntax. Given a sentence, the system assigns to it a syntactic structure, which consists of set o...

DBRIAN/Lingua-LinkParser-1.17 - 25 Mar 2014 22:27:08 GMT

Lingua::EN::SENNA - Perl wrapper for the SENNA NLP toolkit River stage zero No dependents

This package wraps around and bundles with the SENNA NLP toolkit. SENNA performs sentence-level analysis, hence it expects each inidividual input to be a natural language sentence. Thus, one needs to independently discover sentences, e.g. by using Li...

DGINEV/Lingua-EN-SENNA-0.04 - 02 Jan 2015 12:04:08 GMT

Lingua::EN::Fathom - Measure readability of English text River stage one • 1 direct dependent • 1 total dependent

This module analyses English text in either a string or file. Totals are then calculated for the number of characters, words, sentences, blank and non blank (text) lines and paragraphs. Three common readability statistics are also derived, the Fog, F...

KIMRYAN/Lingua-EN-Fathom-1.22 - 31 Oct 2018 21:39:45 GMT

Lingua::EN::Opinion - Measure the emotional sentiment of text River stage zero No dependents

A "Lingua::EN::Opinion" object measures the emotional sentiment of text and saves the results in the scores and nrc_scores attributes. When run against the positive and negative classified training reviews in the dataset referenced under "SEE ALSO", ...

GENE/Lingua-EN-Opinion-0.1600 - 22 Sep 2019 15:23:25 GMT

Lingua::EN::CMUDict - Perl extension for utilizing the CMU dictionary file River stage zero No dependents

This version of the CMU Pronouncing dictionary was generated from the original dictionary and designed to syllabify it. The paper *On the Syllabification of Phonemes* by Susan Bartlett, Grzegorz Kondrak and Colin Cherry (NAACL-HLT 2009) covers the me...

LMETCALF/Lingua-EN-CMUDict-0.06 - 01 May 2019 00:00:51 GMT

Lingua::HE::Sentence - Module for splitting Hebrew text into sentences. River stage zero No dependents

The "Lingua::HE::Sentence" module contains the function get_sentences, which splits Hebrew text into its constituent sentences, based on regular expressions. The module assumes text encoded in UTF-8. Supporting other input formats will be added upon ...

SHLOMOY/Lingua-HE-Sentence-0.13 - 25 Jan 2005 17:59:42 GMT

Lingua::DE::Sentence - Perl extension for tokenizing german texts into their sentences. River stage zero No dependents

The "Lingua::DE::Sentence" module contains the function get_sentences, which splits text into its constituent sentences. The result can be either the list of sentences in the text or the list of sentences plus and a list of their absolute positions i...

HOLSTEN/Lingua-DE-Sentence-0.07 - 25 Apr 2003 07:46:43 GMT

Lingua::EN::Inflexion - Inflect English nouns, verbs, adjectives, and articles River stage one • 6 direct dependents • 6 total dependents

Lingua::EN::Inflexion allows you to correctly inflect all English nouns and verbs, as well as the small number of adjectives and articles that still decline in modern English. By default, the module follows the conventions of modern formal British En...

DCONWAY/Lingua-EN-Inflexion-0.001008 - 12 Mar 2019 01:19:01 GMT

Lingua::EN::Summarize - A simple tool for summarizing bodies of English text. River stage zero No dependents

This is a simple module which makes an unscientific effort at summarizing English text. It recognizes simple patterns which look like statements, abridges them, and concatenates them into something vaguely resembling a summary. It needs more work on ...

FIMM/Lingua-EN-Summarize-0.2 - 20 Feb 2001 09:49:48 GMT

Lingua::EO::Orthography - A orthography/substitute converter for Esperanto characters River stage zero No dependents

6 letters in the Esperanto alphabet did not exist in ASCII. Their letters, which have supersigns (eo: supersignoj), are often spelled in substitute notations (eo: surogataj skribosistemoj) for the history, namely, for the ages of typography and typew...

MORIYA/Lingua-EO-Orthography-0.04 - 24 Dec 2013 17:11:43 GMT

Lingua::EN::GeniaTagger - There's no fear with this elegant site scraper River stage zero No dependents

XERN/Lingua-EN-GeniaTagger-0.01 - 22 Feb 2006 16:30:36 GMT

Lingua::EN::Alphabet::Shaw - transliterate the Latin to Shavian alphabets River stage one • 1 direct dependent • 1 total dependent

The Shaw or Shavian alphabet was commissioned by the will of the playwright George Bernard Shaw in the early 1960s as a replacement for the Latin alphabet for representing English. It is designed to have a one-to-one phonemic (not phonetic) mapping w...

MARNANEL/Lingua-EN-Alphabet-Shaw-0.64 - 16 Sep 2010 12:24:57 GMT

27 results (0.043 seconds)