The "Lingua::EN::Sentence" module contains the function get_sentences, which splits text into its constituent sentences, based on a regular expression and a list of abbreviations (built in and given). Certain well know exceptions, such as abbreviatio...
KIMRYAN/Lingua-EN-Sentence-0.31 - 19 Aug 2018 08:28:30 GMTA DTO used by "Lingua::EN::Semtags::Engine". Aggregates instances of "Lingua::EN::Semtags::LangUnit"s. METHODS add_lunit($lunit) Adds $lunit to "$self->{lunits}". lunits() Returns "$self->{lunits}". phrase_tokens() Returns "$self->{phrase_tokens}". R...
IGORM/Lingua-EN-Semtags-Engine-0.01 - 25 Apr 2008 17:48:16 GMTANDREFS/Lingua-EN-Sentence-Offsets-0.03 - 03 Mar 2014 11:40:09 GMT
A DTO used by "Lingua::EN::Semtags::Sentence" and "Lingua::EN::Semtags::LangUnit". METHODS add_isa($lunit) Adds $isa to "$self->{isas}". is_phrase() Returns "true" is this language unit is a phrase. is_word() Returns true if this language unit is a w...
IGORM/Lingua-EN-Semtags-Engine-0.01 - 25 Apr 2008 17:48:16 GMTLingua::Align contains modules for automatic tree alignment based on discriminative classification and alignment inference. More details about the tree aligner can be found in Lingua::Align::Trees. The following gives a general overview and motivatio...
TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 GMTThis module is the main module of the software named YaTeA. It aims at extracting noun phrases that look like terms from a corpus. It provides their syntactic analysis in a head-modifier representation. As an input, the term extractor requires a corp...
THHAMON/Lingua-YaTeA-0.626 - 26 Oct 2018 12:48:02 GMTThis module allows splitting of text paragraphs into sentences. It is based on scripts developed by Philipp Koehn and Josh Schroeder for processing the Europarl corpus (<http://www.statmt.org/europarl/>). The module uses punctuation and capitalizatio...
CAPOEIRAB/Lingua-Sentence-1.100 - 26 Feb 2017 23:06:04 GMTThis is a collection of functions used on the NATools tools. Some of them can be used independently. Check documentation bellow. "init" Use this function to initialize a parallel corpora repository. You must supply a "directory" where the repository ...
AMBS/Lingua-NATools-v0.7.10 - 31 Oct 2015 16:52:31 GMTTo quote the Link Grammar documentation, "the Link Grammar Parser is a syntactic parser of English, based on link grammar, an original theory of English syntax. Given a sentence, the system assigns to it a syntactic structure, which consists of set o...
DBRIAN/Lingua-LinkParser-1.17 - 25 Mar 2014 22:27:08 GMTThis package wraps around and bundles with the SENNA NLP toolkit. SENNA performs sentence-level analysis, hence it expects each inidividual input to be a natural language sentence. Thus, one needs to independently discover sentences, e.g. by using Li...
DGINEV/Lingua-EN-SENNA-0.04 - 02 Jan 2015 12:04:08 GMTThis module analyses English text in either a string or file. Totals are then calculated for the number of characters, words, sentences, blank and non blank (text) lines and paragraphs. Three common readability statistics are also derived, the Fog, F...
KIMRYAN/Lingua-EN-Fathom-1.22 - 31 Oct 2018 21:39:45 GMTA "Lingua::EN::Opinion" object measures the emotional sentiment of text and saves the results in the scores and nrc_scores attributes. When run against the positive and negative classified training reviews in the dataset referenced under "SEE ALSO", ...
GENE/Lingua-EN-Opinion-0.1600 - 22 Sep 2019 15:23:25 GMTThis version of the CMU Pronouncing dictionary was generated from the original dictionary and designed to syllabify it. The paper *On the Syllabification of Phonemes* by Susan Bartlett, Grzegorz Kondrak and Colin Cherry (NAACL-HLT 2009) covers the me...
LMETCALF/Lingua-EN-CMUDict-0.06 - 01 May 2019 00:00:51 GMTThe "Lingua::HE::Sentence" module contains the function get_sentences, which splits Hebrew text into its constituent sentences, based on regular expressions. The module assumes text encoded in UTF-8. Supporting other input formats will be added upon ...
SHLOMOY/Lingua-HE-Sentence-0.13 - 25 Jan 2005 17:59:42 GMTThe "Lingua::DE::Sentence" module contains the function get_sentences, which splits text into its constituent sentences. The result can be either the list of sentences in the text or the list of sentences plus and a list of their absolute positions i...
HOLSTEN/Lingua-DE-Sentence-0.07 - 25 Apr 2003 07:46:43 GMTLingua::EN::Inflexion allows you to correctly inflect all English nouns and verbs, as well as the small number of adjectives and articles that still decline in modern English. By default, the module follows the conventions of modern formal British En...
DCONWAY/Lingua-EN-Inflexion-0.001008 - 12 Mar 2019 01:19:01 GMTThis is a simple module which makes an unscientific effort at summarizing English text. It recognizes simple patterns which look like statements, abridges them, and concatenates them into something vaguely resembling a summary. It needs more work on ...
FIMM/Lingua-EN-Summarize-0.2 - 20 Feb 2001 09:49:48 GMT6 letters in the Esperanto alphabet did not exist in ASCII. Their letters, which have supersigns (eo: supersignoj), are often spelled in substitute notations (eo: surogataj skribosistemoj) for the history, namely, for the ages of typography and typew...
MORIYA/Lingua-EO-Orthography-0.04 - 24 Dec 2013 17:11:43 GMTXERN/Lingua-EN-GeniaTagger-0.01 - 22 Feb 2006 16:30:36 GMT
The Shaw or Shavian alphabet was commissioned by the will of the playwright George Bernard Shaw in the early 1960s as a replacement for the Latin alphabet for representing English. It is designed to have a one-to-one phonemic (not phonetic) mapping w...
MARNANEL/Lingua-EN-Alphabet-Shaw-0.64 - 16 Sep 2010 12:24:57 GMT