Lingua::EN::Dict - BETA Version of XML english dictionary storage. River stage zero No dependents

Note: BETA VERSION. See main reason for release of this module, three paragraphs down. Description This is a small module I came up with to use as a storage format for my humble attempt at a natural language parser (or a subset of natural language - ...

JBRYAN/Lingua-EN-Dict-0.20 - 07 Oct 2000 03:39:24 UTC - Search in distribution

Lingua::FR::Ladl - represent, query and investigate the Ladl tables, a french linguistic resource River stage zero No dependents

A namespace for a set of modules to handle the Ladl tables, a french linguistic resource describing syntactico-semantic properties of basic French sentences. Part of them are available as a set of excel tables from <http://ladl.univ-mlv.fr/> The modu...

INGRIF/Lingua-FR-Ladl-v0.0.4 - 19 Apr 2007 12:28:07 UTC - Search in distribution

Lingua::JA::Fold - to fold a Japanese text. River stage one • 1 direct dependent • 1 total dependent

This module is used to fold a Japanese text and so on. The Japanese (the Chinese and the Korean would be the same) text has traditionally unique manner in representing. Basically those characters are used to be represented in two kind of size which i...

HATA/Lingua-JA-Fold-0.08 - 19 Mar 2008 12:52:31 UTC - Search in distribution

Lingua::JA::Kana - Kata-Romaji related utilities River stage one • 1 direct dependent • 1 total dependent

This module is a simple utility to convert katakana, hiragana, and romaji at ease. This module makes use of utf8 semantics which is introduced in Perl 5.8.0 and became stable enough in Perl 5.8.1 so you need Perl 5.8.1 or better. Also note that strin...

DANKOGAI/Lingua-JA-Kana-0.07 - 06 Aug 2012 01:59:05 UTC - Search in distribution

Lingua::JA::Mail - compose mail with Japanese charset River stage zero No dependents

This module is produced mainly for Japanese Perl programmers those who wants to compose an email with Perl extention. For some reasons, most Japanese internet users have chosen ISO-2022-JP 7bit character encoding for email rather than the other 8bit ...

HATA/Lingua-JA-Mail-0.03 - 23 Sep 2005 00:30:53 UTC - Search in distribution

Lingua::JA::Moji - Handle many kinds of Japanese characters River stage one • 4 direct dependents • 5 total dependents

This module provides methods to convert different written forms of Japanese into one another. It enables conversion between romanized Japanese, hiragana, and katakana. It also includes a number of unusual encodings such as Japanese braille and morse ...

BKB/Lingua-JA-Moji-0.59 - 18 Feb 2021 11:27:39 UTC - Search in distribution

Lingua::JA::Yomi - convert English into Japanese katakana River stage zero No dependents

Lingua::JA::Yomi uses a dictionary to convert. The dictionary defaults to partly modified Bilingual Emacspeak Project dictionary...

MASH/Lingua-JA-Yomi-0.01 - 13 Dec 2008 14:43:58 UTC - Search in distribution

Lingua::Stem::Es - Perl Spanish Stemming River stage zero No dependents

This module uses Porter's Stemming Algorithm to return an array reference of stemmed words. The algorithm is implemented as described in: http://snowball.tartarus.org/algorithms/spanish/stemmer.html The interface was made to follow the conventions se...

JFRAIRE/Lingua-Stem-Es-0.04 - 15 Sep 2008 23:01:33 UTC - Search in distribution

Lingua::Stem::Fr - Perl French Stemming River stage three • 1 direct dependent • 126 total dependents

This module use the a modified version of the Porter Stemming Algorithm to return a stemmed words. The algorithm is implemented as described in: http://snowball.tartarus.org/french/stemmer.html with some improvement. The code is carefully crafted to ...

SDP/Lingua-Stem-Fr-0.02 - 27 Apr 2004 06:22:46 UTC - Search in distribution

Lingua::Stem::It - Porter's stemming algorithm for Italian River stage three • 1 direct dependent • 126 total dependents

This module applies the Porter Stemming Algorithm to its parameters, returning the stemmed words. The algorithm is implemented exactly (I hope :-) as described in: http://snowball.tartarus.org/algorithms/italian/stemmer.html The code is carefully cra...

ACALPINI/Lingua-Stem-It-0.02 - 08 Jun 2007 10:08:23 UTC - Search in distribution

Lingua::Stem::Ru - Porter's stemming algorithm for Russian (KOI8-R only) River stage three • 1 direct dependent • 126 total dependents

This module applies the Porter Stemming Algorithm to its parameters, returning the stemmed words. The algorithm is implemented exactly as described in: http://snowball.tartarus.org/algorithms/russian/stemmer.html The code is carefully crafted to work...

NEILB/Lingua-Stem-Ru-0.04 - 12 Feb 2016 22:19:56 UTC - Search in distribution

Lingua::Stem::Uk - Porter's stemming algorithm for Ukrainian River stage zero No dependents

This module applies the Porter Stemming Algorithm to its parameters, returning the stemmed words. The code is carefully crafted to work in conjunction with the Lingua::Stem module by Benjamin Franz. This stemmer is also based on the work of Aldo Capi...

RRVCKU/Lingua-Stem-Uk-0.01 - 17 Oct 2017 22:55:58 UTC - Search in distribution

Lingua::ZH::TaBE - Chinese processing via libtabe River stage one • 1 direct dependent • 1 total dependent

This module is a Perl interface to the TaBE (Taiwan and Big5 Encoding) library, an unified interface and library dealing with Chinese words, phrases, sentences, and phonetic symbols; it is intended to be used as the foundation of Chinese text process...

AUTRIJUS/Lingua-ZH-TaBE-0.07 - 31 Dec 2005 07:37:55 UTC - Search in distribution

Lingua::ZH::Toke - Chinese Tokenizer River stage zero No dependents

This module puts a thin wrapper around Lingua::ZH::TaBE, by blessing refereces to TaBE's objects into its English counterparts. Besides offering more readable class names, this module also offers various overloaded methods for tokenization; please se...

AUTRIJUS/Lingua-ZH-Toke-0.02 - 11 Jan 2004 13:13:35 UTC - Search in distribution

Lingua::ZH::Wrap - Wrap Chinese text River stage one • 1 direct dependent • 1 total dependent

"Lingua::ZH::Wrap::wrap()" is a very simple paragraph formatter. It formats a single paragraph at a time by breaking lines at Chinese character boundries. Indentation is controlled for the first line ($initial_tab) and all subsequent lines ($subseque...

AUTRIJUS/Lingua-ZH-Wrap-0.03 - 25 Jul 2004 16:34:53 UTC - Search in distribution

Task::Lingua::PT - Replacement for the PT NLP tools Bundle River stage zero No dependents

AMBS/Task-Lingua-PT-0.02 - 17 Oct 2010 19:46:04 UTC - Search in distribution

Lingua::DE::ASCII - Perl extension to convert german umlauts to and from ascii River stage one • 1 direct dependent • 1 total dependent

This module enables conversion from and to the ASCII format of german texts. It has two methods: "to_ascii" and "to_latin1" which one do exactly what they say. Please note that both methods take only one scalar as argument and not whole a list. to_as...

BIGJ/Lingua-DE-ASCII-0.14 - 02 May 2020 07:38:52 UTC - Search in distribution

Lingua::EN::Ngram - Extract n-grams from texts and list them according to frequency and/or T-Score River stage one • 2 direct dependents • 2 total dependents

This module is designed to extract n-grams from texts and list them according to frequency and/or T-Score. To elaborate, the purpose of Lingua::EN::Ngram is to: 1) pull out all of the ngrams (multi-word phrases) in a given text, and 2) list these phr...

EMORGAN/Lingua-EN-Ngram-0.03 - 29 Mar 2018 03:28:09 UTC - Search in distribution

Lingua::EN::SENNA - Perl wrapper for the SENNA NLP toolkit River stage zero No dependents

This package wraps around and bundles with the SENNA NLP toolkit. SENNA performs sentence-level analysis, hence it expects each inidividual input to be a natural language sentence. Thus, one needs to independently discover sentences, e.g. by using Li...

DGINEV/Lingua-EN-SENNA-0.04 - 02 Jan 2015 12:04:08 UTC - Search in distribution

Lingua::JA::TFIDF - TF/IDF calculator based on MeCab. River stage one • 2 direct dependents • 2 total dependents

* This software is still in alpha release * Lingua::JA::TFIDF is TF/IDF calculator based on MeCab. It has DF(Document Frequency) data set that was fetched from Yahoo Search API, beforehand....

MIKI/Lingua-JA-TFIDF-0.00004 - 11 Aug 2009 07:39:17 UTC - Search in distribution
647 results (0.191 seconds)