Lingua::Treebank - Perl extension for manipulating the Penn Treebank format

This class knows how to read two treebank formats, the Penn format and the Chomsky Normal Form (CNF) format. These formats differ in how they handle terminal nodes. The Penn format places pre-terminal part of speech tags in the left-hand position of ...

KAHN/Lingua-Treebank-0.16 - 28 Aug 2008 20:08:52 GMT

Lingua::Align::Corpus::Treebank - Factory class for reading treebanks

Factory class of modules for reading treebanks in different formats. The default format is the Penn Treebank format. Other supported formats are the format produced by the Berkeley parser, the Stanford parser (including typed dependencies), TigerXML ...

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 GMT

App::lcpan - Manage your local CPAN mirror

PERLANCAR/App-lcpan-0.54 - 09 Oct 2015 04:57:06 GMT

Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped.

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...

ZEMAN/Lingua-Interset-2.050 - 30 Sep 2015 19:17:49 GMT

Text::StemTagPOS - Computes stemmed/POS tagged lists of text.

"Text::StemTagPOS" uses the modules Lingua::Stem::Snowball and Lingua::EN::Tagger to do part-of-speech tagging and stemming of English text. It was developed to pre-process text for other modules. Encoding of all text should be in Perl's internal for...

KUBINA/Text-StemTagPOS-0.61 - 31 Dec 2011 13:41:21 GMT


Each node in the analytical tree is tagged using "Lingua::EN::Tagger" (Penn Treebank POS tags). Because Lingua::EN::Tagger does its own tokenization, it checks whether the tokenization is the same....

VARISD/Treex-EN-0.13095 - 01 Sep 2014 17:58:25 GMT