Lingua::Treebank - Perl extension for manipulating the Penn Treebank format

This class knows how to read two treebank formats, the Penn format and the Chomsky Normal Form (CNF) format. These formats differ in how they handle terminal nodes. The Penn format places pre-terminal part of speech tags in the left-hand position of ...

KAHN/Lingua-Treebank-0.16 - 28 Aug 2008 20:08:52 GMT

Lingua::Align::Corpus::Treebank - Factory class for reading treebanks

Factory class of modules for reading treebanks in different formats. The default format is the Penn Treebank format. Other supported formats are the format produced by the Berkeley parser, the Stanford parser (including typed dependencies), TigerXML ...

TIEDEMANN/Lingua-Align-0.04 - 10 Dec 2012 18:31:24 GMT

Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped.

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...

ZEMAN/Lingua-Interset-2.005 - 11 Jul 2014 15:12:15 GMT

Text::StemTagPOS - Computes stemmed/POS tagged lists of text.

"Text::StemTagPOS" uses the modules Lingua::Stem::Snowball and Lingua::EN::Tagger to do part-of-speech tagging and stemming of English text. It was developed to pre-process text for other modules. Encoding of all text should be in Perl's internal for...

KUBINA/Text-StemTagPOS-0.61 - 31 Dec 2011 13:41:21 GMT

Treex::Block::W2A::EN::TagLinguaEn

Each node in the analytical tree is tagged using "Lingua::EN::Tagger" (Penn Treebank POS tags). Because Lingua::EN::Tagger does its own tokenization, it checks whether the tokenization is the same. AUTHORS Tomáš Kraut <kraut@ufal.mff.cuni.cz> COPYRIGHT AND LICENSE Co...

TKR/Treex-EN-0.08171 - 15 Feb 2012 23:55:18 GMT



