Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped. River stage one • 1 direct dependent • 5 total dependents

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Trie - A trie-like structure for DZ Interset features and their values. River stage one • 1 direct dependent • 5 total dependents

The "Trie" class defines a trie-like data structure for DZ Interset features and their values. It is an auxiliary data structure that an outside user should not need to use directly. It is used to describe all feature-value combinations that are perm...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Atom - Atomic driver for a surface feature. River stage one • 1 direct dependent • 5 total dependents

Atom is a special case of a tagset driver. As the name suggests, the surface tags are considered atomic, i.e. indivisible. It provides environment for easy mapping between surface strings and Interset features. While Atom can be used to implement dri...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset - The root class for all physical tagsets covered by DZ Interset 2.0. River stage one • 1 direct dependent • 5 total dependents

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. The "Tagset" class is the inheritance root for all classes descr...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Converter - Implements a converter between two physical tagsets via Interset. River stage one • 1 direct dependent • 5 total dependents

"Converter" is a simple class that implements Interset-based conversion of tags between two physical tagsets. It includes caching, which will improve performance when converting tags in a large corpus....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::SimpleAtom - Atomic driver for a surface feature. River stage one • 1 direct dependent • 5 total dependents

SimpleAtom is a special simple case of Lingua::Interset::Atom. Unlike in general Atom, for SimpleAtom there is an *injective* function mapping the surface strings to values of just one Interset feature. This makes defining the decoding and encoding m...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::Conll - Common code for drivers of tagsets from files in CoNLL 2006 format. River stage one • 1 direct dependent • 5 total dependents

Common code for drivers of tagsets from files in the CoNLL 2006 format. These tags always consists of three tab-separated parts: "pos" (from the CoNLL "CPOS" column), "subpos" (from the CoNLL "POS" column), and "features" (from the CoNLL "FEATS" colu...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::OldTagsetDriver - A temporary envelope that provides access to the old (Interset 1.0) drivers from Interset 2.0. River stage one • 1 direct dependent • 5 total dependents

Provides object envelope for an old, non-object-oriented driver from Interset 1.0. This makes the old drivers at least partially usable until they are fully ported to Interset 2.0. Note however that the old drivers use Interset features and/or values...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::Multext - Common code for drivers of tagsets of the Multext-EAST project. River stage one • 1 direct dependent • 5 total dependents

Common code for drivers of tagsets of the Multext-EAST project. All the Multext-EAST tagsets use the same inventory of parts of speech and the same inventory of features (but not all features are used in all languages). Feature values are individual ...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::FeatureStructure - Definition of morphosyntactic features and their values. River stage one • 1 direct dependent • 5 total dependents

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. The "FeatureStructure" class defines all morphosyntactic feature...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::LA::It - Driver for the positional tagset of the Index Thomisticus Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Index Thomisticus Treebank in CoNLL format. The original tags are positional, there are eleven positions. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::SV::Suc - Driver for the Swedish tagset of the Stockholm-Umeå Corpus. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the Swedish tagset of the Stockholm-Umeå Corpus, <http://spraakbanken.gu.se/parole/tags.phtml>. POD ERRORS Hey! The above document had some coding errors, which are explained below: Around line 3: Non-ASCII character seen before =...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::SK::Snk - Driver for the tags of the Slovak National Corpus (Slovenský národný korpus) River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tags of the Slovak National Corpus (Slovenský národný korpus). POD ERRORS Hey! The above document had some coding errors, which are explained below: Around line 3: Non-ASCII character seen before =encoding in '(Slovenský'. Ass...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::CS::Cnk - Driver for the tagset of the Czech National Corpus (Český národní korpus). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset used in the Czech National Corpus (Český národní korpus). The tagset is a slight modification of the tagset used in the Prague Dependency Treebank (see Lingua::Interset::Tagset::CS::Pdt). The only difference is a sixtee...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::CS::Pmk - Driver for the Czech tagset of the Prague Spoken Corpus (Pražský mluvený korpus). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the long tags of the Prague Spoken Corpus (Pražský mluvený korpus, PMK). POD ERRORS Hey! The above document had some coding errors, which are explained below: Around line 3: Non-ASCII character seen before =encoding in '(Pražský'....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::RO::Rdt - Driver for the tagset of the Romanian Dependency Treebank (RDT). River stage one • 1 direct dependent • 5 total dependents

Interset driver for the tagset of the Romanian Dependency Treebank (RDT). The original RDT annotation is *not consistent:* Four of the twenty POS tags and one dependency type appear only in the first 6% of the material, reducing significantly the POS...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::UG::Udt - Driver for the tagset of the Uyghur Dependency Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the part-of-speech tagset of the Uyghur Dependency Treebank....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::NL::Cgn - Driver for the CGN/Lassy/Alpino Dutch tagset. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the CGN/Lassy/Alpino Dutch tagset. Tagset documentation at <http://www.let.rug.nl/~vannoord/Lassy/POS_manual.pdf>....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::CS::Pdt - Driver for the tagset of the Prague Dependency Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the part-of-speech tagset of the Prague Dependency Treebank....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC

Lingua::Interset::Tagset::EN::Penn - Driver for the tagset of the Penn Treebank. River stage one • 1 direct dependent • 5 total dependents

Interset driver for the part-of-speech tagset of the Penn Treebank....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 UTC
79 results (0.044 seconds)