Lingua::Interset - DZ Interset is a universal morphosyntactic feature set to which all tagsets of all corpora/languages can be mapped. River stage one • 2 direct dependents • 7 total dependents

DZ Interset is a universal framework for reading, writing, converting and interpreting part-of-speech and morphosyntactic tags from multiple tagsets of many different natural languages. Individual tagsets are mapped to the Interset using specialized ...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::LA::It - Driver for the positional tagset of the Index Thomisticus Treebank. River stage one • 2 direct dependents • 7 total dependents

Interset driver for the tagset of the Index Thomisticus Treebank in CoNLL format. The original tags are positional, there are eleven positions. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::UG::Udt - Driver for the tagset of the Uyghur Dependency Treebank. River stage one • 2 direct dependents • 7 total dependents

Interset driver for the part-of-speech tagset of the Uyghur Dependency Treebank....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::CS::Pdt - Driver for the tagset of the Prague Dependency Treebank. River stage one • 2 direct dependents • 7 total dependents

Interset driver for the part-of-speech tagset of the Prague Dependency Treebank....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::RO::Rdt - Driver for the tagset of the Romanian Dependency Treebank (RDT). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the tagset of the Romanian Dependency Treebank (RDT). The original RDT annotation is *not consistent:* Four of the twenty POS tags and one dependency type appear only in the first 6% of the material, reducing significantly the POS...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::EN::Penn - Driver for the tagset of the Penn Treebank. River stage one • 2 direct dependents • 7 total dependents

Interset driver for the part-of-speech tagset of the Penn Treebank....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::FI::Turku - Driver for the Finnish tagset from the Turku Dependency Treebank. River stage one • 2 direct dependents • 7 total dependents

Interset driver for the Finnish tagset from the Turku Dependency Treebank. Tag is a sequence of features separated by vertical bars. There are just the feature values, not attribute-value pairs....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Atom - Atomic driver for a surface feature. River stage one • 2 direct dependents • 7 total dependents

Atom is a special case of a tagset driver. As the name suggests, the surface tags are considered atomic, i.e. indivisible. It provides environment for easy mapping between surface strings and Interset features. While Atom can be used to implement dri...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::RU::Syntagrus - Driver for Syntagrus (Russian Dependency Treebank) tags. River stage one • 2 direct dependents • 7 total dependents

Interset driver for Syntagrus (Russian Dependency Treebank) tags....

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::LA::Conll - Driver for the tagset of the Latin Dependency Treebank in CoNLL format. River stage one • 2 direct dependents • 7 total dependents

Interset driver for the tagset of the Latin Dependency Treebank in CoNLL format. The original tags are positional, there are nine positions. This driver covers a format that we used in HamleDT processing where the input was first converted to CoNLL. ...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::HU::Conll - Driver for the Hungarian tagset of the CoNLL 2007 Shared Task (derived from the Szeged Treebank). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the Hungarian tagset of the CoNLL 2007 Shared Task. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Hungarian, these values are derived fro...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::FA::Conll - Driver for the tagset of the Persian Dependency Treebank (in the CoNLL-X format). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the tagset of the Persian Dependency Treebank (in the CoNLL-X format). CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. Tagset documentation is ...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::EU::Conll - Driver for the tagset of the Basque Dependency Treebank in the CoNLL format. River stage one • 2 direct dependents • 7 total dependents

Interset driver for the tagset of the Basque Dependency Treebank version 2011 in the CoNLL format. Note that this version of the tagset is slightly different from the Basque data of the CoNLL 2007 Shared Task. For instance, the features now contain f...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::SV::Conll - Driver for the tagset of the Swedish treebank from the CoNLL 2006 Shared Task (Talbanken / Mamba). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the tagset of the Swedish treebank (Talbanken) from the CoNLL 2006 Shared Task. It was derived from the two-letter tags of the Mamba tagset. The sv::conll driver only handles a slight change in formatting. CoNLL tagsets in Interse...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::UR::Conll - Driver for the tagset of the Hyderabad Urdu Treebank, as used in the CoNLL data format. River stage one • 2 direct dependents • 7 total dependents

Interset driver for the tagset of the Urdu treebank from Hyderabad, as used in the CoNLL data format. CoNLL tagsets in Interset are traditionally three values separated by tabs, coming from the CoNLL columns CPOS, POS and FEAT. In the case of Urdu, t...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::JA::Conll - Driver for the Japanese tagset of the CoNLL 2006 Shared Task (derived from the TüBa J/S Verbmobil treebank). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the Japanese tagset of the CoNLL 2006 Shared Task. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Japanese, these values are derived from ...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::SL::Conll - Driver for the Slovene tagset of the CoNLL 2006 Shared Task (derived from the Slovene Dependency Treebank). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the Slovene tagset of the CoNLL 2006 Shared Task. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Slovene, these values are derived from th...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::IT::Conll - Driver for the Italian tagset of the CoNLL 2007 Shared Task (derived from the ISST, Italian Syntactic-Semantic Treebank). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the Italian tagset of the CoNLL 2007 Shared Task. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Italian, these values are derived from th...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::PT::Conll - Driver for the Portuguese tagset of the CoNLL 2006 Shared Task (derived from the Bosque / Floresta sintá(c)tica treebank). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the Portuguese tagset of the CoNLL 2006 Shared Task. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Portuguese, these values are derived f...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

Lingua::Interset::Tagset::ZH::Conll - Driver for the Chinese tagset of the CoNLL 2006 & 2007 Shared Tasks (derived from the Academia Sinica Treebank). River stage one • 2 direct dependents • 7 total dependents

Interset driver for the Chinese tagset of the CoNLL 2006 and 2007 Shared Tasks. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Chinese, these values are deriv...

ZEMAN/Lingua-Interset-3.014 - 31 Jan 2019 13:50:27 GMT

37 results (0.039 seconds)