The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Lingua::Interset::Tagset::CS::Conll - Driver for the Czech tagset of the CoNLL 2006 and 2007 Shared Tasks.

VERSION

version 3.012

SYNOPSIS

  use Lingua::Interset::Tagset::CS::Conll;
  my $driver = Lingua::Interset::Tagset::CS::Conll->new();
  my $fs = $driver->decode("N\tN\tGen=M|Num=S|Cas=1|Neg=A");

or

  use Lingua::Interset qw(decode);
  my $fs = decode('cs::conll', "N\tN\tGen=M|Num=S|Cas=1|Neg=A");

DESCRIPTION

Interset driver for the Czech tagset of the CoNLL 2006 and 2007 Shared Tasks. CoNLL tagsets in Interset are traditionally three values separated by tabs. The values come from the CoNLL columns CPOS, POS and FEAT. For Czech, these values are derived from the tagset of the Prague Dependency Treebank; however, there is an additional surface feature Sem, which is derived from PDT lemmas. Thus this driver extends the cs::pdt driver.

SEE ALSO

Lingua::Interset, Lingua::Interset::Tagset, Lingua::Interset::Tagset::CS::Pdt, Lingua::Interset::Tagset::CS::Conll, Lingua::Interset::FeatureStructure

AUTHOR

Dan Zeman <zeman@ufal.mff.cuni.cz>

COPYRIGHT AND LICENSE

This software is copyright (c) 2017 by Univerzita Karlova (Charles University).

This is free software; you can redistribute it and/or modify it under the same terms as the Perl 5 programming language system itself.