The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Treex::Block::W2A::EN::FixTags - Fixes tags for TectoMT purposes.

VERSION

version 0.06903_1

DESCRIPTION

sooner

"sooner" and "later" are always tagged as RBR (comparative adverb) Beware that this goes against Penn Treebank Tagging Guidelines.

"e. g." -> FW (according to PTB Guidelines)
numbers

All numbers (/^\d+$/) get tag CD.

plural abbreviations

Abbreviations like MPs, CDs or DVDs are tagged as plural proper noun (NNPS).

OVERRIDEN METHODS

from Treex::Core::Block

process_anode

AUTHORS

Zdeněk Žabokrtský <zabokrtsky@ufal.mff.cuni.cz>

Martin Popel <popel@ufal.mff.cuni.cz>

COPYRIGHT AND LICENSE

Copyright © 2008 - 2011 by Institute of Formal and Applied Linguistics, Charles University in Prague

This module is free software; you can redistribute it and/or modify it under the same terms as Perl itself.