The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Changes for version 0.08170 - 2012-02-16

  • added Featurama tagger

Modules

abstract ancestor for parallel-corpora document readers
abstract ancestor for parallel-corpora document readers
segment text on new lines
language independent rule based tokenizer
Base tokenizer, splits on whitespaces, fills no_space_after
Rule based pseudo language-independent sentence segmenter
collection of blocks parametrized by language and language independent

Provides

in lib/Treex/Block/W2A/BaseChunkParser.pm