The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

NAME

Lingua::RU::OpenCorpora::Tokenizer::List - represents a data file

DESCRIPTION

This module provides an API to access files that are used by tokenizer.

It's useful to know that this module actually has 2 versions: the code version and the data version. These versions do not depend on each other.

METHODS

new($name [, $args])

Constructor.

Takes one required argument: list name. List name is one of these: exceptions, prefixes and hyphens.

Optionally you can pass a hashref with additional arguments:

data_dir

Path to the directory where vectors file is stored. Defaults to distribution directory (see File::ShareDir).

in_list($value)

Checks if given value is in the list.

Returns true or false correspondingly.

SEE ALSO

Lingua::RU::OpenCorpora::Tokenizer::Vectors

Lingua::RU::OpenCorpora::Tokenizer::Updater

Lingua::RU::OpenCorpora::Tokenizer

AUTHOR

OpenCorpora team http://opencorpora.org

LICENSE

This program is free software, you can redistribute it under the same terms as Perl itself.