The London Perl and Raku Workshop takes place on 26th Oct 2024. If your company depends on Perl, please consider sponsoring and/or attending.

NAME

Lingua::RU::OpenCorpora::Tokenizer::Vectors - represents a file with vectors

DESCRIPTION

This module inherits most of its code from Lingua::RU::OpenCorpora::Tokenizer::List.

The reason to put this code into a separate class is that vectors file has a slightly different format and needs to be processed in a slightly different manner.

METHODS

new([$args])

Constructor.

Takes an optional hashref with arguments:

data_dir

Path to the directory where vectors file is stored. Defaults to distribution directory (see File::ShareDir).

in_list($vector)

Given a vector, checks if there is a probability value defined for it.

Returns probability or undef correspondingly.

SEE ALSO

Lingua::RU::OpenCorpora::Tokenizer::List

Lingua::RU::OpenCorpora::Tokenizer::Updater

Lingua::RU::OpenCorpora::Tokenizer

AUTHOR

OpenCorpora team http://opencorpora.org

LICENSE

This program is free software, you can redistribute it under the same terms as Perl itself.