NAME

Search::Fulltext::Tokenizer::Ngram - Character n-gram tokenizer for Search::Fulltext

VERSION

version 0.01

SYNOPSIS

use utf8;
use Search::Fulltext;
use Search::Fulltext::Tokenizer::Bigramm;

my $searcher = Search::Fulltext->new(
    docs => [
        'ハンプティ・ダンプティ 塀の上',
        'ハンプティ・ダンプティ 落っこちた',
        '王様の馬みんなと 王様の家来みんなでも',
        'ハンプティを元に 戻せなかった',
    ],
    tokenizer => q/perl 'Search::Fulltext::Tokenizer::Bigram::get_tokenizer'/,
);
my $hit_document_ids = $searcher->search('ハンプティ');  # [0, 1, 3]

DESCRIPTION

This module provides character N-gram tokenizers for Search::Fulltext.

By default {1,2,3}-gram tokenzers are available.

CREATING A N(> 3)-GRAM TOKENIZER

If you wish to use other N-grams where N > 3, you can create it by inheriting Search::Fulltext::Tokenizer::Ngram:

package My::Tokenizer::42gram;

use parent qw/Search::Fulltext::Tokenizer::Ngram/;

my $iterator_generator = __PACKAGE__->new(42);

sub get_tokenizer {
    sub { $iterator_generator->create_token_iterator(@_) };
}

AUTHOR

Koichi SATOH <sekia@cpan.org>

COPYRIGHT AND LICENSE

This is free software, licensed under:

The MIT (X11) License

1 POD Error

The following errors were encountered while parsing the POD:

Around line 63:: Non-ASCII character seen before =encoding in ''ハンプティ・ダンプティ'. Assuming UTF-8

To install Search::Fulltext::Tokenizer::Ngram, copy and paste the appropriate command in to your terminal.

cpanm

cpanm Search::Fulltext::Tokenizer::Ngram

CPAN shell

perl -MCPAN -e shell
install Search::Fulltext::Tokenizer::Ngram

For more information on module installation, please visit the detailed CPAN module installation guide.

	Global
`s`	Focus search bar
`?`	Bring up this help dialog

	GitHub
`g` `p`	Go to pull requests
`g` `i`	Go to GitHub issues (only if GitHub is preferred repository)

	POD
`g` `a`	Go to author
`g` `c`	Go to changes
`g` `i`	Go to issues
`g` `d`	Go to dist
`g` `r`	Go to repository/SCM
`g` `s`	Go to source
`g` `b`	Go to file browse

	Search terms
module: (e.g. module:Plugin)
distribution: (e.g. distribution:Dancer auth)
author: (e.g. author:SONGMU Redis)
version: (e.g. version:1.00)