The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Lingua::TFIDF::WordSegmenter::LetterNgram - Letter N-gram segmenter

VERSION

version 0.01

SYNOPSIS

  use Lingua::TFIDF::WordSegmenter::LetterNgram;
  
  my $segmenter = Lingua::TFIDF::WordSegmenter::LetterNgram->new(n => 2);
  my $iter = $segmenter->segment('ロンドン橋落ちた 落ちた 落ちた...');
  while (defined(my $word = $iter->())) { ... }

DESCRIPTION

This class provides a N-gram word segmenter.

METHODS

new(n => $n)

Constructor.

segment($document | \$document)

Executes word segmentation on given $document and returns an word iterator.

AUTHOR

Koichi SATOH <sekia@cpan.org>

COPYRIGHT AND LICENSE

This software is Copyright (c) 2014 by Koichi SATOH.

This is free software, licensed under:

  The MIT (X11) License