The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

NAME

Lingua::TFIDF::WordSegmenter::JA::MeCab - Word segmenter for Japanese documents

VERSION

version 0.01

SYNOPSIS

  use utf8;
  use Lingua::TFIDF::WordSegmenter::JA::MeCab;
  
  my $segmenter = Lingua::TFIDF::WordSegmenter::JA::MeCab->new;
  my $iter = $segmenter->segment('思い出せ、思い出せ 11月5日を...');
  while (defined(my $word = $iter->())) { ... }

DESCRIPTION

This class is a word segmenter for documents written in Japanese.

METHODS

new([ mecab => Text::MeCab->new ])

Constructor.

segment($document | \$document)

Executes word segmentation on given $document and returns an word iterator.

SEE ALSO

Text::MeCab

AUTHOR

Koichi SATOH <sekia@cpan.org>

COPYRIGHT AND LICENSE

This software is Copyright (c) 2014 by Koichi SATOH.

This is free software, licensed under:

  The MIT (X11) License