Kazuhiro Osawa
and 1 contributors

NAME

Lingua::JA::Summarize::Extract - summary generator for Japanese

SYNOPSIS

    use strict;
    use warnings;
    use utf8;
    use Lingua::JA::Summarize::Extract;

    my $text = '日本語の文章を適当に書く。';
    my $summary = Lingua::JA::Summarize::Extract->extract($text);

    print $summary->as_string;
    print "$summary";

    # cuts short to 20 length
    $summary->length(20);
    print "$summary";

    # mecab charset
    my $extractor = Lingua::JA::Summarize::Extract->new({ mecab_charset => 'utf8' });

DESCRIPTION

Lingua::JA::Summarize::Extract is a summary generator for Japanese text. The extraction method can be changed with the plug-in mechanism.

METHODS

new([options])

a object is made by using the options.

extract(text[, options])

text is summarized. blessed by using options if called direct. return to Lingua::JA::Summarize::Extract::ResultSet object.

OPTIONS

the content of processing can be changed by passing the constructor the options.

plugins

the processing of split of word and line and the scoring etc. can be done by using another modules. please pass it by the ARRAY reference.

rate

the weight at scoring can be changed.

thing to refer to POD of each plugin when you want to examine other options.

THANKS TO

Tatsuhiko Miyagawa

AUTHOR

Kazuhiro Osawa <ko@yappo.ne.jp>

SEE ALSO

http://gensen.dl.itc.u-tokyo.ac.jp/, http://www.ryo.com/getsen/

LICENSE

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

1 POD Error

The following errors were encountered while parsing the POD:

Around line 86:

Non-ASCII character seen before =encoding in ''日本語の文章を適当に書く。';'. Assuming UTF-8