Kazuhiro Osawa > Lingua-JA-Summarize-Extract-0.02 > Lingua::JA::Summarize::Extract

Download:
Lingua-JA-Summarize-Extract-0.02.tar.gz

Dependencies

Annotate this POD

CPAN RT

Open  0
View/Report Bugs
Module Version: 0.02   Source  

NAME ^

Lingua::JA::Summarize::Extract - summary generator for Japanese

SYNOPSIS ^

    use strict;
    use warnings;
    use utf8;
    use Lingua::JA::Summarize::Extract;

    my $text = '日本語の文章を適当に書く。';
    my $summary = Lingua::JA::Summarize::Extract->extract($text);

    print $summary->as_string;
    print "$summary";

    # cuts short to 20 length
    $summary->length(20);
    print "$summary";

    # mecab charset
    my $extractor = Lingua::JA::Summarize::Extract->new({ mecab_charset => 'utf8' });

DESCRIPTION ^

Lingua::JA::Summarize::Extract is a summary generator for Japanese text. The extraction method can be changed with the plug-in mechanism.

METHODS ^

new([options])

a object is made by using the options.

extract(text[, options])

text is summarized. blessed by using options if called direct. return to Lingua::JA::Summarize::Extract::ResultSet object.

OPTIONS ^

the content of processing can be changed by passing the constructor the options.

plugins

the processing of split of word and line and the scoring etc. can be done by using another modules. please pass it by the ARRAY reference.

rate

the weight at scoring can be changed.

thing to refer to POD of each plugin when you want to examine other options.

THANKS TO ^

Tatsuhiko Miyagawa

AUTHOR ^

Kazuhiro Osawa <ko@yappo.ne.jp>

SEE ALSO ^

http://gensen.dl.itc.u-tokyo.ac.jp/, http://www.ryo.com/getsen/

LICENSE ^

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

syntax highlighting: