
Lingua::JA::Summarize::Extract::Plugin::Parser::Ngram - a word parser by N-gram

use strict;
use warnings;
use utf8;
use Lingua::JA::Summarize::Extract;
my $text = '';
my $text = 'æ¥æ¬èªã®æç« ãé©å½ã«æ¸ãã';
my $summary = Lingua::JA::Summarize::Extract->extract($text); # default plugin
print "$summary";

parse dose the word by using N-gram. the number of N can be changed by KATAKANA, KANJI, and the Latin character.

latin character
katakana character
kanji character

Kazuhiro Osawa <ko@yappo.ne.jp>

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.