
Lingua::JA::Summarize::Extract::Plugin::Parser::NgramSimple - a word parser by N-gram Simply

use strict;
use warnings;
use utf8;
use Lingua::JA::Summarize::Extract;
my $text = '';
my $text = 'æ¥æ¬èªã®æç« ãé©å½ã«æ¸ãã';
my $summary = Lingua::JA::Summarize::Extract->extract($text, { plugins => [ 'ParserNgramSimple' ], gram => 2 });
print "$summary";

parse dose the word by using N-gram. all the character kinds are similarly treated. the number of N can be changed.


Kazuhiro Osawa <ko@yappo.ne.jp>

This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself.