Module Version: 2.0

NAME

Lingua::EN::Keywords - Automatically extracts keywords from text

SYNOPSIS

  use Lingua::EN::Keywords;

  my @keywords = keywords($text);

DESCRIPTION

This module uses a very simple algorithm: it removes stopwords from a summarized version of the text (generated with Lingua::EN::Summarize) and then counts up what it considers to be the most important "keywords". The keywords subroutine returns a list of five keywords in order of relevance.

This is pretty dumb. Don't expect any clever document categorization algorithms here, because you won't find them. But it does a reasonable job.
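
To make the approach described above concrete, here is a minimal sketch of the "drop stopwords, then count" idea. It is not the module's actual implementation: it skips the summarization step entirely, the stopword list is a tiny made-up one, and sketch_keywords is a hypothetical helper name used only for illustration.

```perl
use strict;
use warnings;

# Hypothetical, deliberately tiny stopword list (an assumption for this
# sketch, not the list the module uses).
my %STOP = map { $_ => 1 } qw(
    a an and are as at be but by for from in is it of on or that the this to was with
);

# Illustrative helper, not part of Lingua::EN::Keywords: returns up to
# $limit words, most frequent first, ties broken alphabetically.
sub sketch_keywords {
    my ($text, $limit) = @_;
    $limit = 5 unless defined $limit;

    # Lowercase every alphabetic word and count the ones that are not stopwords.
    my %count;
    for my $word (map { lc $_ } ($text =~ /([A-Za-z]+)/g)) {
        next if $STOP{$word};
        $count{$word}++;
    }

    # Rank by descending frequency, then truncate to the requested size.
    my @ranked = sort { $count{$b} <=> $count{$a} or $a cmp $b } keys %count;
    $#ranked = $limit - 1 if @ranked > $limit;
    return @ranked;
}

my @keywords = sketch_keywords("The cat sat on the mat. The cat purred at the other cat.");
print join(", ", @keywords), "\n";
```

Raw frequency is only a rough stand-in for "relevance" here; the real module may weight or filter words differently.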

EXPORT

The keywords subroutine is exported by default.

AUTHOR

Simon Cozens, simon@cpan.org

SEE ALSO

perl(1).
