☺唐鳳☻ > Lingua-ZH-Keywords > Lingua::ZH::Keywords

Download:
Lingua-ZH-Keywords-0.04.tar.gz

Dependencies

Annotate this POD

View/Report Bugs
Module Version: 0.04   Source  

NAME ^

Lingua::ZH::Keywords - Extract keywords from Chinese text

SYNOPSIS ^

    # Exports keywords() by default
    use Lingua::ZH::Keywords;

    print join(",", keywords($text));       # Prints five keywords
    print join(",", keywords($text, 10));   # Prints ten keywords

DESCRIPTION ^

This is a very simple algorithm which removes stopwords from the text, and then counts up what it considers to be the most important keywords. The keywords subroutine returns a list of keywords in order of relevance.

The stopwords list is accessible as @Lingua::ZH::Keywords::StopWords.

If the input $text is an Unicode string, the returned keywords will also be Unicode strings; otherwise they are assumed to be Big5-encoded bytestrings.

SEE ALSO ^

Lingua::ZH::TaBE, Lingua::EN::Keywords

ACKNOWLEDGEMENTS ^

Algorithm adapted from the Lingua::EN::Keywords module by Simon Cozens, <simon@simon-cozens.org<gt>.

AUTHORS ^

Autrijus Tang <autrijus@autrijus.org>

COPYRIGHT ^

Copyright 2003 by Autrijus Tang <autrijus@autrijus.org>.

This program is free software; you can redistribute it and/or modify it under the same terms as Perl itself.

See http://www.perl.com/perl/misc/Artistic.html

syntax highlighting: