Lingua::StopWords - Stop words for several languages
use Lingua::StopWords; my @words = ...; my $stopwords = Lingua::StopWords::getStopWords('en'); my $stopwords = Lingua::StopWords::EN::getStopWords(); # Print non-stopwords in @words print join ' ', grep { !$stopwords->{$_} } @words;
Stopword list are encoded in UTF8.
The current supported languages are:
English
French
Spanish
Portuguese
Italian
German
Dutch
Swedish
Norwegian
Danish
Russian
Finnish
None by default.
The stopword lists was taken from the http://snowball.tartarus.org/ website.
This POD documentation inspired from the Lingua::EN::StopWords module.
Fabien POTENCIER, <fabpot@cpan.org>
Copyright (C) 2004 by Fabien POTENCIER
This library is free software; you can redistribute it and/or modify it under the same terms as Perl itself, either Perl version 5.8.3 or, at your option, any later version of Perl 5 you may have available.
To install Lingua::StopWords, copy and paste the appropriate command in to your terminal.
cpanm
cpanm Lingua::StopWords
CPAN shell
perl -MCPAN -e shell install Lingua::StopWords
For more information on module installation, please visit the detailed CPAN module installation guide.