View on
MetaCPAN
David James > Lingua-EN-Segmenter > Lingua::EN::StopWords

Download:
Lingua-EN-Segmenter-0.1.tar.gz

Dependencies

Annotate this POD

CPAN RT

New  4
Open  0
View/Report Bugs
Module Version: 0.1   Source  

NAME ^

Lingua::EN::StopWords - Typical stop words for an English corpus

SYNOPSIS ^

  use Lingua::EN::StopWords qw(%StopWords);
  
  my @words = ...;
  
  # Print non-stopwords in @words
  print join " ", grep { !$StopWords{$_} } @words; 

DESCRIPTION ^

See synopsis.

AUTHORS ^

David James <splice@cpan.org>

The stopword list was taken from http://www.askeric.org/Eric/Help/stop.shtml (The original stopword list was in the public domain)

SEE ALSO ^

Lingua::EN::Segmenter::TextTiling, Lingua::EN::Segmenter::Baseline, Lingua::EN::Segmenter::Evaluator, http://www.cs.toronto.edu/~james

LICENSE ^

  Copyright (c) 2002 David James
  All rights reserved.
  This program is free software; you can redistribute it and/or
  modify it under the same terms as Perl itself.
syntax highlighting: