Search results for "module:Lingua::EN::StopWords"
Lingua::EN::StopWords - Typical stop words for an English corpus
See synopsis....
SPLICE/Lingua-EN-Segmenter-0.1 - 03 Mar 2005 03:20:54 UTC
Lingua::StopWords - Stop words for several languages.
In keyword search, it is common practice to suppress a collection of "stopwords": words such as "the", "and", "maybe", etc. which exist in in a large number of documents and do not tell you anything important about any document which contains them. T...
WOLLMERS/Lingua-StopWords-0.12 - 18 Apr 2021 08:32:07 UTC
Lingua::EN::Ngram - Extract n-grams from texts and list them according to frequency and/or T-Score
This module is designed to extract n-grams from texts and list them according to frequency and/or T-Score. To elaborate, the purpose of Lingua::EN::Ngram is to: 1) pull out all of the ngrams (multi-word phrases) in a given text, and 2) list these phr...
EMORGAN/Lingua-EN-Ngram-0.03 - 29 Mar 2018 03:28:09 UTC
Lingua::EN::Bigram - Extract n-grams from a text and list them according to frequency and/or T-Score
This module is designed to: 1) pull out all of the ngrams (multi-word phrases) in a given text, and 2) list these phrases according to their frequency. Using this module is it possible to create lists of the most common phrases in a text as well as o...
EMORGAN/Lingua-EN-Bigram-0.03 - 24 Aug 2010 02:01:46 UTC
Lingua::ZH::Keywords - Extract keywords from Chinese text
This is a very simple algorithm which removes stopwords from the text, and then counts up what it considers to be the most important keywords. The "keywords" subroutine returns a list of keywords in order of relevance. The stopwords list is accessibl...
AUTRIJUS/Lingua-ZH-Keywords-0.04 - 20 Jan 2003 22:42:35 UTC
Lingua::EN::Splitter - Split text into words, paragraphs, segments, and tiles
See synopsis. This module can be used in an object-oriented fashion or the routines can be exported....
SPLICE/Lingua-EN-Segmenter-0.1 - 03 Mar 2005 03:20:54 UTC
Lingua::EN::Keywords - Automatically extracts keywords from text
This is a very simple algorithm which removes stopwords from a summarized version of a text (generated with Lingua::EN::Summarize) and then counts up what it considers to be the most important "keywords". The "keywords" subroutine returns a list of f...
SIMON/Lingua-EN-Keywords-2.0 - 28 Apr 2003 10:23:29 UTC
Lingua::EN::StopWordList - A sorted list of English stop words
"Lingua::EN::StopWordList" is a pure Perl module. It returns a sorted arrayref of 659 English stop words....
RSAVAGE/Lingua-EN-StopWordList-1.02 - 16 Aug 2015 04:55:38 UTC