The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Search results for "module:Lingua::EN::StopWords"

Lingua::EN::StopWords - Typical stop words for an English corpus River stage one • 1 direct dependent • 1 total dependent

See synopsis....

SPLICE/Lingua-EN-Segmenter-0.1 - 03 Mar 2005 03:20:54 UTC

lib/Lingua/StopWords/EN.pm River stage two • 15 direct dependents • 32 total dependents

WOLLMERS/Lingua-StopWords-0.12 - 18 Apr 2021 08:32:07 UTC

Lingua::StopWords - Stop words for several languages. River stage two • 15 direct dependents • 32 total dependents

In keyword search, it is common practice to suppress a collection of "stopwords": words such as "the", "and", "maybe", etc. which exist in in a large number of documents and do not tell you anything important about any document which contains them. T...

WOLLMERS/Lingua-StopWords-0.12 - 18 Apr 2021 08:32:07 UTC

Lingua::EN::Ngram - Extract n-grams from texts and list them according to frequency and/or T-Score River stage one • 2 direct dependents • 2 total dependents

This module is designed to extract n-grams from texts and list them according to frequency and/or T-Score. To elaborate, the purpose of Lingua::EN::Ngram is to: 1) pull out all of the ngrams (multi-word phrases) in a given text, and 2) list these phr...

EMORGAN/Lingua-EN-Ngram-0.03 - 29 Mar 2018 03:28:09 UTC

Lingua::EN::Bigram - Extract n-grams from a text and list them according to frequency and/or T-Score River stage zero No dependents

This module is designed to: 1) pull out all of the ngrams (multi-word phrases) in a given text, and 2) list these phrases according to their frequency. Using this module is it possible to create lists of the most common phrases in a text as well as o...

EMORGAN/Lingua-EN-Bigram-0.03 - 24 Aug 2010 02:01:46 UTC

Lingua::ZH::Keywords - Extract keywords from Chinese text River stage zero No dependents

This is a very simple algorithm which removes stopwords from the text, and then counts up what it considers to be the most important keywords. The "keywords" subroutine returns a list of keywords in order of relevance. The stopwords list is accessibl...

AUTRIJUS/Lingua-ZH-Keywords-0.04 - 20 Jan 2003 22:42:35 UTC

Lingua::EN::Splitter - Split text into words, paragraphs, segments, and tiles River stage one • 1 direct dependent • 1 total dependent

See synopsis. This module can be used in an object-oriented fashion or the routines can be exported....

SPLICE/Lingua-EN-Segmenter-0.1 - 03 Mar 2005 03:20:54 UTC

Lingua::EN::Keywords - Automatically extracts keywords from text River stage zero No dependents

This is a very simple algorithm which removes stopwords from a summarized version of a text (generated with Lingua::EN::Summarize) and then counts up what it considers to be the most important "keywords". The "keywords" subroutine returns a list of f...

SIMON/Lingua-EN-Keywords-2.0 - 28 Apr 2003 10:23:29 UTC

Lingua::EN::StopWordList - A sorted list of English stop words River stage one • 2 direct dependents • 3 total dependents

"Lingua::EN::StopWordList" is a pure Perl module. It returns a sorted arrayref of 659 English stop words....

RSAVAGE/Lingua-EN-StopWordList-1.02 - 16 Aug 2015 04:55:38 UTC
9 results (0.045 seconds)