The Perl Toolchain Summit needs more sponsors. If your company depends on Perl, please support this very important event.

Search results for "Lingua::EN"

Lingua::EN::ABC - American, British, and Canadian English River stage zero No dependents

This module offers functions to convert between the spellings and vocabulary of American, British, and Canadian versions of English....

BKB/Lingua-EN-ABC-0.12 - 16 Nov 2021 08:26:14 UTC - Search in distribution

Lingua::EN::Dict - BETA Version of XML english dictionary storage. River stage zero No dependents

Note: BETA VERSION. See main reason for release of this module, three paragraphs down. Description This is a small module I came up with to use as a storage format for my humble attempt at a natural language parser (or a subset of natural language - ...

JBRYAN/Lingua-EN-Dict-0.20 - 07 Oct 2000 03:39:24 UTC - Search in distribution

Lingua::Stem::En - Porter's stemming algorithm for 'generic' English River stage three • 16 direct dependents • 136 total dependents

This routine applies the Porter Stemming Algorithm to its parameters, returning the stemmed words. It is derived from the C program "stemmer.c" as found in freewais and elsewhere, which contains these notes: Purpose: Implementation of the Porter stem...

SNOWHARE/Lingua-Stem-2.31 - 26 Sep 2020 06:27:58 UTC - Search in distribution

Lingua::EN::Ngram - Extract n-grams from texts and list them according to frequency and/or T-Score River stage one • 2 direct dependents • 2 total dependents

This module is designed to extract n-grams from texts and list them according to frequency and/or T-Score. To elaborate, the purpose of Lingua::EN::Ngram is to: 1) pull out all of the ngrams (multi-word phrases) in a given text, and 2) list these phr...

EMORGAN/Lingua-EN-Ngram-0.03 - 29 Mar 2018 03:28:09 UTC - Search in distribution

Lingua::EN::SENNA - Perl wrapper for the SENNA NLP toolkit River stage zero No dependents

This package wraps around and bundles with the SENNA NLP toolkit. SENNA performs sentence-level analysis, hence it expects each inidividual input to be a natural language sentence. Thus, one needs to independently discover sentences, e.g. by using Li...

DGINEV/Lingua-EN-SENNA-0.04 - 02 Jan 2015 12:04:08 UTC - Search in distribution

Lingua::EN::Bigram - Extract n-grams from a text and list them according to frequency and/or T-Score River stage zero No dependents

This module is designed to: 1) pull out all of the ngrams (multi-word phrases) in a given text, and 2) list these phrases according to their frequency. Using this module is it possible to create lists of the most common phrases in a text as well as o...

EMORGAN/Lingua-EN-Bigram-0.03 - 24 Aug 2010 02:01:46 UTC - Search in distribution

Lingua::EN::Fathom - Measure readability of English text River stage one • 1 direct dependent • 1 total dependent

This module analyses English text in either a string or file. Totals are then calculated for the number of characters, words, sentences, blank and non blank (text) lines and paragraphs. Three common readability statistics are also derived, the Fog, F...

KIMRYAN/Lingua-EN-Fathom-1.27 - 10 Aug 2023 23:57:24 UTC - Search in distribution

Lingua::EN::Gender - Inflect pronouns for gender River stage zero No dependents

Small module for inflecting pronouns for a bunch of different genders. Genders currently supported are: neuter male female either spivak splat plural egotistical royal 2nd sie/hir zie/zir...

GOSSAMER/Lingua-EN-Gender-0.02 - 06 Jan 1999 22:13:43 UTC - Search in distribution

Lingua::EN::Tagger - Part-of-speech tagger for English natural language processing. River stage three • 6 direct dependents • 116 total dependents

The module is a probability based, corpus-trained tagger that assigns POS tags to English text based on a lookup dictionary and a set of probability values. The tagger assigns appropriate tags based on conditional probabilities - it examines the prec...

ACOBURN/Lingua-EN-Tagger-0.31 - 12 Jul 2019 15:32:49 UTC - Search in distribution

Lingua::EN::VarCon - Provides access to the VarCon (Variant Conversion Info) River stage zero No dependents

VarCon is a data set provided as part of the Word List project. It contains a number of lists and tables of words that can be used to help automatically convert the English language between the differing spellings of it's local dialects (American, Br...

ADAMK/Lingua-EN-VarCon-1.00 - 01 Oct 2006 04:24:13 UTC - Search in distribution

Lingua::EN::CMUDict - Perl extension for utilizing the CMU dictionary file River stage zero No dependents

This version of the CMU Pronouncing dictionary was generated from the original dictionary and designed to syllabify it. The paper *On the Syllabification of Phonemes* by Susan Bartlett, Grzegorz Kondrak and Colin Cherry (NAACL-HLT 2009) covers the me...

LMETCALF/Lingua-EN-CMUDict-0.07 - 04 Sep 2021 00:10:13 UTC - Search in distribution

Lingua::EN::Inflect - Convert singular to plural. Select "a" or "an". River stage three • 48 direct dependents • 311 total dependents

[*Note:* This module is strictly in maintenance mode now. Take a look at the newer Lingua::EN::Inflexion module, which offers a cleaner and more convenient interface, has many more features (including plural->singular inflexions), and is also much be...

DCONWAY/Lingua-EN-Inflect-1.905 - 08 Dec 2020 19:22:12 UTC - Search in distribution

Lingua::EN::Numbers - turn "407" into "four hundred and seven", etc. River stage two • 15 direct dependents • 18 total dependents

This module provides a function "num2en", which converts a number (such as 123) into English text ("one hundred and twenty-three"). It also provides a function "num2en_ordinal", which converts a number into the ordinal form in words, so 54 becomes "f...

NEILB/Lingua-EN-Numbers-2.03 - 07 Nov 2015 10:08:53 UTC - Search in distribution

Lingua::EN::Opinion - Measure the emotional sentiment of text River stage zero No dependents

A "Lingua::EN::Opinion" object measures the emotional sentiment of text and saves the results in the scores and nrc_scores attributes. When run against the positive and negative classified training reviews in the dataset referenced under "SEE ALSO", ...

GENE/Lingua-EN-Opinion-0.1702 - 22 Oct 2022 05:51:07 UTC - Search in distribution

Lingua::EN::Phoneme - Simple and fast access to cmudict English pronunciation data River stage one • 1 direct dependent • 1 total dependent

"Lingua::EN::Phoneme" provides simple access to the phonemic English data in the Carnegie-Mellon pronouncing dictionary. Unlike "Lingua::Phoneme", it does not require a DBI connection to run, or any setup time on the host computer. The data is suppli...

MARNANEL/Lingua-EN-Phoneme-0.01 - 28 Jan 2009 01:27:08 UTC - Search in distribution

Lingua::EN::Segment - Split English-language domain names etc. into words River stage zero No dependents

Sometimes you have a string that to a human eye is clearly made up of many words glommed together without spaces or hyphens. This module uses some mild cunning and a large list of known words from Google to try and work out how the string should be s...

SKINGTON/Lingua-EN-Segment-0.004 - 29 Feb 2020 15:17:43 UTC - Search in distribution

Lingua::EN::Squeeze - Shorten text to minimum syllables using hash table lookup and vowel deletion River stage zero No dependents

This module squeezes English text to the most compact format possible, so that it is barely readable. Be sure to convert all text to lowercase before using the SqueezeText() for maximum compression, because optimizations have been designed mostly for...

NEILB/Lingua-EN-Squeeze-2020.02 - 20 Feb 2020 13:22:05 UTC - Search in distribution

Lingua::En::Victory - Perl extension for egotistically expressing victory. River stage zero No dependents

I developed a gaming site and got a little tired of reporting the results as "A beat B" so I wrote this module to spice up the results reporting. METHODS new() This must be called first to create a Lingua::En::Victory object for use with the remainin...

TBONE/Lingua-En-Victory-0.01 - 11 Dec 2003 20:14:57 UTC - Search in distribution

Lingua::EN::Keywords - Automatically extracts keywords from text River stage zero No dependents

This is a very simple algorithm which removes stopwords from a summarized version of a text (generated with Lingua::EN::Summarize) and then counts up what it considers to be the most important "keywords". The "keywords" subroutine returns a list of f...

SIMON/Lingua-EN-Keywords-2.0 - 28 Apr 2003 10:23:29 UTC - Search in distribution

Lingua::EN::NameCase - Correctly case a person's name from UPERCASE or lowcase River stage one • 3 direct dependents • 3 total dependents

Forenames and surnames are often stored either wholly in UPPERCASE or wholly in lowercase. This module allows you to convert names into the correct case where possible. Although forenames and surnames are normally stored separately if they do appear ...

NHORNE/Lingua-EN-NameCase-1.21 - 15 Nov 2020 12:40:17 UTC - Search in distribution
292 results (0.598 seconds)